FBK is hiring a Junior Research Engineer in multimodal LLMs, with a focus on speech and video processing. Join a leading research institution in Trentino and contribute to building the next generation of AI technologies for multilingual and multimodal communication.
What to expect
This is a two-year position that gives you the chance to enter one of Italy’s most advanced research environments in AI. You’ll join FBK’s Digital Industry Center and work on multimodal models that connect speech, vision, and language. As a Junior Research Engineer, you’ll contribute to practical model development while learning directly from experienced researchers and engineers who work on multilingual and multimodal systems every day. The role is based in Trento, Northern Italy, within an international setting that values clear communication, collaboration, and steady professional growth. You’ll find flexible working conditions, training opportunities, meal vouchers, wellbeing services, and support with relocation and everyday needs: a solid foundation for building your early career in applied AI.
Position
In this role, you will support the design, training, and fine-tuning of multimodal LLM architectures that integrate spoken language and visual inputs. You’ll experiment with methods to combine external knowledge sources, contribute to model optimization, and run evaluations to test system performance. Your work will include preparing datasets, conducting experiments, analyzing results, and helping move models toward practical use cases. You will collaborate closely with senior researchers and engineers, and you’ll have the chance to contribute to research papers and technical presentations.
Requirements
We are looking for someone who:
- Holds a Master’s degree in Computer Science, Artificial Intelligence, or a related discipline;
- Has a strong foundation in deep learning and experience with neural networks;
- Is familiar with large language models and training / fine-tuning workflows;
- Is proficient in Python and frameworks such as PyTorch or TensorFlow;
- Has a good command of English (spoken and written);
- Demonstrates a motivated, proactive mindset and an eagerness to learn and grow in a research environment.
Preferred Qualifications
These elements will be considered as added value:
- Ability to create, train, and optimize neural networks for multimodal (audio and/or video) processing;
- Experience training or evaluating LLMs;
- Publications in relevant AI conferences or journals.
How to Apply
Submit your application by December 10, 2025 through jobs.fbk.eu, including:
- detailed CV (PDF)
- cover letter explaining your motivation for this position (PDF)
Please read the Recruitment Regulations before applying. For questions or technical issues, contact the People Innovation for Research Department at jobs@fbk.eu.