All roles

Applied Scientist (LLM)

Remote · USA Full-time New today

Team Summary Our distributed team is looking for an experienced Applied Scientist with a strong background in Large Language models to develop high-performance Generative AI features across Cloud and Edge environments. Job Summary In this role you will drive the transition from research to production by optimizing local inference through model compression and quantization for private, real-time Edge performance, while also engineering scalable RAG architectures and multi-agent systems for Cloud deployment. Your daily responsibilities encompass the full research lifecycle, including formulating hypotheses, generating synthetic datasets, fine-tuning LLMs, and validating safety and alignment, ultimately culminating in technical reports. Responsibilities and Duties Design and implement advanced methods in prompt orchestration, fine-tuning (SFT/RLHF/DPO), and autonomous agentic workflows Curate high-quality training data from large-scale text and multi-modal sources Identify patterns in model hallucinations and visualize evaluation metrics for clear interpretation Tune hyperparameters and improve inference speed/accuracy through PEFT (LoRA/QLoRA) and advanced prompt engineering Collaborate with Product and Data Engineering teams to seamlessly integrate LLM features into the broader ecosystem Track and report progress using industry-standard benchmarks (MMLU, HumanEval, etc.) and custom internal KPIs Stay at the forefront of the field (e.g., State Space Models, new Transformer variants) and evaluate cutting-edge techniques for production readiness Engage in continuous technical growth and mentor junior colleagues to elevate the team's expertise Qualifications and Skills 3+ years of commercial experience in Machine Learning, with a specific focus on the NLP or LLM domain Strong knowledge of Python3, NumPy, pandas, and modern text-processing libraries, PyTorch and Hugging Face (Transformers, PEFT, Accelerate) Proficiency in PEFT/LoRA and Reinforcement Learning techniques Deep understanding of attention mechanisms, tokenization, context window management, and embedding spaces Practical experience in at least one of the following: Retrieval-Augmented Generation (RAG), Fine-tuning, or Agentic frameworks Proven ability to manage and analyze massive datasets (>100GB) across text, image, and audio formats Hands-on experience crafting high-fidelity datasets and building robust data pipelines Expertise in prompt engineering, agentic framework design, and LLM pipeline orchestration Experience deploying LLMs to production environments using Triton Inference Server, vLLM, TGI, or ONNX Good written and spoken English Nice to have Practical experience with Pinecone, Weaviate, Milvus, or Chroma Advanced quantization (GGUF, AWQ, EXL2), pruning, and knowledge distillation Experience with LangChain, LlamaIndex, or AutoGen Basic understanding of web/client-server architecture and streaming API responses (Asyncio, aiohttp) Familiarity with RAGAS, DeepEval, or G-Eval Experience using Docker, Kubernetes, and cloud GPU orchestration (e.g., Run:ai, Lambda Labs) Knowledge of C++, Triton, or CUDA for custom kernel development We offer multiple benefits that include The environment of equal opportunities, transparent and value-based corporate culture and an individual approach to each team member Competitive compensation and perks Gig-contract 21 paid vacation days per year, paid public holidays according to the Ukrainian legislation Development opportunities like corporate courses, knowledge hubs, and free English classes as well as educational leaves Medical insurance is provided from day one. Sick leaves and medical leaves are available Remote working mode is available within Ukraine only Free meals, fruits, and snacks when working in the office. Apply To This Job

Related roles

Area Sales Manager

Remote · USA Full-time

Supply Chain Solutions Lead

Remote · USA Full-time

Senior Technical Program Manager

Remote · USA Full-time

Engineer - New Solution - Clinics / Provider Groups

Remote · USA Full-time

Lead Software Engineer

Remote · USA Full-time

Senior DevEx Engineer

Remote · USA Full-time

Implementation Manager

Remote · USA Full-time

Senior Software Engineer

Remote · USA Full-time

Senior IT Engineer, Field Systems

Remote · USA Full-time

Brand Designer

Remote · USA Full-time

NEW JOB OPENING SOFTWARE ENGINEER - CLOUD AND DATA INFRASTRUCTURE IN REMOTE, USA!

Remote · USA Full-time

Experienced Part-Time Evening Data Entry Specialist – Remote Opportunity at arenaflex

Remote · USA Full-time

Sr. Partner Solutions Architect - Insurance Software, Financial Services

Remote · USA Full-time

Territory Business Manager - Columbus, OH

Remote · USA Full-time

Experienced Part-Time Customer Service Representatives – Remote Work Opportunity in Insurance Support

Remote · USA Full-time

Experienced Remote Customer Support Professional – Work from Home Opportunity with Competitive Hourly Rates up to $35 per Hour at blithequark

Remote · USA Full-time

Experienced Full Stack Data Entry Specialist – Remote Work Opportunity with arenaflex

Remote · USA Full-time

Experienced Full Stack Data Modeler – Web & Cloud Application Development

Remote · USA Full-time

Postal Associate - Great Pay and Benefits

Remote · USA Full-time

Lifecycle & CRM Manager

Remote · USA Full-time