Shafiq Joty
As the development and deployment of large language models (LLMs) accelerate, evaluating model outputs has become increasingly important. The established method of evaluating responses typically involves recruiting and training human evaluators, having them…
Retrieval Augmented Generation (RAG) has not only gained steam as one of the most heavily invested areas of generative AI research but has also attracted considerable popularity and commercialization opportunities. RAG is typically applied…
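For readers unfamiliar with the pattern, a typical RAG pipeline retrieves documents relevant to a query and then conditions generation on them. The sketch below illustrates that flow only; the corpus, the keyword-overlap retriever, and the `generate` stub are all illustrative placeholders, not a specific library's API.

```python
# Minimal sketch of a typical RAG pipeline (illustrative placeholders only).
from typing import List

CORPUS = [
    "Retrieval Augmented Generation grounds LLM outputs in retrieved documents.",
    "XGen-7B supports sequence lengths of up to 8K tokens.",
    "CodeChain revises code by reusing representative sub-modules.",
]

def retrieve(query: str, k: int = 2) -> List[str]:
    """Rank documents by naive keyword overlap with the query (toy retriever)."""
    q_terms = set(query.lower().split())
    scored = sorted(CORPUS, key=lambda d: -len(q_terms & set(d.lower().split())))
    return scored[:k]

def generate(prompt: str) -> str:
    """Stand-in for an LLM call; any hosted or local model would go here."""
    return f"<answer conditioned on:\n{prompt}>"

def rag_answer(query: str) -> str:
    # Retrieve supporting context, then condition generation on it.
    context = "\n".join(retrieve(query))
    prompt = f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    return generate(prompt)

print(rag_answer("What sequence length does XGen-7B support?"))
```

In practice the toy retriever would be replaced by a dense or sparse index over a real document store, but the retrieve-then-generate structure stays the same.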
TL;DR: With CodeChain, a pretrained large language model (LLM) can solve challenging coding problems by incorporating modularity into its generated samples and can self-improve through a chain of self-revisions over representative sub-modules. CodeChain can…
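The self-revision loop the teaser describes can be sketched roughly as follows: sample modular solutions, extract their function-level sub-modules, select representative ones, and feed those back into the prompt for the next revision round. Everything here is a simplified stand-in, not the paper's implementation: `llm` is a stubbed model call, and frequency counting substitutes for the paper's embedding-based clustering of sub-modules.

```python
# Hedged sketch of a CodeChain-style chain of self-revisions.
import re
from collections import Counter

def llm(prompt: str) -> str:
    """Placeholder for a sampled LLM completion (not a real API)."""
    return "def helper():\n    pass\n\ndef solve():\n    helper()"

def extract_submodules(code: str) -> list[str]:
    """Split generated code into function-level sub-modules."""
    return re.findall(r"def \w+\(.*?\):(?:\n    .*)+", code)

def representative(submodules: list[str], k: int = 3) -> list[str]:
    """Pick the most frequent sub-modules as a cheap proxy for the
    paper's embedding-cluster centroids."""
    return [m for m, _ in Counter(submodules).most_common(k)]

def codechain(problem: str, rounds: int = 3) -> str:
    prompt = problem
    best = ""
    for _ in range(rounds):
        samples = [llm(prompt) for _ in range(5)]  # modular candidate solutions
        modules = [m for s in samples for m in extract_submodules(s)]
        # Feed representative sub-modules back for the next revision round.
        reps = representative(modules)
        prompt = problem + "\n\n# Reusable sub-modules:\n" + "\n\n".join(reps)
        best = samples[0]  # real selection would execute candidates on tests
    return best
```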
TL;DR: We trained a series of 7B LLMs named XGen-7B with standard dense attention on sequence lengths of up to 8K for up to 1.5T tokens. We also fine-tuned the models on public-domain…
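Assuming the released checkpoints follow the usual Hugging Face layout (the base 8K model is published under the ID Salesforce/xgen-7b-8k-base), loading XGen-7B is a standard Transformers call; the custom tokenizer requires trust_remote_code=True.

```python
# Loading XGen-7B via Hugging Face Transformers.
# Model ID assumed to be "Salesforce/xgen-7b-8k-base".
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Salesforce/xgen-7b-8k-base"
# The XGen tokenizer is custom, hence trust_remote_code=True.
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.bfloat16)

inputs = tokenizer("The capital of France is", return_tensors="pt")
outputs = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```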