ModernBERT: A Leap Forward in Encoder-Only Models
ModernBERT emerges as a groundbreaking successor to the iconic BERT model, marking a significant leap…
Qwen2.5-1M: Million-Token Context Language Model
The Qwen2.5-1M series are the first open-source Qwen models capable of processing up to 1…
DeepSeek-R1: How Reinforcement Learning is Driving LLM Innovation
DeepSeek-R1 represents a significant advancement in the field of LLMs, particularly in enhancing reasoning capabilities…
NVIDIA Cosmos: A Platform for Building World Foundation Models
NVIDIA Cosmos is a platform that empowers developers to construct customized world models for physical…
World Foundation Models: A New Era of Physical AI
World foundation models (WFMs) bridge the gap between the digital and physical realms. These powerful…
TabPFN: A Foundation Model for Tabular Data
Tabular data, the backbone of countless scientific fields and industries, has long been dominated by…
Phi-4: A Powerful Small Language Model Specialised in Complex Reasoning
Microsoft has released Phi-4, designed to excel in mathematical reasoning and complex problem-solving. Phi-4, with…
SmolAgents: A Simple Yet Powerful AI Agent Framework
SmolAgents is an open-source Python library developed by Hugging Face for building and running powerful…
PromptWizard: LLM Prompts Made Easy
PromptWizard addresses the limitations of manual prompt engineering, making the process faster, more accessible, and…
DSPy: A New Era in Programming Language Models
What is DSPy? Declarative Self-improving Python (DSPy) is an open-source Python framework [paper, GitHub] developed…
Large Concept Models (LCM): A Paradigm Shift in AI
Large Concept Models (LCMs) [paper] represent a significant evolution in NLP. Instead of focusing on…
Top 20 Most Influential AI Research Papers of 2024
Here are 20 influential AI papers from 2024: Mixtral of Experts (Jan 2024) [paper]…
The Future of AI in 2025: Insights and Predictions
As we approach 2025, the landscape of artificial intelligence (AI) is set to undergo significant…
Announcing Llama 3.3: A Smaller, More Efficient LLM
Meta has released Llama 3.3, a new open-source multilingual large language model (LLM). Llama 3.3…
Unlock the Power of AI with Amazon Nova
At the AWS re:Invent conference, Amazon unveiled Amazon Nova, a suite of advanced foundation models…