ModernBERT: A Leap Forward in Encoder-Only Models
ModernBERT emerges as a groundbreaking successor to the iconic BERT model, marking a significant leap forward in the domain of […]
The Qwen2.5-1M series are the first open-source Qwen models capable of processing up to 1 million tokens. This leap in
Qwen2.5-1M: Million-Token Context Language Model
DeepSeek-R1 represents a significant advancement in the field of LLMs, particularly in enhancing reasoning capabilities through reinforcement learning (RL). This
DeepSeek-R1: How Reinforcement Learning is Driving LLM Innovation
NVIDIA Cosmos is a platform that empowers developers to construct customized world models for physical AI systems at scale. It
NVIDIA Cosmos: A Platform for Building World Foundation Models
World foundation models (WFMs) bridge the gap between the digital and physical realms. These powerful neural networks can simulate real-world
World Foundation Models: A New Era of Physical AI
Tabular data, the backbone of countless scientific fields and industries, has long been dominated by gradient-boosted decision trees. However, TabPFN
TabPFN: A Foundation Model for Tabular Data
Microsoft has released Phi-4, designed to excel in mathematical reasoning and complex problem-solving. Phi-4, with only 14 billion parameters, demonstrates
Phi-4: A Powerful Small Language Model Specialised in Complex Reasoning
SmolAgents is an open-source Python library developed by Hugging Face for building and running powerful AI agents with minimal code.
SmolAgents: A Simple Yet Powerful AI Agent Framework
Prompt engineering plays a crucial role in LLM performance. However, manual prompt engineering is a laborious and domain-specific process, demanding
PromptWizard: LLM Prompts Made Easy
What is DSPy? Declarative Self-improving Python (DSPy) is an open-source Python framework [paper, GitHub] developed by researchers at Stanford, designed
DSPy: A New Era In Programming Language Models
Large Concept Models (LCMs) [paper] represent a significant evolution in NLP. Instead of focusing on individual words or subword tokens,
Large Concept Models (LCM): A Paradigm Shift in AI
Here are 20 influential AI papers from 2024: Mixtral of Experts (Jan 2024) [paper] Vision Mamba: Efficient Visual Representation
Top 20 Most Influential AI Research Papers of 2024