TLDR AI 2024-05-07

OpenAI & Stack Overflow 🀝, Elon’s plan for AI news πŸ“°, prompt engineering is dead πŸ’€

πŸš€
Headlines & Launches

Stack Overflow and OpenAI partnership (6 minute read)

Stack Overflow, a popular programming website, and OpenAI are partnering to provide a data API for OpenAI customers to retrieve real time and vetted data.

Elon Musk's Plan For AI News (4 minute read)

Elon Musk plans to enhance X's AI, Grok, to merge live news with social media commentary to provide updates and citations in real time. Grok will generate news summaries from user discussions on X, focusing on engagement and accuracy. The project faces challenges with proper citation and legal concerns.
🧠
Research & Innovation

Unsloth.ai: Easily finetune & train LLMs (58 minute read)

Video from the founder of Unsloth on how its team uses PyTorch, writes their kernels, and designs their API surface. Unsloth's framework and library are extremely powerful and easy to use.

Improving Heterogeneous Graph Neural Networks (18 minute read)

SlotGAT is a new approach that improves heterogeneous graph neural networks by addressing the semantic mixing issue in traditional message passing.

Deepfake Detection Using Masked Image Modeling (6 minute read)

This new method detects deepfakes by focusing on masked image modeling, especially in the frequency domain. The approach differs from traditional methods and shows significant improvement in identifying synthetic images, even from new AI generative techniques.
πŸ‘¨β€πŸ’»
Engineering & Resources

Hugging Face robotics library (GitHub Repo)

The team at Hugging Face has released a new project that nicely packages common tools needed for robotics development.

Enhancing Visual Abilities with Morph-Tokens (GitHub Repo)

Researchers have developed "Morph-Tokens" to improve AI's visual understanding and image generation capabilities. These tokens transform abstract concepts used for comprehension into detailed visuals for image creation, leveraging the advanced processing power of the MLLM framework.

Meet Vibe-Eval: Evaluating Multimodal Chat Models (GitHub Repo)

Vibe-Eval is a newly launched benchmark designed to test multimodal chat models with 269 visual understanding prompts, including 100 particularly challenging ones.
🎁
Miscellaneous

AI Prompt Engineering Is Dead (6 minute read)

Automating prompt optimization for AI models suggests a future where manual prompt engineering may become obsolete, pointing towards more efficient, model-driven methods of generating effective prompts.

Limits of Vision-Language Models in Visual Reasoning (GitHub Repo)

Vision-Language Models like GPT-4V are advancing rapidly in understanding and interacting with images and text. A recent study uncovers their significant limitations in visual deductive reasoning. Researchers tested these models using complex visual puzzles, like those found in IQ tests, and discovered that they struggle with multi-step reasoning and recognizing abstract patterns.

DeepSeek-V2 (Hugging Face Hub)

DeepSeek has released a 200B+ parameter model with 21B active parameters. It performs extremely well on code and reasoning. It's not clear if it is overall better than Llama 3 70B, but it is a welcome addition to the open model ecosystem.
⚑️
Quick Links

State of AI in 13 Charts (5 minute read)

A new report by Stanford tracking worldwide AI trends.

Cohere's Command R Model Family is Now Available In Amazon Bedrock (3 minute read)

Command R and R+ are state-of-the-art LLMs optimized for long context tasks and excel at key business-critical capabilities.

Microsoft says it did a lot for responsible AI in inaugural transparency report (2 minute read)

Microsoft's Responsible AI Transparency Report highlights its advancement in deploying AI responsibly in 2023, including the creation of 30 AI tools and implementing safety measures.
The most important AI, ML, and data science news in a free daily email.
Join 500,000 readers for