TLDR AI 2024-03-28

Databricks DBRX MoE model 🌐, Amazon invests $2.75B in Anthropic 💰, binary search vectors 🔍

🚀
Headlines & Launches

Amazon Invests Another $2.75B In Anthropic (2 minute read)

Amazon finalized a $4 billion investment in Anthropic, its largest venture investment yet.

Here’s Why AI Search Engines Really Can’t Kill Google (6 minute read)

Emerging AI-driven search tools challenge Google by offering direct and explorative answers, but struggle to match its speed, diverse functionalities, and efficient data presentation, underscoring the complexity of replacing traditional search with AI.

DBRX MoE (8 minute read)

Databrix and Mosaic have trained a 132B parameter MoE model with impressive performance. They trained the model on 3,000 H100s and have released the weights. The model is also available on the Databricks API.
🧠
Research & Innovation

Audio-Driven Animation (7 minute read)

AniPortrait is a framework designed to create lifelike animated portraits from a single reference image and audio input. By translating audio into 3D representations and then mapping these onto 2D facial landmarks, this method produces animations that excel in natural facial expressions, varied poses, and high visual quality.

Binary Search Vectors (12 minute read)

Searching over embedding vectors is a key to RAG pipelines. If you replace the fp32 numbers with a single 0 or 1, then use a KNN clusterer and reranker, you can maintain performance while shrinking memory requirements 30x.

Deepfake Technology and Detection Methods (24 minute read)

This comprehensive survey delves into the advancements and challenges of deepfake technology and its detection, highlighting the arms race between deepfake creators and those developing technologies to spot them.
👨‍💻
Engineering & Resources

Benchmark LLMs by playing Street Fighter (GitHub Repo)

LLMs are useful in as much as they are fast, accurate, and follow directions. This combination makes a street fighter emulator with text input an excellent way to figure out which models are good at these three criteria.

Boosting Efficiency in Models Without Extra Training (3 minute read)

The OPTIN framework introduces a novel way to enhance the efficiency of transformer-based AI models across various domains without the need for re-training. By using a technique called intermediate feature distillation, OPTIN can compress networks under specific constraints while barely affecting accuracy.

Image Generation with Text and Pose Conditions (4 minute read)

AID and its variant PAID are two techniques designed to improve image interpolation by incorporating conditions like text and poses. These methods ensure the production of images with enhanced consistency, smoothness, and fidelity without requiring additional training.
🎁
Miscellaneous

Inside The Shadowy Global Battle To Control AI (11 minute read)

The world is grappling with the challenge of regulating AI. A series of high-profile meetings and conferences involving global leaders, tech executives, and policymakers revealed divisions and a lack of consensus on how to control this transformative technology.

Build evaluation pipelines on your own data (4 minute read)

New models are regularly released that claim to be the state of the art on standard benchmarks. It is important to measure these models on your own tasks and data. Superpipe is a tool that helps build these evaluation pipelines on your data.

Hackers can read private AI-assistant chats even though they’re encrypted (11 minute read)

Researchers discovered a side-channel attack that can decipher encrypted AI assistant chats with high accuracy on specific topics by exploiting token transmission within the encryption. The attack utilizes large language models to reconstruct token sequences into readable text, potentially exposing sensitive user conversations. Major AI assistants, except for Google Gemini, are vulnerable to this method, prompting providers to seek mitigation strategies.
⚡️
Quick Links

Creatie (Product)

A one-stop product design tool amplified by AI.

OpenAI Is Starting To Test GPT Earning Sharing (1 minute read)

OpenAI is partnering with a small group of US builders to test usage-based GPT earnings.

Nvidia Tops MLPerf’s Inferencing Tests (4 minute read)

Nvidia’s GPUs, particularly the H200, led the MLPerf’s inferencing benchmarks.
The most important AI, ML, and data science news in a free daily email.
Join 500,000 readers for