TLDR AI 2024-05-03

Apple AI features 📱, US newspapers sue OpenAI ⚖️, how AI apps make money 🤑

🚀
Headlines & Launches

Major U.S. newspapers sue OpenAI, Microsoft for copyright infringement (3 minute read)

Eight prominent U.S. newspapers owned by investment giant Alden Global Capital are suing OpenAI and Microsoft for copyright infringement in a complaint filed Tuesday in the Southern District of New York. On top of a similar case filed by the New York Times against both companies, the new suits add heft to publishers' claims. Until now, the Times was the only major newspaper to take legal action against AI firms for copyright infringement.

Friends From the Old Neighborhood Turn Rivals in Big Tech's A.I. Race (6 minute read)

Mustafa Suleyman, co-founder of DeepMind, has been named the chief executive of Microsoft AI. He will contribute to Microsoft's expansion into AI consumer products, while his former colleague and DeepMind's other co-founder, Demis Hassabis, will lead AI research at Google. The journey of these two influential figures reflects the personal and competitive undercurrents driving the race to develop the next major computing platform.

Tim Cook to 'hint' at Apple AI features during iPad launch (1 minute read)

Apple CEO Tim Cook is expected to tease new AI features during the "Let Loose" event next week, with further details to follow at WWDC in June. The new iPad Pro models may receive the M4 chip, hinting at potential advanced AI capabilities in Apple's upcoming products.
🧠
Research & Innovation

Sound Event Detection (8 minute read)

Full-Frequency Dynamic Convolution (FFDConv) is a new method that enhances 2D convolution for sound event detection. By generating unique frequency kernels for each band, FFDConv improves the accuracy of detecting sound events, especially in terms of their frequency characteristics.

Optimizing Vision Transformers for Efficient Deployment (22 minute read)

This study discusses how combining algorithmic adjustments with tailored hardware can enhance ViTs' efficiency, particularly through model quantization.

Advancing Spiking Neural Networks with Self-Supervised Learning (22 minute read)

Spikformer V2 combines the self-attention mechanism with the biological efficiency of Spiking Neural Networks (SNNs). This innovative model uses a Spiking Self-Attention mechanism and a Convolutional Stem, enhancing its ability to process visual features while being energy-efficient.
👨‍💻
Engineering & Resources

Image Segmentation with Adversarial Tuning (4 minute read)

The Segment Anything Model (SAM) from Meta AI, a notable foundation model in computer vision, excels at image segmentation but struggles in certain areas. This project presents ASAM, an advancement on SAM that uses adversarial tuning to boost its performance.

Efficient and High-Quality 3D Rendering (3 minute read)

This project introduces SUNDAE, a new approach that enhances memory efficiency through spectral pruning and neural compensation.

Visual Document Understanding (GitHub Repo)

InstructDr is a model designed to excel in various visual document understanding tasks like question answering and information extraction. InstructDr can adapt to new tasks and datasets by combining document images with large language models, outperforming existing models.
🎁
Miscellaneous

Scaling LLMs to 128K Context Lengths (GitHub Repo)

This study presents a method for extending the context length of language models to 128K tokens, emphasizing the importance of both the quantity and the diversity of training data.

The Great Talent Dividend and NYC's AI Opportunity (7 minute read)

NYC's ascendancy in AI highlights the city's robust talent pool and growth as an AI hub. The NYC tech scene has drawn AI unicorns and tech workers, fueled by resources such as elite universities and a $400 million AI Research Consortium fund.

How AI Apps Make Money (8 minute read)

In recent years, the majority of AI applications have adopted traditional subscription-based pricing models, with a focus on per-user charges, reflecting their role as digital assistants rather than replacements for human workers. Innovative pricing strategies, such as outcome-based models, are emerging among newer AI companies, potentially enhancing customer adoption and revenue by charging only for successful results.
⚡️
Quick Links

Real-Time Interactive Image Creation (GitHub Repo)

StreamMultiDiffusion is a framework that enables real-time region-based text-to-image generation.

Meta plans to build $800 million, next-gen data center in Montgomery (3 minute read)

The $800 million data center is expected to create over 100 jobs and to be operational by the end of 2026.

Microsoft bans US police departments from using enterprise AI tool for facial recognition (2 minute read)

Microsoft updated Azure OpenAI Service's terms to prohibit U.S. police from using its generative AI for facial recognition, clarifying restrictions on law enforcement applications globally.