Blogs | TCET Open Source

This Month in AI - August 2023

September 28, 2023 · 3 min read

Sharukhali Syed

President - Mind Benders

Arya Mane

Publication Head - Mind Benders

Saurabha Sawant

Opensource - Mind Benders

Vrushali Sandam

Technical Head - Mind Benders

Code Llama by Meta¹

Meta, the tech giant formerly known as Facebook, has entered the AI arena with Code Llama. This cutting-edge large language model (LLM) is tailored for coding tasks, capable of generating both code and natural language explanations related to code. With three models available in varying sizes, Code Llama is poised to meet the diverse needs of developers and programmers, making coding more efficient and accessible.

AI2's Dolma Dataset ²

AI2 has made waves by releasing the colossal Dolma dataset, comprising a staggering 3 trillion tokens. What sets Dolma apart is its commitment to transparency. Unlike many other datasets, Dolma provides detailed insights into what information was removed, why it was removed, and how personal data was handled. This transparency underscores the ethical considerations surrounding data acquisition and usage in the AI community.

Dolma

SeamlessM4T by Meta ³

Meta is further extending its AI prowess with the development of SeamlessM4T, a foundational multimodal model for speech translation. A multimodal language model is an advanced artificial intelligence model designed to handle and generate content in multiple modes of communication simultaneously. These modes typically include text, images, and sometimes audio or other sensory data. This powerhouse model can handle an extensive range of text and speech tasks across a staggering 100 languages. SeamlessM4T boasts features such as automatic speech recognition, speech-to-text translation, speech-to-speech translation, text-to-text translation, and text-to-speech translation. This innovation opens up new possibilities for seamless communication and understanding across language barriers.

SeamlessM4T

DeepLearning.AI Course on Finetuning Large Language Models ⁴

In a bid to empower AI professionals, DeepLearning.AI has launched a free course dedicated to "Finetuning Large Language Models." This course equips practitioners with the knowledge and skills needed to harness the potential of finetuning on LLMs. From data preparation to training and evaluation, the course covers the intricacies of customizing models, updating neural network weights, and enhancing results through style, form, and new knowledge incorporation.

DeepLearning.AI

IDEFICS: An Open Reproduction of Visual Language Models ⁵

IDEFICS emerges as an impressive open-source visual language model with 9 billion and 80 billion parameters, drawing inspiration from DeepMind's Flamingo. This versatile model boasts the ability to describe images, generate narratives, and answer image-related questions. Trained on a diverse range of open datasets, including Wikipedia, Public Multimodal Dataset, LAION, and OBELICS, IDEFICS pushes the boundaries of visual AI.

IDEFICS

GPT-3.5 Turbo Fine-Tuning ⁶

OpenAI has unveiled a significant upgrade to its GPT-3.5 Turbo model by introducing fine-tuning. This enhancement promises improved performance on specific tasks, effectively rivaling the capabilities of the base GPT-4. Early testers have achieved remarkable results, reducing prompt size. Notably, the cost structure for training and usage input/output has been detailed at $0.008, $0.012, and $0.016 per 1K tokens, respectively. This advancement underlines the ever-increasing versatility and adaptability of AI models.

GPT-3.5 Turbo

Despite Microsoft's significant investments in AI-driven features like Bing AI Chat and Bing Image Creator, Bing's market share has remained largely stagnant at approximately 3%. While Microsoft disputes this data, experts question whether the missing interactions will significantly impact the overall landscape. This scenario highlights the challenges and competition in the search engine domain driven by AI.

This Month in AI - June 2023

June 30, 2023 · 4 min read

Kunal Agrawal

President - Mind Benders | SSoC'22

Keval Waghate

Publication Head - Mind Benders

Deexith Madas

Treasurer - Mind Benders

Ananta Pandey

Jt. Publication Head - Mind Benders

A robotic hand touching a speck of light

Code Optimization Revolutionized: Google DeepMind's AI Unleashes New Speed-Boosting Technique. ¹

DeepMind's AlphaDev AI achieves groundbreaking speed improvements in sorting algorithms, surpassing existing methods by up to 70%. Its innovative techniques have been adopted by millions of software developers, marking the first integration of AI-discovered algorithms in language updates. DeepMind's gamified approach, using reinforcement learning, trains AlphaDev to construct faster and correct algorithms. This breakthrough revolutionizes code optimization and sets the stage for further AI-driven innovations in computer science.

Elevating the Shopping Experience with AI Virtual Try-On on Google Shopping. ²

Example of Virtual Try of Clothes with AI

Google Shopping has introduced AI Virtual Try-On, a new feature that allows users to virtually try on beauty products before buying them. Using advanced machine learning algorithms and facial recognition technology, the feature provides realistic representations of the products on the user's face. It enhances convenience, addresses concerns about online shopping, and offers a seamless experience. Users can access the feature from Google Search, Google Shopping, or participating retailer websites, making informed decisions with detailed product information and user reviews. This innovation bridges the gap between online and in-store try-ons, revolutionizing the beauty shopping experience.

Ink AI Unveils ChatGPT-Based E-book Generator for Effortless Full-Length E-book Creation. ³

Ink AI has introduced a game-changing e-book generator tool that utilizes ChatGPT, an AI language model, to effortlessly create full-length e-books. By inputting prompts, users receive context-aware responses that form the basis of their e-books, making the writing process faster and more efficient. The tool allows customization of genre, style, and length, and the user-friendly interface simplifies content creation. Ink AI's e-book generator opens up new possibilities for authors, content creators, and publishers by revolutionizing the e-book creation process with AI technology.

Meta Unveils Stablediffusion: A Groundbreaking AI Model for Music Generation. ⁴

Meta has introduced Stablediffusion, an advanced AI model called MusicGen, designed for music generation. Unlike traditional models, Stablediffusion produces stable, coherent, and emotionally engaging musical compositions. Trained on diverse musical genres, the model incorporates stability mechanisms for smooth transitions and consistent structures.

It considers melody, harmonies, rhythms, and tonal variations, resulting in natural and professional-sounding compositions. Stablediffusion offers a powerful tool for musicians, composers, and music enthusiasts, revolutionizing AI-generated music with its stable and artistically satisfying output.

Draggan Goes Open Source: Empowering Developers with Advanced AI Framework. ⁵

Example of DragGAN to change perspective of Lion and more

Draggan, an advanced AI framework focused on reinforcement learning, has been released as open source. This allows developers worldwide to access and utilize its capabilities for training AI models. Draggan simplifies the process with its user-friendly interface, extensive documentation, and pre-built components, enabling faster development and deployment of AI systems. By democratizing access to this powerful tool, the open-source release of Draggan promotes collaboration and accelerates advancements in AI research and application development.

MIT Introduces New Model for Accelerated Drug Discovery. ⁶

MIT researchers have developed a groundbreaking AI model called AccelerateDrug, which revolutionizes the process of drug discovery. The model utilizes advanced machine learning algorithms to rapidly analyze chemical and biological data, predicting the effectiveness of potential drug compounds. By significantly reducing experimentation time and resources, AccelerateDrug streamlines the drug discovery process and expedites the development of new medications.

The model has demonstrated high accuracy and outperformed existing methods in predicting drug efficacy. AccelerateDrug has the potential to accelerate the availability of life-saving treatments, benefiting patients and advancing healthcare outcomes.

Google Introduces AudioPalm: Bridging the Gap between Text and Voice. ⁷

Brief Architecture of AudioPaLM

Google has introduced AudioPalm, an innovative technology that bridges the gap between text and voice. Using advanced AI algorithms, it converts written text into human-like speech and transcribes spoken language into written text accurately. AudioPalm enhances accessibility, user experiences, and content creation, benefiting individuals with visual impairments and those seeking a more immersive interaction.

The technology has applications in education, entertainment, and accessibility services, and it integrates with existing Google services like Google Assistant and Google Translate. Google's AudioPalm represents a significant advancement in natural language processing, enabling seamless conversion between text and voice for enhanced user experiences.

Introduction to APIs: Unlocking the Power of Integration

June 10, 2023 · 4 min read

Himanshu Agarwal

CEO @ TCET Open Source | 2x Kaggle Expert | Software Developer | Data Analyst

Applications and systems rely on smooth communication and data sharing to deliver improved functionality and services in today's interconnected digital environment. Application Programming Interfaces (APIs) are quite important in this situation. APIs serve as mediators, enabling interoperability, data sharing, and communication between various software programmes. This article will provide you a thorough introduction to APIs and their importance in contemporary software development, whether you're a developer, a tech enthusiast, or just interested about the world of APIs.

Code Llama by Meta1​

AI2's Dolma Dataset 2​

SeamlessM4T by Meta 3​

DeepLearning.AI Course on Finetuning Large Language Models 4​

IDEFICS: An Open Reproduction of Visual Language Models 5​

GPT-3.5 Turbo Fine-Tuning 6​

Bing's Market Share Stagnation 7​

Code Optimization Revolutionized: Google DeepMind's AI Unleashes New Speed-Boosting Technique. 1​

Elevating the Shopping Experience with AI Virtual Try-On on Google Shopping. 2​

Ink AI Unveils ChatGPT-Based E-book Generator for Effortless Full-Length E-book Creation. 3​

Meta Unveils Stablediffusion: A Groundbreaking AI Model for Music Generation. 4​

Draggan Goes Open Source: Empowering Developers with Advanced AI Framework. 5​

MIT Introduces New Model for Accelerated Drug Discovery. 6​

Google Introduces AudioPalm: Bridging the Gap between Text and Voice. 7​