Varshul Gupta — LessWrong

Converging to Multi-Modal Generative AI

The field of generative AI is evolving very quickly, with new papers and models arriving every week, ranging from text-to-text (GPT-4, Llama, etc.) and text-to-image (Stable Diffusion, Imagen, etc.) to text-to-speech (Tortoise-tts, Bark, etc.). A common pattern emerges among these papers, and that is...

Sep 11, 2023 · 1

Contextual Translations - Attempt 1

Who doesn’t enjoy something built to their liking? Something custom-made for them and consistent with their personal demands, rather than reliant on an external factor or agency. I think this analogy fits quite well with what we’re aiming for at Dubverse across our entire dubbing tech...

Aug 21, 2023 · -1

Self Supervised Learning (SSL)

"Unlocking Powerful Representations: The Frontier of Self-Supervised Learning" (Jaskaran Singh, Aug 9, 2023). With all that’s been happening in the AI/ML industry for the past few weeks, it is important we address the elephant in the room. The Idea Behind SSL: SSL comes under the...

Aug 10, 2023 · 5

ChatGPT for translation

This post picks up from some of the points mentioned in our Q2’23 work post (and continues our experiments with ChatGPT). One big hurdle we are currently facing is translations not being contextual or vernacular enough. (If you just want to jump to the results: here is...

Aug 2, 2023 · 1

Whisper's Word-Level Timestamps are Out

Hello, fellow tech and language enthusiasts! Today, we embark on a captivating journey into the domain of speech-to-text technology, where Whisper, the remarkable creation from OpenAI, has recently unveiled a notable advancement: word-level timestamps. Now allow me to simplify this for you. Whisper, an impressive automated...

Jul 25, 2023 · -18

Case for Foundation Models beyond English

We live in a world enthralled by technology, so precisely and skilfully woven into the very fabric of our lives that it's impossible to untangle. Each thread of code, each strand of data, is meticulously encoded with language that is then translated into technology that builds the structures of our...

Jul 21, 2023 · 1