Mohit Dulani

Building a real time chat system scaling to millions of users and to agents

Real time chat System Uber handling chats Tech stack GraphQL subscription kafka message queue

Learning Go-lang

Learning GO to get the minimum latency and get things right from start .. Go docs Go Tour The whole file needs to become a package and to make it whole a package we wrap that up in package main and this makes the whole code as a single package that is then converted to binary We create package to help us import those in other files and the importing helps in code seperation, else we would have to write the whole code in a single main.go file.. ...

learning React Native

Building a React Native App Very similar to react.js app , simple libraries , nothing much CONCEPTS : ScrollView vs Flatlist Flatlist is useful when you want to just show a single screen on the scroll ( like reels ) it loads the item present in the list and show them as per scroll, just update the list and your can continue scrolling indefinitely , ScrollView is useful when you dont want to bifurcate between scrolls, and you want to load the whole data, for continous app screen pages like (home screen ) scrollview is useful, else for reels Flatlist is useful ...

Low Rank Adaptation

LoRA This is beneficial only if the rank, r « d, where d is the dimension of the matrix Rank of a matrix No. of Linearly independent rows/ columns are ranks A matrix of size 4 x 5 with rank = 2 , can be broken down to ( 4 x 2 ) x ( 2 x 5) reducing the total no. of params from 20 to 18 And when applied at a large scale for sizes 1024 x 1024 or 2048 x 2048 , and rank = 2 or 4 the size decreases exponentially from (1024 x 1024) to (1024 x 4 x 2) by 128x times ...

SmolVLA paper from HG

SmolVLA Vision language action paper used for training real world Robots for task Input : Images of surrronding, task explanation in text, state ( senserimotor snapshot at a given time ) Sensorimotor states are projected into a single token using a linear layer to align with the token dimension of the language model. Ex: i } n p u t " " " ] s i i s ) m n t = a s a g t t 0 0 0 0 0 # { e r e . . . . . " u " 1 0 8 4 0 : c : 2 0 5 5 , t , , , , i i n 0 m o p - 0 0 . a a n . 1 . . 0 n g " a . 0 1 , y e : r 0 1 2 _ r 5 , , 0 o t " a , . t e P y 0 0 0 h n u ( 0 . . , e s t [ . 0 3 r o 4 0 3 1 r t 4 , , . r , h , 0 e e 0 , l 0 . e # r . 0 v e 3 0 a e d 1 , n . , t g c 0 . u - . s , b 0 0 e e . 0 n [ 2 , s 3 i 2 o , n , 0 r . 2 t 0 0 r 2 h . 0 e 4 e 0 , a , 8 d b , 0 i 2 i . n 2 n 1 0 g 4 " . 0 s ] , 5 , 7 R , G B i # # m # # # a j j g o o g e e e i i r n n n n i d d f t t p - - r p e e o p v e f f m o e r f f s l e e r i o o c c o t c p t t b i i e o o o o t n r r t n i ' s e ( p o s s 1 o r ( . s i c 7 ( 0 i e a ) 7 = t n m ) o i t e p o a r e n t a n i , ( o x n 0 , . ( 0 y q = , u c a l z t o ) e s r e n d i ) o n ) Output from Action expert that predict what action it should take next: Flow matching is a way to train the action expert so it can generate smooth, realistic action sequences quickly and efficiently. ...

Thinking and fiddling out with random ideas

Learning new ideas and fiddling with them When you visit a site what happens ? *The html , js , css gets transferred to the browser using http request and then the chrome engine runs that and display the content How does image upscaling works? How does real time stt works ? We store the 500ms chunk in /tmp file , send it to transcribe and then repeat this until we close it .. or a VAD (voice activity detection) is used to check till when the voice is detected .. ...

Dual booting arch setup with windows

So now we have arch installed and need to dual boot with windows ( obviously for nvidia broadcast to work and cracked premiere pro to work there, wine just doesnt work for me ) Partition The first role is to make a partition , a virtual partition in your drive so that we can install windows there .. How to make a partition ? if you want to part a drive that is mounted ( means that you are using it or is accessible to you ) then it cant be done as you need to unmount it first to create a partition .. ...

Tech terminologies that makes you a pro-pro developer

Serialization : To convert / store data in a compatible format that can be used later. Means lets say I want to store a list of dictionary [{'key1' : 'val1'},{'key2' : 'val2'}] in redis .. so there is not data structure supported in redis for this so only way to store this is to serialize it into bytes to store and then to retrive back we need to deserialize this .. How to do this : json.dumps() : convert the dictionary to json string then store it in bytes json.loads() : converts back the serialized object to the original state by deserialization ...

Learning F&O and game theory

F&O learning Derivatives Includes Futures and Options Financial Instruments Nifty 50, Nifty Bank , Stock Options , Commodities , Soybean Futures etc Index h , joh ki top 50 indian companies ko track krta h .. Ye apne aap m khuch nhi hota toh isko khareed bech nhi skte nifty m trading krne ka matlab hota h , ki nifty k futures m trade krna Futures No premium , just have to pay the margin 3 month in future , last thursday of each month ka dekha jata h aka near future and far-away future. Lot m kaam krta h ,multi-day kr skte ho and isme short kr skte ho Last thrusday of that month ko expire hoga mtlb agar aapne apna balance khtm nhi kiya uss din tk toh apka broker automatically apko uss din nikal dega .. Futures m you can go either go long or go short Long position : means you bullish on the stock price that it will go up Short position : means you are bearish that stock price will go down in future Already leveraged prodcut h … ...

Learning and Making CUDA kernels as a future skill

THIS IS A SKILL OF FUTURE CUDA DOCS How do GPU work by geohotz starts at 20:00 Terminologies : Warp : grp of 32 threads scheduled and working on the same streaming multiprocessor SIMT : single instruction , multiple threads Lockstep : all threads starts in parallel / works at the same time SM : Streaming Multiprocessor ( CPU aka processor , single processor , useful for all tasks … whereas .. gpu only for intensive application handed over to it by cpu ) Stream : like event loop in CPU used for cuda async operations PTX : parallel thread executions , has a seperate ISA used for cuda code and runs on GPU ...

Learning Celery

Background task Maintaining background tasks specially in python django is damn tough, so we to think spawning some daemon threads and getting that solved is very tough .. so rather we should use celery that starts a complete new process and doesnt maintain any-thing inside main process Celery What is the use of celery ? and its comparisons asyncio.create_task() : This creates and executes task in event loop ( efficient management of event loop ) ...

Editing the node modules and github commands

A noob method of playing with Node modules Steps to follow to update code from a node_module like d3-force directly !! So what we get in a node_modules are the build folder that dont support HRM (hot reload module) so we have to make changes and then build that folder using the rollup module ( maybe we can use other libs also !!) Steps: Make that change in the file that you want !!(be it debugging statements or logic change ) Install the required files, using npm install and if required change the package manager from yarn to npm Then lookup to package.json and you might find a function named “prepublishOnly” Run the command npm run prepublishOnly, If the command requires the some folder , build those or remove those folders from the npm command !! Then once you fix the errors , got the errors solved , then we build this file using npm run prepublishOnly Then run the development server using the npm run dev -- --cache This is how to rebuild a node-module !! ...