ML systems VS Traditional Software

SWE	ML systems
Underlying assumption that code and data are separated	They are part code, part data, and part artifacts created from the two
Only focus on testing and versioning code	Same, but ALSO DATA - version datasets because you need to know which data are more valuable than others (just getting more data blindly often is data poisoning)

Other challenge:

Size of ML models
- 100s to billions of parameters which requires GBs of RAM to load into memory
- Getting these large models in production, especially on edge devices, is a massive engineering challenge.
Speed
- An autocompletion model is useless if it can’t even get autocomplete in seconds
Lack of visibility
- debugging these models… + lack of visibility into their work
- hard to figure out what went wrong

leejunkim