- https://glinden.blogspot.com/2006/02/early-amazon-splitting-website.html
- https://tuftsdev.github.io/WebEngineering/spring2017/notes/ebay2006.pdf
- https://youtu.be/E91oEn1bnXM
- https://youtu.be/qsan-GQaeyk?t=415
- https://youtu.be/21EiKfQYZXc
- https://projecteuclid.org/journals/statistical-science/volume-16/issue-3/Statistical-Modeling--The-Two-Cultures-with-comments-and-a/10.1214/ss/1009213726.full
- https://agilemanifesto.org/
- https://en.wikipedia.org/wiki/Timeline_of_Amazon_Web_Services
- https://www.researchgate.net/publication/220101183_Performance_Study_of_the_First_Three_Intel_Multicore_Processors
- https://scipy.github.io/old-wiki/pages/History_of_SciPy
- https://www.iinterchange.com/is-moores-law-dead/
- https://static.googleusercontent.com/media/research.google.com/en//archive/mapreduce-osdi04.pdf
- https://papers.nips.cc/paper/2012/file/c399862d3b9d6b76c8436e924a68c45b-Paper.pdf
- https://allennlp.org/elmo
- https://arxiv.org/abs/1712.01208
- https://karpathy.medium.com/software-2-0-a64152b37c35
- https://www.extremetech.com/computing/295159-intel-acknowledges-its-long-10nm-delay-caused-by-being-too-aggressive
- https://calmcode.io/
- https://stackoverflow.com/questions/tagged/python
- https://www.w3schools.com/python/python_lists_comprehension.asp
- https://docs.python.org/3/library/typing.html
- https://pre-commit.com/
- https://cython.org/
- https://numba.pydata.org/
- https://cupy.dev/
- https://arrow.apache.org/
- https://llvm.org/
- https://jakevdp.github.io/PythonDataScienceHandbook/01.07-timing-and-profiling.html
- https://towardsdatascience.com/speed-up-jupyter-notebooks-20716cbe2025
- https://scoutapm.com/blog/identifying-bottlenecks-and-optimizing-performance-in-a-python-codebase
- https://code.tutsplus.com/tutorials/understand-how-much-memory-your-python-objects-use--cms-25609
- https://github.com/DerwenAI/ray_tutorial/blob/main/pi.ipynb
- https://github.com/rasbt/watermark
- https://github.com/pythonspeed/filprofiler
- https://mg.pov.lt/objgraph/
- https://docs.python.org/3/library/tracemalloc.html
- https://jiffyclub.github.io/snakeviz/
- https://docs.python.org/3/library/profile.html%23module-cProfile
- https://github.com/joerick/pyinstrument/
(no slide links)
(no transcription)
Paco Nathan
2021-06-12 23:41:00
In a Post-Moore's Law world, how do data science and data engineering need to change? This talk presents design patterns for idiomatic programming in Python so that hardware can optimize machine learning workflows. We'll look at ways of handling data that are either "sparse" or "dense" depending on the stage of ML workflow – plus, how to leverage profiling tools in Python to understand how to take advantage of hardware. We'll also consider four key abstractions which are outside of most programming languages, but vital in data science work.
Disclaimer: all trademarks, service marks, trade names, trade dress, product names, and logos appearing above are the property of their respective owners