Airflow Links Read some about Luigi and how to do pipeline/workflow stuff in standard python leveraging a package like Luigi. Then use your understanding to look into how Airflow works. Luigi https://luigi.readthedocs.
Bugs in Labs and Software Often, early in cohorts, students begin to discover two things. The first is that the materials they are using are incomplete, vague, and contain mistakes or errors that cause confusion. The second thing
the CPU Before the OS Concepts lecture, I'd like you to watch these videos. They break down the complexities of the CPU, Memory and so on within a modern microprocessor. They move at a decent
Notes on Conway's Game of Life Cellular Automata. https://bitstorm.org/gameoflife/ good sim on page, several great patterns pre-built. https://www.quantamagazine.org/john-conways-life-in-games-20150828/ bio of Conway, plus good desc of game. https://web.stanford.edu/class/sts145/
GIT: extra files and .gitignore Let’s say you have a bunch of files you don't want in your repo (like .class or .ctxt files?) but you've already pushed them! (doh!) And you'd like to add them to your
Computational Thinking for Mortals Many people believe they are not "technical" and therefore, they have no capacity to understand technical ideas. This is pretty meaningless in the 21st century. If you were not technical, you'd
Introduction to COBOL What is COBOL? One computer programming language was designed specifically for business, Common Business-Oriented Language, COBOL. COBOL has been transforming and supporting business globally since its invention in 1959. COBOL is responsible for
Kafka on Mac (2) Now, let's do some python with our kafka. (Hey, your kafka and zookeeper should still be running.) (and if not, start 'em up again) in a directory named (lines10) in your projects somewhere,
Kafka on Mac (1) So make sure you have java 1.8 running $ java -version java version "1.8.0_66" Java(TM) SE Runtime Environment (build 1.8.0_66-b17) Java HotSpot(TM) 64-Bit
Apache Kafka for ZCW Data As part of your weekend agenda, read these articles, as many as you can. You might even decide to follow one through to the end. But that's up to you. Kafka is one
Apache Spark for ZCW Data Apache Spark is a fast and general-purpose cluster computing system. It provides high-level APIs in Java, Scala, Python and R, and an optimized engine that supports general execution graphs. It also supports a
Data, Just Reading Read these articles, to help cement and settle some of the things you've been learning over these last weeks. None of them are challenging, you should start to see things you recognize, and
Interesting Data Sets Finding cool datasets should be something you do continuously as a Data Engineer. Ponder some of these: https://www.dataquest.io/blog/free-datasets-for-projects/ https://www.analyticsindiamag.com/popular-data-science-projects-for-aspiring-data-scientists/ https://www.springboard.com/blog/
Articles for Data {Engineers,Analysts,Scientists} Books for Data Engineering Data Engineering Data Engineering Cookbook Plumber of DS Python Primary: Think Python Python Notes SQL Primary: Learning SQL: Master SQL Fundamentals Database Design SQL Notes PostgreSQL Notes SQL 1
Books for ZCW Data Engineering Data Engineering Data Engineering Cookbook Plumber of DS Python Primary: Think Python Hitchhiker's Guide to Python Python Notes SQL Primary: Learning SQL: Master SQL Fundamentals SQL: The Complete Reference Database Design SQL Notes
Jupyter Notebook Tips To load a missing python library into your notebook server import sys print(sys.executable) will tell you what the executable path is for your python used within your notebooks. /Users/foo/anaconda3/
Some Algorithms The Guardian, a newspaper in the UK, has a great article on the author's top five picks of important algorithms. One can argue whether these are the top 5, but these five, at
GIT: never lose code Git is a free and open source distributed version control system designed to handle everything from small to very large projects with speed and efficiency. Git is easy to learn and has a
IF Patterns Patterns in code are a topic that a lot of professional coders put a lot of stock in. There is a famous book on it, and we do cover some of these things
Blockchain Overview Blockchain technology has become one of the more interesting advanced topics in commercial software. From Bitcoin to Ethereum to Hyperledger Fabric, the fundamentals have been largely wrought. NIST, National Institute of Standards and
JavaFX: Some pix for you to consider Sometimes, a picture is worth 65,536 words. or to understand what you can do with the top of the tree:
Algorithms & Data Structures, Oh My! When you interview, you might get a question about algorithms or data structures: and then you panic. You went to Zip Code, not a CS degree, you're a solid programmer, but your theoretic
Feel the Need to Do Something? From The-I-Feel-The-Need-to-Do-Something Dept.: Before you arrive at ZipCodeWilmington for your cohort, here are a few things you can work with. Get yourself an account on GitHub.com if you don’t have one
In addition to Tech... In addition to tech, you should be thinking about how you can get smarter about (and more exposed to) the topics in this article. 5 Subjects Every Computer Science Student Should Learn
Oracle Java Docs You need to realize that Oracle is Java's "best buddy". All the very latest, and authoritative, information on Java is available on some very important documentation servers at Oracle.com. Well,