TechStack topics for computational (and data) scientists. The list of topics (see the topics folder) is incomplete. The course was inspired by (i) a summer course run by Dr. Ed Overman at Ohio State University and (ii) MIT's The Missing Semester of Your CS Education.
This course helps students become proficient with computational and data science tools. It covers critical elements of the computing ecosystem that make workflows smoother, including but not limited to the command line, text editors, and version control systems. Mastering these tools makes problem-solving more efficient.
After this course, students will be able to:
- navigate the file directory from the command line;
- add, remove, and rename files and directories at the CLI;
- clone a repository; add, commit, and push files to a repository; create a pull request;
- typeset mathematics;
- develop a professional profile;
- create, edit, and save text files;
- obtain data from a website via web scraping;
- visualize and manipulate geometrical objects;
- perform basic data wrangling using R;
- consider data ethics issues;
- differentiate and integrate functions that are impractical to handle by hand;
- examine computer security and privacy issues.
- Vim
- Git
- Virtualization
- Unix
- Scripting
- File permissions
- IDE (VS Code)
- AWS
- LaTeX
- Data Ethics
- Markdown
- Desmos
All contributions are welcome. A contribution may close an issue, fix a bug (reported or not), add new design blocks, improve the existing code, add new features, and so on. In the interest of fostering an open and welcoming environment, we as contributors and maintainers pledge to make participation in our project and our community a harassment-free experience for everyone, regardless of age, body size, disability, ethnicity, gender identity and expression, level of experience, nationality, personal appearance, race, religion, or sexual identity and orientation. Read the full Code of Conduct.
The dev branch is the project's default and base branch. It is used for development, and all pull requests should target it.
git clone https://github.com/jamesquinlan/dsc-techstack.git
cd dsc-techstack
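A typical contribution then follows a branch, commit, push cycle. The sketch below is one possible way to rehearse that cycle offline, not the project's prescribed procedure: it uses a throwaway local repository as a stand-in for the GitHub project, and the branch name `fix-typo`, the commit messages, the file edit, and the example identities are all hypothetical. With the real project, the clone step above replaces the stand-in, and the pushed branch is then opened as a pull request against dev.

```shell
#!/bin/sh
set -eu

# Throwaway workspace so the sketch runs without network access.
work=$(mktemp -d)
cd "$work"

# Stand-in for the upstream project (assumption: a local repo replaces GitHub here).
git init --quiet upstream
cd upstream
git symbolic-ref HEAD refs/heads/dev        # make dev the initial branch
git config user.name "Example Maintainer"   # hypothetical identity
git config user.email "maintainer@example.com"
echo "# dsc-techstack" > README.md
git add README.md
git commit --quiet -m "Initial commit"
cd ..

# Contributor side: clone, branch off dev, commit, and push the branch.
git clone --quiet upstream dsc-techstack
cd dsc-techstack
git config user.name "Example Contributor"  # hypothetical identity
git config user.email "contributor@example.com"
git checkout --quiet dev                    # start from the base branch
git checkout --quiet -b fix-typo            # hypothetical topic branch
echo "Contributions are welcome." >> README.md
git add README.md
git commit --quiet -m "Clarify README"
git push --quiet -u origin fix-typo         # publish the branch for a pull request

# Show the current branch and its upstream tracking branch.
git status --short --branch
```

Keeping each change on its own topic branch leaves dev clean locally and makes the resulting pull request easy to review.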
Code and documentation copyright 2023 Quinlan. The code is released under the MIT License.
@book{blum2008linux,
title = {Linux command line and shell scripting bible},
author = {Blum, Richard},
year = {2008},
publisher = {John Wiley \& Sons}
}
@book{chacon2014pro,
title = {Pro git},
author = {Chacon, Scott and Straub, Ben},
year = {2014},
publisher = {Springer Nature}
}
@book{cone2020markdown,
title = {Markdown Guide},
author = {Cone, Matt},
year = {2020},
publisher = {Independently Published}
}
@misc{david2018aws,
title = {AWS: Amazon Web Services Tutorial for Beginners},
author = {David, Bert},
year = {2018},
publisher = {Independently published}
}
@incollection{del2021introducing,
title = {Introducing Visual Studio Code},
author = {Del Sole, Alessandro},
booktitle = {Visual Studio Code Distilled: Evolved Code Editing for Windows, macOS, and Linux},
pages = {1--15},
year = {2021},
publisher = {Springer}
}
@article{ebert2014graphing,
title = {Graphing projects with Desmos},
author = {Ebert, David},
journal = {The Mathematics Teacher},
volume = {108},
number = {5},
pages = {388--391},
year = {2014},
publisher = {National Council of Teachers of Mathematics}
}
@article{herschel2017ethics,
title = {Ethics \& big data},
author = {Herschel, Richard and Miori, Virginia M},
journal = {Technology in Society},
volume = {49},
pages = {31--36},
year = {2017},
publisher = {Elsevier}
}
@book{lakshman2011linux,
title = {Linux shell scripting cookbook},
author = {Lakshman, Sarath and Tushar, S},
year = {2011},
publisher = {Packt Publishing}
}
@book{lamport1991latex,
title = {LaTeX},
author = {Lamport, Leslie},
year = {1991},
publisher = {{\'S}Company Cyfronet}
}
@book{mcdonnell2014pro,
title = {Pro vim},
author = {McDonnell, Mark},
year = {2014},
publisher = {Apress}
}
@manual{oracle2011virtualbox,
title = {VirtualBox user manual},
organization = {Oracle Corporation},
year = {2011}
}
@misc{kernighan2020c,
title = {The {C} programming language},
author = {Kernighan, Brian W and Ritchie, Dennis M},
year = {2020},
publisher = {Prentice Hall}
}
@misc{richardson2007beautiful,
title = {Beautiful soup documentation},
author = {Richardson, Leonard},
year = {2007},
month = apr,
url = {https://buildmedia.readthedocs.org/media/pdf/beautiful-soup-4/latest/beautiful-soup-4.pdf}
}
@book{tsitoara2020beginning,
title = {Beginning Git and GitHub},
author = {Tsitoara, Mariot},
year = {2020},
publisher = {Springer}
}
@article{van2017personality,
title = {Personality perception based on LinkedIn profiles},
author = {Van de Ven, Niels and Bogaert, Aniek and Serlie, Alec and Brandt, Mark J and Denissen, Jaap JA},
journal = {Journal of Managerial Psychology},
volume = {32},
number = {6},
pages = {418--429},
year = {2017},
publisher = {Emerald Publishing Limited}
}
@book{xie2020r,
title = {R markdown cookbook},
author = {Xie, Yihui and Dervieux, Christophe and Riederer, Emily},
year = {2020},
publisher = {CRC Press}
}
@book{xie2018r,
title = {R markdown: The definitive guide},
author = {Xie, Yihui and Allaire, Joseph J and Grolemund, Garrett},
year = {2018},
publisher = {CRC Press}
}