Open Thoughts
Articles for the technology minded

apache-spark career conflit containers data devops docker documentation download games getting-things-done git gitlab gtd helm home how-to inspire java javascript kubernetes management meeting microsoft office pandas programming pyspark python remote-working scala scripting spark teams tech4good tensorflow testing tutorial typing windows
Can you learn to code on your phone?

Can you learn to code on your phone?

Wednesday, Jul 1, 2020

@ Openthought

Given the limited screen size and options for programming on your phone is it even possible to get a good grounding in a programming language just from a smartphone? The benefits of learning to code have been pushed heavily to people of all ages as an option for upskilling or just becoming more computer literate. There are a lot of options out there, but I’ll just look at a handful and see if they are any good.

Python for teams

Python for teams

Saturday, Jun 27, 2020

@ Openthought

If you’re a beginner or even if you’ve been doing it for years it’s easy to get lazy and just get by, on doing what you know. Churning out code or maintaining a creaking code base is a bit depressing, but relatively easy to do and you can plod along through that for a while. But there comes a time when it feels like you need to up your game. You can feel the technical debt racking up and you think there should be a better way to do this.

Pythonista to Python Pro

Pythonista to Python Pro

Friday, Jun 26, 2020

@ Openthought

You’ve been doing small scripts for a while and you’ve started getting bigger and bigger projects. But you’re getting the fear, the dreaded imposter syndrome. Now to be fair we all get it and we all have to start somewhere. So here are a few tips to get you from somebody who does Python to a Python developer.

Installing Apache Spark on windows? A getting started guide.

Installing Apache Spark on windows? A getting started guide.

Thursday, Jun 25, 2020

@ Openthought

We all read these articles about how big data is taking over the world. One of the tools widely used for this large scale data processing is Apache Spark. Apache Spark is a big data analytics agent and is the base framework of a lot of machine learning and data science used across the industry. It’s all well and good doing data analysis projects with your Jupyter Notebook, pyspark and Pandas, but if you want it to scale you need to design it a little differently.
Saving £millions for the NHS with Pandas

Saving £millions for the NHS with Pandas

Thursday, Jun 25, 2020

@ Openthought

There has been an open data initiative in the UK since 2010 when data.gov.uk was created. After 9 years we now have a huge amount of browseable datasets on the website that can be downloaded and used for your own analysis.

One of the larger data sets available on data.gov.uk is the GP Practice prescription data coming in at around 10 million rows of data every month. This is a lot of data for your average spreadsheet to handle, so this is where tools like Pandas comes in. Pandas is a data analysis library for python that can handle many millions of rows of data and run statistical analysis on them to try and extract useful information.

Social Links