Python - Basic Types & Type Conversion
Overview
Questions:Objectives:
What kinds of data do programs store?
How can I convert one type to another?
Requirements:
Explain key differences between integers and floating point numbers.
Explain key differences between numbers and character strings.
Use built-in functions to convert between integers, floating point numbers, and strings.
Time estimation: 30 minutesLevel: Introductory IntroductorySupporting Materials:Last modification: Apr 25, 2022
Best viewed in a Jupyter Notebook
This tutorial is best viewed in a Jupyter notebook! You can load this notebook one of the following ways
Launching the notebook in Jupyter in Galaxy
- Instructions to Launch JupyterLab
- Open a Terminal in JupyterLab with File -> New -> Terminal
- Run
wget https://training.galaxyproject.org/training-material/topics/data-science/tutorials/python-types/data-science-python-types.ipynb
- Select the notebook that appears in the list of files on the left.
Downloading the notebook
- Right click one of these links: Jupyter Notebook (With Solutions), Jupyter Notebook (Without Solutions)
- Save Link As..
Python is a typed language, data has a type, and different types of data cannot always be connected immediately and might need some conversion step before they can be used together. For instance if you add a number to a number, what should happen? If you add a number to a message, what do you expect will happen?
Agenda
In this tutorial, we will cover:
Types
Every value in a program has a specific type.
Name | Python Code | Represents |
---|---|---|
Integer | int |
represents positive or negative whole numbers like 3 or -512. |
Floating point number | float |
represents real numbers like 3.14159 or -2.5. |
Character string | str |
text, written with either ' or " quotes (they must match) |
Checking the Type
Use the built-in function type
to find out what type a value has. This works on values as well as variables. But remember: the value has the type — the variable is just a label.
Check the type of values with the type()
function:
print(type(52))
print(type(3.14159))
You can also check the types of variables
fitness = 'average'
print(type(fitness))
Methods
A value’s type determines what the program can do to it. Some operations may work
print(5 - 3)
And some operations may not work:
print('hello' - 'h')
For instance, you can use the +
and *
operators on strings.
full_name = 'Ahmed' + ' ' + 'Walsh'
print(full_name)
separator = '=' * 10
print(separator)
Some methods only accept specific types, or only work on specific types.
The built-in function len
returns the length of your data. Which of the following would you expect to work? len(string)
? len(int)
?
print(len(full_name))
print(len(52))
Matching Types
Not all types support all operations, adding an integer to a string doesn’t make much sense:
print(1 + '2')
This does not work because it’s ambiguous: should 1 + '2'
be 3
(a number) or '12'
(a string)? Some types can be converted to other types by using the type name as a function.
print(1 + int('2'))
print(str(1) + '2')
Operation Support
Here is a quick chart showing which operations are allowed for each pair:
Left\Right | int |
float |
str |
---|---|---|---|
int |
+-*/ |
+-*/ |
* |
float |
+-*/ |
+-*/ |
`` |
str |
* |
`` | + |
As you can see you can do 3 * "test"
and "test" * 3
, but it doesn’t work with floats.
Can mix integers and floats freely in operations.
Integers and floating-point numbers can be mixed in arithmetic. Python 3 (which we use) automatically converts integers to floats as needed.
print(f'half is {1 / 2.0}')
print(f'three squared is {3.0 ** 2}')
Variables only change value when something is assigned to them.
If we make one cell in a spreadsheet depend on another, and update the latter, the former updates automatically. However, this does not happen in programming languages.
variable_one = 1
variable_two = 5 * variable_one
variable_one = 2
print(f'first is {variable_one} and second is {variable_two}')
The computer reads the value of first
when doing the multiplication, creates
a new value, and assigns it to second
. After that, second
does not remember
where it came from. Every computation happens line-by-line.
question Question: Fractions
What type of value is 3.14159? How can you find out?
solution Solution
It is a floating-point number (often abbreviated “float”). It is possible to find out by using the built-in function
type()
.print(type(3.14159)) <class 'float'>
# Test out solutions here!
question Question: Automatic Type Conversion
What type of value is the result of (3.25 + 4)?
solution Solution
It is a float: integers are automatically converted to floats as necessary.
result = 3.25 + 4 print(f'result is {type(result)}')
7.25 is <class 'float'>
# Test out solutions here!
question Question: Choose a Type
What type of value (integer, floating point number, or character string) would you use to represent each of the following? Try to come up with more than one good answer for each problem. For example, in # 1, when would counting days with a floating point variable make more sense than using an integer?
- Number of days since the start of the year.
- Time elapsed from the start of the year until now in days.
- Serial number of a piece of lab equipment.
- A lab specimen’s age
- Current population of a city.
- Average population of a city over time.
solution Solution
The answers to the questions are:
- Integer, since the number of days would lie between 1 and 365.
- Floating point, since fractional days are required
- Character string if serial number contains letters and numbers, otherwise integer if the serial number consists only of numerals
- This will vary! How do you define a specimen’s age? whole days since collection (integer)? date and time (string)?
- Choose floating point to represent population as large aggregates (eg millions), or integer to represent population in units of individuals.
- Floating point number, since an average is likely to have a fractional part.
question Question: Division Types
In Python 3, the
//
operator performs integer (whole-number) floor division, the/
operator performs floating-point division, and the%
(or modulo) operator calculates and returns the remainder from integer division:print(f'5 // 3: {5 // 3}') print(f'5 / 3: {5 / 3}') print(f'5 % 3: {5 % 3}')
5 // 3: 1 5 / 3: 1.6666666666666667 5 % 3: 2
If
num_subjects
is the number of subjects taking part in a study, andnum_per_survey
is the number that can take part in a single survey, write an expression that calculates the number of surveys needed to reach everyone once.solution Solution
We want the minimum number of surveys that reaches everyone once, which is the rounded up value of
num_subjects/ num_per_survey
. This is equivalent to performing a floor division with//
and adding 1. Before the division we need to subtract 1 from the number of subjects to deal with the case wherenum_subjects
is evenly divisible bynum_per_survey
.num_subjects = 600 num_per_survey = 42 num_surveys = (num_subjects - 1) // num_per_survey + 1 print(num_subjects, 'subjects,', num_per_survey, 'per survey:', num_surveys)
600 subjects, 42 per survey: 15
# Test out solutions here!
question Question: Strings to Numbers
Where reasonable,
float()
will convert a string to a floating point number, andint()
will convert a floating point number to an integer:print("string to float:", float("3.4")) print("float to int:", int(3.4))
string to float: 3.4 float to int: 3
If the conversion doesn’t make sense, however, an error message will occur.
code-in Python
print("string to float:", float("Hello world!"))
code-out Output
Traceback (most recent call last): File "<stdin>", line 1, in <module> ValueError: could not convert string to float: 'Hello world!'
Given this information, what do you expect the following program to do?
What does it actually do?
Why do you think it does that?
print("fractional string to int:", int("3.4"))
solution Solution
What do you expect this program to do? It would not be so unreasonable to expect the Python 3
int
command to convert the string “3.4” to 3.4 and an additional type conversion to 3. After all, Python 3 performs a lot of other magic - isn’t that part of its charm?int("3.4")
However, Python 3 throws an error.
Traceback (most recent call last): File "<stdin>", line 1, in <module> ValueError: invalid literal for int() with base 10: '3.4'
Why? To be consistent, possibly. If you ask Python to perform two consecutive typecasts, you must convert it explicitly in code.
int(float("3.4"))
3
# Test out solutions here!
question Question: Arithmetic with Different Types
Which of the following will return the floating point number
2.0
? Note: there may be more than one right answer.first = 1.0 second = "1" third = "1.1"
first + float(second)
float(second) + float(third)
first + int(third)
first + int(float(third))
int(first) + int(float(third))
2.0 * second
solution Solution
Answer: 1 and 4 give exactly 2.0. Answer 5 gives the value
2
which may be considered equivalent, but is not returning a float specifically.
# Test out solutions here!
Key points
Every value has a type.
Use the built-in function
type
to find the type of a value.Types control what operations can be done on values.
Strings can be added and multiplied.
Strings have a length (but numbers don’t).
Must convert numbers to strings or vice versa when operating on them.
Can mix integers and floats freely in operations.
Variables only change value when something is assigned to them.
Frequently Asked Questions
Have questions about this tutorial? Check out the FAQ page for the Foundations of Data Science topic to see if your question is listed there. If not, please ask your question on the GTN Gitter Channel or the Galaxy Help ForumFeedback
Did you use this material as an instructor? Feel free to give us feedback on how it went.
Did you use this material as a learner or student? Click the form below to leave feedback.
Citing this Tutorial
- The Carpentries, Helena Rasche, Donny Vrins, Bazante Sanders, 2022 Python - Basic Types & Type Conversion (Galaxy Training Materials). https://training.galaxyproject.org/training-material/topics/data-science/tutorials/python-types/tutorial.html Online; accessed TODAY
- Batut et al., 2018 Community-Driven Data Analysis Training for Biology Cell Systems 10.1016/j.cels.2018.05.012
details BibTeX
@misc{data-science-python-types, author = "The Carpentries and Helena Rasche and Donny Vrins and Bazante Sanders", title = "Python - Basic Types & Type Conversion (Galaxy Training Materials)", year = "2022", month = "04", day = "25" url = "\url{https://training.galaxyproject.org/training-material/topics/data-science/tutorials/python-types/tutorial.html}", note = "[Online; accessed TODAY]" } @article{Batut_2018, doi = {10.1016/j.cels.2018.05.012}, url = {https://doi.org/10.1016%2Fj.cels.2018.05.012}, year = 2018, month = {jun}, publisher = {Elsevier {BV}}, volume = {6}, number = {6}, pages = {752--758.e1}, author = {B{\'{e}}r{\'{e}}nice Batut and Saskia Hiltemann and Andrea Bagnacani and Dannon Baker and Vivek Bhardwaj and Clemens Blank and Anthony Bretaudeau and Loraine Brillet-Gu{\'{e}}guen and Martin {\v{C}}ech and John Chilton and Dave Clements and Olivia Doppelt-Azeroual and Anika Erxleben and Mallory Ann Freeberg and Simon Gladman and Youri Hoogstrate and Hans-Rudolf Hotz and Torsten Houwaart and Pratik Jagtap and Delphine Larivi{\`{e}}re and Gildas Le Corguill{\'{e}} and Thomas Manke and Fabien Mareuil and Fidel Ram{\'{\i}}rez and Devon Ryan and Florian Christoph Sigloch and Nicola Soranzo and Joachim Wolff and Pavankumar Videm and Markus Wolfien and Aisanjiang Wubuli and Dilmurat Yusuf and James Taylor and Rolf Backofen and Anton Nekrutenko and Björn Grüning}, title = {Community-Driven Data Analysis Training for Biology}, journal = {Cell Systems} }
Congratulations on successfully completing this tutorial!
Do you want to extend your knowledge? Follow one of our recommended follow-up trainings: