After completing this module, the learner should be able to use Python to:
- Perform basic math operations
- Assign values to variables
- Identify and distinguish between string, integer, and float data types
- Use built-in Python functions
- Find documentation on a Python object in the documentation
- Write and execute a simple program
length, class, strcharacternumericlogicalintegerlistmatrixdata.framefactor==, !=, >, >=, %in%&, |, &&, ||NA)Python is a general purpose programming language that supports rapid development of scripts and applications.
Python's main advantages:
Python is an interpreted language. As a consequence, we can use it in two ways:
user:host:~$ python
Python 3.5.1 (default, Oct 23 2015, 18:05:06)
[GCC 4.8.3] on linux2
Type "help", "copyright", "credits" or "license" for more information.
>>> 2 + 2
4
>>> print("Hello World")
Hello World
*.py extension:user:host:~$ python my_script.py
Hello World
The most basic data types in Python are strings, integers and floats:
text = "Data Carpentry"
number = 42
pi_value = 3.1415
Here we've assigned data to variables, namely text, number and pi_value,
using the assignment operator =. The variable called text is a string which
means it can contain letters and numbers. We could reassign the variable text
to an integer too - but be careful reassigning variables as this can get
confusing.
To print out the value stored in a variable we can simply type the name of the variable into the interpreter:
>>> text
"Data Carpentry"
however, in scripts we must use the print function:
# Comments start with #
# Next line will print out text
print(text)
"Data Carpentry"
We can perform mathematical calculations in Python using the basic operators
+, -, /, *, %:
>>> 2 + 2
4
>>> 6 * 7
42
>>> 2 ** 16 # power
65536
>>> 13 % 5 # modulo
3
We can also use comparison and logic operators:
<, >, ==, !=, <=, >= etc.
and, or, not
>>> 3 > 4
False
>>> True and True
True
>>> True or False
True
Lists are a common data structure to hold an ordered sequence of elements. Each element can be accessed by an index:
>>> numbers = [1,2,3]
>>> numbers[0]
1
A for loop can be used to access the elements in a list or other Python data
structure one at a time:
for num in numbers:
print(num)
1
2
3
Indentation is very important in Python. Note that the second line in the example above is indented. This is Python's way of marking a block of code. We will discuss this in more detail later.
To add elements to the end of a list, we can use the append method:
>>> numbers.append(4)
>>> print(numbers)
[1,2,3,4]
Methods are a way to interact with an object (a list, for example). We can invoke
a method using the dot . followed by the method name and a list of arguments in parentheses.
To find out what methods are available for an object, we can use the built-in help command:
help(numbers)
Help on list object:
class list(object)
| list() -> new empty list
| list(iterable) -> new list initialized from iterable's items
...
We can also access a list of methods using dir. Some methods names are
surrounded by double underscores. Those methods are called "special", and
usually we access them in a different way. For example __add__ method is
responsible for the + operator.
dir(numbers)
>>> dir(numbers)
['__add__', '__class__', '__contains__', ...]
A tuple is similar to a list in that it's an ordered sequence of elements. However,
tuples can not be changed once created (they are "immutable"). Tuples are
created by placing comma-separated values inside parentheses ().
# tuples use parentheses
a_tuple= (1,2,3)
another_tuple = ('blue','green','red')
# Note: lists use square brackets
a_list = [1,2,3]
a_tuple[2]=5 vs a_list[1]=5 ?type(a_tuple) into python - what is the object type?A dictionary is a container that holds pairs of objects - keys and values.
>>> translation = {"one" : 1, "two" : 2}
>>> translation["one"]
1
Dictionaries work a lot like lists - except that you index them with keys. You can think about a key as a name for or a unique identifier for a set of values in the dictionary. Keys can only have particular types - they have to be "hashable". Strings and numeric types are acceptable, but lists aren't.
>>> rev = {1 : "one", 2 : "two"}
>>> rev[1]
'one'
>>> bad = {[1,2,3] : 3}
...
TypeError: unhashable type: 'list'
To add an item to the dictionary we assign a value to a new key:
>>> rev = {1 : "one", 2 : "two"}
>>> rev[3] = "three"
>>> rev
{1: 'one', 2: 'two', 3: 'three'}
Using for loops with dictionaries is a little more complicated. We can do this
in two ways:
>>> for key, value in rev.items():
... print(key, "->", value)
...
1 -> one
2 -> two
3 -> three
or
>>> for key in rev.keys():
... print(key, "->", rev[key])
...
1 -> one
2 -> two
3 -> three
>>>
Can you do reassignment in a dictionary? Give it a try.
First check what rev is right now (remember rev is the name of our dictionary).
Type:
>>> rev
You should see the following output:
{1: 'one', 2: 'two', 3: 'three'}
Try to reassign the second value (in the key value pair) so that it no longer reads "two" but instead reads "apple-sauce".
>>> rev[2] = "apple-sauce"
Now display rev again to see if it has changed; you should see the following:
{1: 'one', 2: 'apple-sauce', 3: 'three'}
It is important to note that dictionaries are "unordered" and do not remember the sequence of their items (i.e. the order in which key:value pairs were added to the dictionary). Because of this, the order in which items are returned from loops over dictionaries might appear random and can even change with time.
Defining part of a program in Python as a function is done using the def
keyword. For example a function that takes two arguments and returns their sum
can be defined as:
def add_function(a, b):
result = a + b
return result
z = add_function(20, 22)
print(z)
42
Key points here:
defreturn keyword precedes returned value