-
Notifications
You must be signed in to change notification settings - Fork 17
/
Copy pathsolution_72.py
49 lines (35 loc) · 1.88 KB
/
solution_72.py
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
### 7.3
# In this exercise, we will find the 'gene' and 'CDS' features for the spike
# protein. The spike protein (S protein) is a large type I trans-membrane
# protein, which is highly glycosylated.
# Spike proteins assemble into trimers on the virion surface to form the
# distinctive "corona", or crown-like appearance. The gene is usually called
# the **S** gene and the protein product is called **surface glycoprotein**.
# **1.** First, write a function, which accepts a single argument, a record,
# which has to be of `SeqRecord` type.
# This function should loop over all features of this record and return a
# `list` of features whose `.type` is either **'gene' or 'CDS'**. You will
# use this function in all other questions of **7.3**!
def get_gene_cds_features(rec):
result = []
for feature in rec.features:
if feature.type in ['CDS', 'gene']:
result.append(feature)
return result
# Using this function, print the **keys** of the qualifiers for all 'gene'
# and 'CDS' features.
# What is the single **key** that can be found in all qualifiers?
# *Hint:* Qualifiers are a special kind of dictionary called OrderedDict.
# Their keys can be accessed by the `.keys()` method.
for feature in get_gene_cds_features(rec):
print("{}: {}".format(feature.type, list(feature.qualifiers.keys())))
# **2.** Now, using the same function, print the `'gene'` qualifier for all.
# Can you spot the 'S' gene?
for feature in get_gene_cds_features(rec):
print("{}: {}".format(feature.type, feature.qualifiers['gene']))
# **3.** Using the `'gene'` qualifier from the previous exercise,
# print the `location` of all 'gene' and 'CDS' features for the 'S' gene.
# *Attention, the value of the `'gene'` qualifier is a `list`*
for feature in get_gene_cds_features(rec):
if feature.qualifiers['gene'][0] == "S":
print("{}: {}".format(feature.type, feature.location))