; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008073 (gene) of Snake gourd v1 genome

Gene IDTan0008073
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRibonucloprotein
Genome locationLG08:2022860..2024708
RNA-Seq ExpressionTan0008073
SyntenyTan0008073
Gene Ontology termsGO:0000398 - mRNA splicing, via spliceosome (biological process)
GO:0000470 - maturation of LSU-rRNA (biological process)
GO:0030490 - maturation of SSU-rRNA (biological process)
GO:0005730 - nucleolus (cellular component)
GO:0005840 - ribosome (cellular component)
GO:0031428 - box C/D snoRNP complex (cellular component)
GO:0032040 - small-subunit processome (cellular component)
GO:0046540 - U4/U6 x U5 tri-snRNP complex (cellular component)
GO:0071011 - precatalytic spliceosome (cellular component)
GO:0003723 - RNA binding (molecular function)
InterPro domainsIPR002415 - H/ACA ribonucleoprotein complex, subunit Nhp2, eukaryote
IPR004037 - Ribosomal protein L7Ae conserved site
IPR004038 - Ribosomal protein L7Ae/L30e/S12e/Gadd45
IPR018492 - Ribosomal protein L7Ae/L8/Nhp2 family
IPR029064 - 50S ribosomal protein L30e-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_002524478.1 NHP2-like protein 1 [Ricinus communis]1.2e-6198.44Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLAD+QLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLK+QIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

XP_004139318.1 NHP2-like protein 1 [Cucumis sativus]5.5e-6299.22Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLAD+QLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

XP_020206464.1 NHP2-like protein 1 [Cajanus cajan]9.4e-6298.44Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLAD+QLTITI+DLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

XP_022142972.1 NHP2-like protein 1 [Momordica charantia]2.5e-62100Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

XP_027332262.1 NHP2-like protein 1 isoform X2 [Abrus precatorius]1.2e-6197.66Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLAD+QLTITI+DLVQQAANYKQLKKGANEATKTLNRGISEF+VMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

TrEMBL top hitse value%identityAlignment
A0A0A0LF58 Ribonucloprotein2.7e-6299.22Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLAD+QLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

A0A444WXD4 Ribonucloprotein2.7e-6299.22Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLAD+QLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

A0A6J1CPF3 Ribonucloprotein1.2e-62100Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

A0A6J1FDG8 Ribonucloprotein1.2e-62100Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

A0A6J1HQD3 Ribonucloprotein2.7e-6299.22Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGEAVNPKAYPLAD+QLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVTTNEGSQLKSQIQQLKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

SwissProt top hitse value%identityAlignment
P55769 NHP2-like protein 11.6e-5178.91Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MT   VNPKAYPLAD+ LT  +LDLVQQ+ NYKQL+KGANEATKTLNRGISEF+VMAAD EPLEI+LHLPLL EDKNVPYVFV SKQALGRACGV+RPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVT  EGSQLK QIQ ++ +IE+LL+
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

Q3B8S0 NHP2-like protein 11.6e-5178.91Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MT   VNPKAYPLAD+ LT  +LDLVQQ+ NYKQL+KGANEATKTLNRGISEF+VMAAD EPLEI+LHLPLL EDKNVPYVFV SKQALGRACGV+RPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVT  EGSQLK QIQ ++ +IE+LL+
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

Q4R5C6 NHP2-like protein 11.6e-5178.91Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MT   VNPKAYPLAD+ LT  +LDLVQQ+ NYKQL+KGANEATKTLNRGISEF+VMAAD EPLEI+LHLPLL EDKNVPYVFV SKQALGRACGV+RPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVT  EGSQLK QIQ ++ +IE+LL+
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

Q5XH16 NHP2-like protein 13.7e-5381.25Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MT   VNPKAYPLAD+QLT T+LDLVQQ+ANYKQL+KGANEATKTLNRGI+EF+VMAAD EPLEI+LHLPLL EDKNVPYVFV SKQALGRACGV+RPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVT  EGSQLK QIQ ++ AIE+LL+
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

Q6P8E9 NHP2-like protein 13.7e-5381.25Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MT   VNPKAYPLAD+QLT T+LDLVQQAANYKQL+KGANEATKTLNRGI+EF+VMAAD EPLEI+LHLPLL EDKNVPYVFV SKQALGRACGV+RPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        AC+VT  EGSQLK QIQ L+ +IE+LL+
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

Arabidopsis top hitse value%identityAlignment
AT4G12600.1 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein2.9e-6192.97Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MT E VNPKAYPLADSQL+ITILDLVQQA NYKQLKKGANEATKTLNRGISEF+VMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRAC VTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVT+NE SQLKSQIQ LKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

AT4G12600.2 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein1.3e-5365.03Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDK------------------------
        MT E VNPKAYPLADSQL+ITILDLVQQA NYKQLKKGANEATKTLNRGISEF+VMAADTEPLEILLHLPLLAEDK                        
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDK------------------------

Query:  -------------------------------NVPYVFVPSKQALGRACGVTRPVIACSVTTNEGSQLKSQIQQLKDAIEKLLI
                                       NVPYVFVPSKQALGRAC VTRPVIACSVT+NE SQLKSQIQ LKDAIEKLLI
Subjt:  -------------------------------NVPYVFVPSKQALGRACGVTRPVIACSVTTNEGSQLKSQIQQLKDAIEKLLI

AT4G22380.1 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein2.0e-6293.75Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGE VNPKAYPLADSQL+ITI+DLVQQA NYKQLKKGANEATKTLNRGISEFVVMAAD EPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVT+NE SQLKSQIQ LKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

AT5G20160.1 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein1.2e-6294.53Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
        MTGE VNPKAYPLADSQL+ITILDLVQQA NYKQLKKGANEATKTLNRGISEFVVMAAD EPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVI

Query:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI
        ACSVT+NE SQLKSQIQ LKDAIEKLLI
Subjt:  ACSVTTNEGSQLKSQIQQLKDAIEKLLI

AT5G20160.2 Ribosomal protein L7Ae/L30e/S12e/Gadd45 family protein1.2e-5775.62Show/hide
Query:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANE--------------------------------ATKTLNRGISEFVVMAADTEPLEILLH
        MTGE VNPKAYPLADSQL+ITILDLVQQA NYKQLKKGANE                                ATKTLNRGISEFVVMAAD EPLEILLH
Subjt:  MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANE--------------------------------ATKTLNRGISEFVVMAADTEPLEILLH

Query:  LPLLAEDKNVPYVFVPSKQALGRACGVTRPVIACSVTTNEGSQLKSQIQQLKDAIEKLLI
        LPLLAEDKNVPYVFVPSKQALGRACGVTRPVIACSVT+NE SQLKSQIQ LKDAIEKLLI
Subjt:  LPLLAEDKNVPYVFVPSKQALGRACGVTRPVIACSVTTNEGSQLKSQIQQLKDAIEKLLI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAGGAGAAGCGGTGAATCCCAAAGCCTATCCTTTGGCCGATTCACAGCTTACAATTACTATTCTTGATCTTGTTCAGCAAGCTGCTAACTACAAGCAGCTAAAGAA
GGGTGCGAATGAAGCCACTAAGACTCTGAATAGAGGGATTTCGGAGTTTGTAGTGATGGCTGCGGACACTGAACCGCTTGAGATTCTTCTCCATCTTCCATTGTTGGCCG
AAGATAAGAATGTGCCCTATGTATTTGTCCCTTCAAAGCAAGCTCTTGGCCGAGCATGTGGAGTCACGAGACCTGTAATCGCATGTTCTGTAACGACAAACGAAGGAAGT
CAATTGAAATCCCAGATACAGCAACTGAAGGATGCCATTGAGAAGCTGTTGATCTGA
mRNA sequenceShow/hide mRNA sequence
GAGAATTTACCAGTAGCTCTTAAACCCTAATTTTTTTCCTTCAAATTTTGCACACTGAATTAAAACCCTAACTTTTCTTCTTCGATCTAAAATGACAGGAGAAGCGGTGA
ATCCCAAAGCCTATCCTTTGGCCGATTCACAGCTTACAATTACTATTCTTGATCTTGTTCAGCAAGCTGCTAACTACAAGCAGCTAAAGAAGGGTGCGAATGAAGCCACT
AAGACTCTGAATAGAGGGATTTCGGAGTTTGTAGTGATGGCTGCGGACACTGAACCGCTTGAGATTCTTCTCCATCTTCCATTGTTGGCCGAAGATAAGAATGTGCCCTA
TGTATTTGTCCCTTCAAAGCAAGCTCTTGGCCGAGCATGTGGAGTCACGAGACCTGTAATCGCATGTTCTGTAACGACAAACGAAGGAAGTCAATTGAAATCCCAGATAC
AGCAACTGAAGGATGCCATTGAGAAGCTGTTGATCTGAAGTGTTATATTCTATATGGCATTCGGTATGATGGGCTTCTTTGATCGTCATGGTCGCTGCTCTGGAGGCTTT
GACTGAAGTTGTTGTGCAATTAGGAAAAAAAAAATGTCATGGTATTAACCTATATATTGAGCTTCCTTTTTTTTTTGTTTCCATTAAGAGTGGGACAAAACACTCGTACA
ATTTTAATGACAGTTTGTTAAGCTAATGTTACCATCTTTATCCTTGTGAGTTTACTGTTGAAGATGTTCATGGTTCAGAAGCATAGCTTGTTCATCAACCTTTTTTTTTT
GCTTTTTTGCATTATTATTGATTATAGATTTGAACAGAAA
Protein sequenceShow/hide protein sequence
MTGEAVNPKAYPLADSQLTITILDLVQQAANYKQLKKGANEATKTLNRGISEFVVMAADTEPLEILLHLPLLAEDKNVPYVFVPSKQALGRACGVTRPVIACSVTTNEGS
QLKSQIQQLKDAIEKLLI