CuGenDBv2

Moc06g30760 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc06g30760
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Ty3/gypsy retrotransposon protein
Genome location: chr6:23139787..23148174
RNA-Seq Expression: Moc06g30760
Synteny: Moc06g30760
Gene Ontology terms: GO:0006259 - DNA metabolic process (biological process)
InterPro domains: IPR005162 - Retrotransposon gag domain
                  IPR016197 - Chromo-like domain superfamily


Homology
GenBank top hits (e value, % identity, alignment)
KAA0058900.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa] | e value: 6.0e-42 | % identity: 36.23
Query:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE
        MTQK +EERL+A ++EIE IK ++ R+P +E ++ ++   ++ +          ++     T    ++  + +D +   +    + P    +R KFKK+E
Subjt:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE

Query:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL
        MP+F+GE+ D W++RA  YF+++ L  +EK+ +++VS E   + WFR  ++R+ F  W+ELK RL+++F + +  + CARFLA+KQEG+V E+ + FE L
Subjt:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL

Query:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE
        +A LP ++++VL   FTNGLD V+R EV  +  VGLE++M A +  E++ E+A+      + DF+
Subjt:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE

TYK10830.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa] | e value: 6.0e-42 | % identity: 34.11
Query:  KRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKV----------------KAD------TLKITEWIDAECGRDKGKYV
        K+ EER E +++EI  I+ +L R+P +E  I++    +       +E   H Q++                ++D        K+ E ID   G +K    
Subjt:  KRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKV----------------KAD------TLKITEWIDAECGRDKGKYV

Query:  APRGVDEREKFKKVEMPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVK
             ++R KFKK+EMP+F+GE+ DAWLFRA RYF+I++LT+ EK+IV+ +SFE   ++W+R+ + R  F  W +LK RL  +F S ++  +C +FL ++
Subjt:  APRGVDEREKFKKVEMPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVK

Query:  QEGTVAEFRESFEALAASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDREVAK-----------------IHPPNLSDDFEWLAI
        QE ++ E+R  F+ L A +  L D V+E TF NGL   ++ EV     VGL E+M   Q  ++ E+ +                    P L ++ EW A 
Subjt:  QEGTVAEFRESFEALAASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDREVAK-----------------IHPPNLSDDFEWLAI

Query:  PEEVLDLHLHPENQHVKLLICWKGLPDFEATWEPRQQFQQQFP
        PEEV   ++  +     +LI WKGL   EATWE   + QQ+FP
Subjt:  PEEVLDLHLHPENQHVKLLICWKGLPDFEATWEPRQQFQQQFP

TYK23724.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa] | e value: 6.0e-42 | % identity: 36.23
Query:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE
        MTQK +EERL+A ++EIE IK ++ R+P +E ++ ++   ++ +          ++     T    ++  + +D +   +    + P    +R KFKK+E
Subjt:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE

Query:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL
        MP+F+GE+ D W++RA  YF+++ L  +EK+ +++VS E   + WFR  ++R+ F  W+ELK RL+++F + +  + CARFLA+KQEG+V E+ + FE L
Subjt:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL

Query:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE
        +A LP ++++VL   FTNGLD V+R EV  +  VGLE++M A +  E++ E+A+      + DF+
Subjt:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE

TYK28460.1 gypsy/ty3 element polyprotein [Cucumis melo var. makuwa] | e value: 3.5e-42 | % identity: 36.23
Query:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE
        MTQK +EERL+A ++EIE IK ++ R+P +E ++ ++   ++++          ++     T    ++  + +D +   +    + P    +R KFKK+E
Subjt:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE

Query:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL
        MP+F+GE+ D W++RA  YF+++ L  +EK+ +++VS E   + WFR  ++R+ F  W+ELK RL+++F + +  + CARFLA+KQEG+V E+ + FE L
Subjt:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL

Query:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE
        +A LP ++++VL   FTNGLD V+R EV  +  VGLE++M A +  E++ E+A+      + DF+
Subjt:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE

XP_022158744.1 uncharacterized protein LOC111025209 [Momordica charantia] | e value: 1.9e-48 | % identity: 46.35
Query:  MVSKVPPALYASSQLTCLSHLTKTVEAIKKKLTPRQMRLFRKTVVGHLLDVDLVFNGPLIHS--------------------------------------
        M+ K+ PA YAS++L CLSH+ KT   IK KLTP+Q+ +FRKT+  HLLDVDLVFNGPL+ +                                      
Subjt:  MVSKVPPALYASSQLTCLSHLTKTVEAIKKKLTPRQMRLFRKTVVGHLLDVDLVFNGPLIHS--------------------------------------

Query:  ---LLLREVNES---------------LPDTISLNLFGSKRSMKYDNSLLGTTDDWEVCCNHDWGQLSFEKTIRSLQRALTKKTKEGRLRKSYSLYGFPW
           LLL E+ +                L   + L L G +RS K+D+ LLG  DDWE CCNHDW  LSF+KTI SLQR  + K+KEG LRKSYSLYGFPW
Subjt:  ---LLLREVNES---------------LPDTISLNLFGSKRSMKYDNSLLGTTDDWEVCCNHDWGQLSFEKTIRSLQRALTKKTKEGRLRKSYSLYGFPW

Query:  VFQVWGYEIISSMTGRVARKISDDVIPRMLRWR
         FQVW YEIISS++G +   +S DV+PR+L+WR
Subjt:  VFQVWGYEIISSMTGRVARKISDDVIPRMLRWR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7UV13 Gypsy/ty3 element polyprotein | e value: 2.9e-42 | % identity: 36.23
Query:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE
        MTQK +EERL+A ++EIE IK ++ R+P +E ++ ++   ++ +          ++     T    ++  + +D +   +    + P    +R KFKK+E
Subjt:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE

Query:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL
        MP+F+GE+ D W++RA  YF+++ L  +EK+ +++VS E   + WFR  ++R+ F  W+ELK RL+++F + +  + CARFLA+KQEG+V E+ + FE L
Subjt:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL

Query:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE
        +A LP ++++VL   FTNGLD V+R EV  +  VGLE++M A +  E++ E+A+      + DF+
Subjt:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE

A0A5D3CHI3 Transposon Tf2-1 polyprotein isoform X1 | e value: 2.9e-42 | % identity: 34.11
Query:  KRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKV----------------KAD------TLKITEWIDAECGRDKGKYV
        K+ EER E +++EI  I+ +L R+P +E  I++    +       +E   H Q++                ++D        K+ E ID   G +K    
Subjt:  KRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKV----------------KAD------TLKITEWIDAECGRDKGKYV

Query:  APRGVDEREKFKKVEMPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVK
             ++R KFKK+EMP+F+GE+ DAWLFRA RYF+I++LT+ EK+IV+ +SFE   ++W+R+ + R  F  W +LK RL  +F S ++  +C +FL ++
Subjt:  APRGVDEREKFKKVEMPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVK

Query:  QEGTVAEFRESFEALAASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDREVAK-----------------IHPPNLSDDFEWLAI
        QE ++ E+R  F+ L A +  L D V+E TF NGL   ++ EV     VGL E+M   Q  ++ E+ +                    P L ++ EW A 
Subjt:  QEGTVAEFRESFEALAASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDREVAK-----------------IHPPNLSDDFEWLAI

Query:  PEEVLDLHLHPENQHVKLLICWKGLPDFEATWEPRQQFQQQFP
        PEEV   ++  +     +LI WKGL   EATWE   + QQ+FP
Subjt:  PEEVLDLHLHPENQHVKLLICWKGLPDFEATWEPRQQFQQQFP

A0A5D3DJA9 Ty3/gypsy retrotransposon protein | e value: 2.9e-42 | % identity: 36.23
Query:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE
        MTQK +EERL+A ++EIE IK ++ R+P +E ++ ++   ++ +          ++     T    ++  + +D +   +    + P    +R KFKK+E
Subjt:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE

Query:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL
        MP+F+GE+ D W++RA  YF+++ L  +EK+ +++VS E   + WFR  ++R+ F  W+ELK RL+++F + +  + CARFLA+KQEG+V E+ + FE L
Subjt:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL

Query:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE
        +A LP ++++VL   FTNGLD V+R EV  +  VGLE++M A +  E++ E+A+      + DF+
Subjt:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE

A0A5D3DZ01 Gypsy/ty3 element polyprotein | e value: 1.7e-42 | % identity: 36.23
Query:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE
        MTQK +EERL+A ++EIE IK ++ R+P +E ++ ++   ++++          ++     T    ++  + +D +   +    + P    +R KFKK+E
Subjt:  MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADT----LKITEWIDAECGRDKGKYVAPRGVDEREKFKKVE

Query:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL
        MP+F+GE+ D W++RA  YF+++ L  +EK+ +++VS E   + WFR  ++R+ F  W+ELK RL+++F + +  + CARFLA+KQEG+V E+ + FE L
Subjt:  MPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEAL

Query:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE
        +A LP ++++VL   FTNGLD V+R EV  +  VGLE++M A +  E++ E+A+      + DF+
Subjt:  AASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDR-EVAKIHPPNLSDDFE

A0A6J1E0A9 uncharacterized protein LOC111025209 | e value: 9.3e-49 | % identity: 46.35
Query:  MVSKVPPALYASSQLTCLSHLTKTVEAIKKKLTPRQMRLFRKTVVGHLLDVDLVFNGPLIHS--------------------------------------
        M+ K+ PA YAS++L CLSH+ KT   IK KLTP+Q+ +FRKT+  HLLDVDLVFNGPL+ +                                      
Subjt:  MVSKVPPALYASSQLTCLSHLTKTVEAIKKKLTPRQMRLFRKTVVGHLLDVDLVFNGPLIHS--------------------------------------

Query:  ---LLLREVNES---------------LPDTISLNLFGSKRSMKYDNSLLGTTDDWEVCCNHDWGQLSFEKTIRSLQRALTKKTKEGRLRKSYSLYGFPW
           LLL E+ +                L   + L L G +RS K+D+ LLG  DDWE CCNHDW  LSF+KTI SLQR  + K+KEG LRKSYSLYGFPW
Subjt:  ---LLLREVNES---------------LPDTISLNLFGSKRSMKYDNSLLGTTDDWEVCCNHDWGQLSFEKTIRSLQRALTKKTKEGRLRKSYSLYGFPW

Query:  VFQVWGYEIISSMTGRVARKISDDVIPRMLRWR
         FQVW YEIISS++G +   +S DV+PR+L+WR
Subjt:  VFQVWGYEIISSMTGRVARKISDDVIPRMLRWR

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding | e value: 3.2e-09 | % identity: 25.41
Query:  DEREKFKKVEMPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTV
        +E  +  +V  P+   ENL   L     YF  N +  +E++ +   + E     W +    +   T W+E K  +  +  +T   +    +  ++QEG+V
Subjt:  DEREKFKKVEMPIFSGENLDAWLFRAGRYFEINKLTNEEKVIVSLVSFE---VSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTV

Query:  AEFRESFEALAASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDREVAKIHPPNLSDDFEWLAIPEEVLDL
         E+RE FEAL      L  + LE+ F  GL   L+  V  L   G+ ++M   Q +E+     ++   LS   E    P    +L
Subjt:  AEFRESFEALAASLPHLSDEVLESTFTNGLDSVLRVEVLCLNLVGLEEIMKAGQRIEDREVAKIHPPNLSDDFEWLAIPEEVLDL


Sequences
CDS sequence
ATGACTCAGAAAAGGGTTGAAGAAAGATTGGAAGCGGTAGATAGAGAAATTGAGATTATCAAACATGACTTGCTGAGACTACCGTCAATAGAACATTCCATTTCTCAGTT
AACCATGACAGTGTCTAGATTGGCCACTAAAATGGATGAACATTTTGATCATACCCAGAAGGTAAAAGCAGACACACTTAAAATAACAGAATGGATCGATGCAGAGTGTG
GAAGGGATAAGGGGAAGTATGTAGCTCCAAGGGGAGTGGATGAAAGGGAAAAATTCAAAAAAGTAGAAATGCCCATCTTTTCCGGCGAAAATCTGGATGCATGGTTATTT
CGAGCTGGCCGATATTTCGAGATCAACAAGTTGACGAACGAGGAAAAAGTCATCGTTTCTTTGGTGAGTTTTGAGGTGTCGTGGTTCCGGTTGACCGATAGCCGACAACC
ATTCACAGAGTGGGAGGAACTGAAACTCCGGCTATTCGACCAGTTCGGGTCGACCCAAGATCGAAGCTTATGCGCACGATTTTTAGCGGTAAAACAAGAGGGAACAGTCG
CGGAATTCAGAGAGTCGTTCGAAGCATTAGCGGCGTCGCTTCCACATTTATCAGATGAAGTATTGGAAAGCACGTTTACTAACGGTTTAGATTCTGTGCTAAGGGTTGAA
GTTCTATGTTTAAATCTAGTTGGGCTAGAGGAAATAATGAAGGCAGGCCAAAGGATAGAAGATCGTGAGGTGGCCAAGATCCACCCACCCAACCTATCTGATGATTTTGA
ATGGTTGGCCATCCCTGAGGAAGTCTTAGACCTGCATCTACACCCAGAAAATCAGCACGTGAAGTTACTCATATGTTGGAAGGGGCTGCCGGATTTTGAAGCCACATGGG
AGCCACGCCAGCAGTTTCAACAACAATTTCCTGCCTTCCACCTTGAGGACAAGGACATATTATCTTTTGTTGGTTATCAATCAATTGTAAGGAAAGATTCCTTACAATTG
GACCCCTGGATCTCGAGACAAGGACATTGTATTCCGGTTTGGGTGAGAGAAGAGATGAGGCAAAACTCAGTGGCGTGCCTCCTTCAACGTGAGGGAAGATGGAATGAAAA
TATTGTTCGTGAAAACTTTTGTGCTGAGGAAGCTGAAATGATTCTAAAGATCCCTCTGCCCCAGAAAAGTCAAGAGGATGAAATAATATGGAACATGGACAAGAGAGGAA
TTTTTCGGTCAAAAGTGCATATAACCTACCAACGGTTAATAAAGGTCAATGGTGGAAACTATTTTGGAAGATTCCAGTGCAACTTGAGAGATGCAAATCAAGGAATTCGT
AGACCTGGGGAAAGGGAGAGAGAGAGATGCATCTGTCAAGCGCTTCACCTCAAGGACGCCCAACTCATAGAAGATTTATACAGATTCAGTGTGGCAAGCAGACAAAGGGT
TGGATTATTAGAGATGAGAGGTGGAGATGAAAGAGAAGTTACTTACCTTAGAACTTATGGCAATTTGCAAAGGGATGGAAGCGGCGACAGAAGAAACACCCAAACCCATT
CTCTTCCAAATAGATTCTTTGGAAGCATTCCACTTGATTCAAGGCTGATCAACCACATAAGACGTGATGAAAATCGTGTAGAAATGGAAATGGTATCGAAGGTCCCTCCC
GCGCTGTATGCCTCTTCCCAACTGACCTGTCTATCGCACTTAACGAAGACAGTCGAGGCCATTAAAAAGAAACTTACCCCCCGTCAGATGCGTCTATTTAGGAAGACTGT
AGTTGGCCATCTGCTTGATGTGGACCTTGTCTTTAACGGACCACTAATCCATAGTTTGCTGCTTAGGGAGGTGAATGAGAGTCTCCCAGACACCATTAGCTTAAACTTAT
TTGGGAGTAAGCGAAGCATGAAGTACGACAATAGTTTGCTTGGAACAACAGACGATTGGGAAGTGTGCTGCAACCATGATTGGGGACAGCTATCGTTCGAGAAGACAATA
AGAAGTCTGCAGCGAGCACTGACGAAGAAGACAAAGGAGGGGAGGTTGAGGAAATCGTATAGTCTGTACGGTTTCCCGTGGGTATTCCAGGTGTGGGGGTACGAGATTAT
ATCTTCTATGACTGGACGAGTTGCTAGGAAGATTAGTGACGATGTCATCCCCCGTATGCTCCGGTGGAGGGTATGCGAGCCACCAACATCAGATGAGGTGGAAATGGATG
AAGAAGTCCCCGAACCTTCAAAGACTGCATGTGGGGGTCAGGACAATGTTGGTGCTCCCACTGATGCCGCTATCGATGATACACAGGACGGAGAGGGCACAAAGAAGAAA
GATAAAAATAAAAAGAAGAGTAACAAGAAAGTGTTCAGGCGACTTAGAAGTTTGGACATGCGTATGGGCGCTATGGACACGCGTATGAGTGCTATGGAGAAGAGGCTAGA
AGGTGTGAAGGGTGAGCTGAAATCCATACGCAAGTATTTGAGACGGATTGCCAAGGGTCAACTAGTCGACCCGGCCGACATGCGAAGAGGTAAGGGAGGTGACGCTGATG
GTGGAGCGGGAAGTACGGGTGATGGAGGGTATACTCAAGGGGATGCAACGGACAGTTATCCTAAAGGTGACGGACGTGGGGATGCAGCGGGCGATGGACGTGGGGATGGT
GCATCTGATGTTCATGACGACACAATTGTAGCATACGTCGACACTGGTGCGGTTTGTACAGAACAAGCCGTGGACCTTGAGGTAGGAGATGTGCATCAAGGAACGCCTCA
TGTTCATCCAAAGGACGTCCAGAGTACGGGCATTGGTACATCGAAGCATGTGGGAGCTGTTGAGTCCGATGTCATTCAGTACGAGTCACTGATACAGGTACAAATTCTTC
ATGATTCATCAGACACCGACACTGACCCGAATATCGTATCTCAGTCACAAGCGACCAAGGAATCCCAGTCGCAGGCACCCGAAGAAACACAATTTCAAGAGAAATCACAA
GGCCAGTCACGTGTGGACGACAACGAGGTACGTGTGTTACTGCAGCCAGTCACGCGTCCAAACCCTCGTCGTGGAGAACGAGAAAAGAAAGTGCCATAG
mRNA sequence
ATGACTCAGAAAAGGGTTGAAGAAAGATTGGAAGCGGTAGATAGAGAAATTGAGATTATCAAACATGACTTGCTGAGACTACCGTCAATAGAACATTCCATTTCTCAGTT
AACCATGACAGTGTCTAGATTGGCCACTAAAATGGATGAACATTTTGATCATACCCAGAAGGTAAAAGCAGACACACTTAAAATAACAGAATGGATCGATGCAGAGTGTG
GAAGGGATAAGGGGAAGTATGTAGCTCCAAGGGGAGTGGATGAAAGGGAAAAATTCAAAAAAGTAGAAATGCCCATCTTTTCCGGCGAAAATCTGGATGCATGGTTATTT
CGAGCTGGCCGATATTTCGAGATCAACAAGTTGACGAACGAGGAAAAAGTCATCGTTTCTTTGGTGAGTTTTGAGGTGTCGTGGTTCCGGTTGACCGATAGCCGACAACC
ATTCACAGAGTGGGAGGAACTGAAACTCCGGCTATTCGACCAGTTCGGGTCGACCCAAGATCGAAGCTTATGCGCACGATTTTTAGCGGTAAAACAAGAGGGAACAGTCG
CGGAATTCAGAGAGTCGTTCGAAGCATTAGCGGCGTCGCTTCCACATTTATCAGATGAAGTATTGGAAAGCACGTTTACTAACGGTTTAGATTCTGTGCTAAGGGTTGAA
GTTCTATGTTTAAATCTAGTTGGGCTAGAGGAAATAATGAAGGCAGGCCAAAGGATAGAAGATCGTGAGGTGGCCAAGATCCACCCACCCAACCTATCTGATGATTTTGA
ATGGTTGGCCATCCCTGAGGAAGTCTTAGACCTGCATCTACACCCAGAAAATCAGCACGTGAAGTTACTCATATGTTGGAAGGGGCTGCCGGATTTTGAAGCCACATGGG
AGCCACGCCAGCAGTTTCAACAACAATTTCCTGCCTTCCACCTTGAGGACAAGGACATATTATCTTTTGTTGGTTATCAATCAATTGTAAGGAAAGATTCCTTACAATTG
GACCCCTGGATCTCGAGACAAGGACATTGTATTCCGGTTTGGGTGAGAGAAGAGATGAGGCAAAACTCAGTGGCGTGCCTCCTTCAACGTGAGGGAAGATGGAATGAAAA
TATTGTTCGTGAAAACTTTTGTGCTGAGGAAGCTGAAATGATTCTAAAGATCCCTCTGCCCCAGAAAAGTCAAGAGGATGAAATAATATGGAACATGGACAAGAGAGGAA
TTTTTCGGTCAAAAGTGCATATAACCTACCAACGGTTAATAAAGGTCAATGGTGGAAACTATTTTGGAAGATTCCAGTGCAACTTGAGAGATGCAAATCAAGGAATTCGT
AGACCTGGGGAAAGGGAGAGAGAGAGATGCATCTGTCAAGCGCTTCACCTCAAGGACGCCCAACTCATAGAAGATTTATACAGATTCAGTGTGGCAAGCAGACAAAGGGT
TGGATTATTAGAGATGAGAGGTGGAGATGAAAGAGAAGTTACTTACCTTAGAACTTATGGCAATTTGCAAAGGGATGGAAGCGGCGACAGAAGAAACACCCAAACCCATT
CTCTTCCAAATAGATTCTTTGGAAGCATTCCACTTGATTCAAGGCTGATCAACCACATAAGACGTGATGAAAATCGTGTAGAAATGGAAATGGTATCGAAGGTCCCTCCC
GCGCTGTATGCCTCTTCCCAACTGACCTGTCTATCGCACTTAACGAAGACAGTCGAGGCCATTAAAAAGAAACTTACCCCCCGTCAGATGCGTCTATTTAGGAAGACTGT
AGTTGGCCATCTGCTTGATGTGGACCTTGTCTTTAACGGACCACTAATCCATAGTTTGCTGCTTAGGGAGGTGAATGAGAGTCTCCCAGACACCATTAGCTTAAACTTAT
TTGGGAGTAAGCGAAGCATGAAGTACGACAATAGTTTGCTTGGAACAACAGACGATTGGGAAGTGTGCTGCAACCATGATTGGGGACAGCTATCGTTCGAGAAGACAATA
AGAAGTCTGCAGCGAGCACTGACGAAGAAGACAAAGGAGGGGAGGTTGAGGAAATCGTATAGTCTGTACGGTTTCCCGTGGGTATTCCAGGTGTGGGGGTACGAGATTAT
ATCTTCTATGACTGGACGAGTTGCTAGGAAGATTAGTGACGATGTCATCCCCCGTATGCTCCGGTGGAGGGTATGCGAGCCACCAACATCAGATGAGGTGGAAATGGATG
AAGAAGTCCCCGAACCTTCAAAGACTGCATGTGGGGGTCAGGACAATGTTGGTGCTCCCACTGATGCCGCTATCGATGATACACAGGACGGAGAGGGCACAAAGAAGAAA
GATAAAAATAAAAAGAAGAGTAACAAGAAAGTGTTCAGGCGACTTAGAAGTTTGGACATGCGTATGGGCGCTATGGACACGCGTATGAGTGCTATGGAGAAGAGGCTAGA
AGGTGTGAAGGGTGAGCTGAAATCCATACGCAAGTATTTGAGACGGATTGCCAAGGGTCAACTAGTCGACCCGGCCGACATGCGAAGAGGTAAGGGAGGTGACGCTGATG
GTGGAGCGGGAAGTACGGGTGATGGAGGGTATACTCAAGGGGATGCAACGGACAGTTATCCTAAAGGTGACGGACGTGGGGATGCAGCGGGCGATGGACGTGGGGATGGT
GCATCTGATGTTCATGACGACACAATTGTAGCATACGTCGACACTGGTGCGGTTTGTACAGAACAAGCCGTGGACCTTGAGGTAGGAGATGTGCATCAAGGAACGCCTCA
TGTTCATCCAAAGGACGTCCAGAGTACGGGCATTGGTACATCGAAGCATGTGGGAGCTGTTGAGTCCGATGTCATTCAGTACGAGTCACTGATACAGGTACAAATTCTTC
ATGATTCATCAGACACCGACACTGACCCGAATATCGTATCTCAGTCACAAGCGACCAAGGAATCCCAGTCGCAGGCACCCGAAGAAACACAATTTCAAGAGAAATCACAA
GGCCAGTCACGTGTGGACGACAACGAGGTACGTGTGTTACTGCAGCCAGTCACGCGTCCAAACCCTCGTCGTGGAGAACGAGAAAAGAAAGTGCCATAG
Protein sequence
MTQKRVEERLEAVDREIEIIKHDLLRLPSIEHSISQLTMTVSRLATKMDEHFDHTQKVKADTLKITEWIDAECGRDKGKYVAPRGVDEREKFKKVEMPIFSGENLDAWLF
RAGRYFEINKLTNEEKVIVSLVSFEVSWFRLTDSRQPFTEWEELKLRLFDQFGSTQDRSLCARFLAVKQEGTVAEFRESFEALAASLPHLSDEVLESTFTNGLDSVLRVE
VLCLNLVGLEEIMKAGQRIEDREVAKIHPPNLSDDFEWLAIPEEVLDLHLHPENQHVKLLICWKGLPDFEATWEPRQQFQQQFPAFHLEDKDILSFVGYQSIVRKDSLQL
DPWISRQGHCIPVWVREEMRQNSVACLLQREGRWNENIVRENFCAEEAEMILKIPLPQKSQEDEIIWNMDKRGIFRSKVHITYQRLIKVNGGNYFGRFQCNLRDANQGIR
RPGERERERCICQALHLKDAQLIEDLYRFSVASRQRVGLLEMRGGDEREVTYLRTYGNLQRDGSGDRRNTQTHSLPNRFFGSIPLDSRLINHIRRDENRVEMEMVSKVPP
ALYASSQLTCLSHLTKTVEAIKKKLTPRQMRLFRKTVVGHLLDVDLVFNGPLIHSLLLREVNESLPDTISLNLFGSKRSMKYDNSLLGTTDDWEVCCNHDWGQLSFEKTI
RSLQRALTKKTKEGRLRKSYSLYGFPWVFQVWGYEIISSMTGRVARKISDDVIPRMLRWRVCEPPTSDEVEMDEEVPEPSKTACGGQDNVGAPTDAAIDDTQDGEGTKKK
DKNKKKSNKKVFRRLRSLDMRMGAMDTRMSAMEKRLEGVKGELKSIRKYLRRIAKGQLVDPADMRRGKGGDADGGAGSTGDGGYTQGDATDSYPKGDGRGDAAGDGRGDG
ASDVHDDTIVAYVDTGAVCTEQAVDLEVGDVHQGTPHVHPKDVQSTGIGTSKHVGAVESDVIQYESLIQVQILHDSSDTDTDPNIVSQSQATKESQSQAPEETQFQEKSQ
GQSRVDDNEVRVLLQPVTRPNPRRGEREKKVP