; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020312 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020312
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr03:14455447..14472011
RNA-Seq ExpressionPay0020312
SyntenyPay0020312
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039558.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]3.2e-14786.16Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        MS MM DITA+AAMAEMERK+NFLMK  EER HEITALREQM+ RET E SQTP+VKATDKG NVVQENQPQQQSVS+ASLSVQQLQDMI N IRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        PP+T+FMYSKPYTKRID+LRMPLGYQPPKFQQFDGKGN KQHI HFVETCENAGSRGDQLVRQFVRSLKGN F+WYTDL+PEVIDSWEQLEKEFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
         RRTVSMMELTNTKQ K EPVI+YINRWRALSLD KDRLTELS VEMCTQGMHWGLLYILQGIK  TFEELATRAHDMELSIA+  TKDF +P+VRK+KK
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMV
        ETKSAEKVVKS+ KESMV
Subjt:  ETKSAEKVVKSTAKESMV

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus]3.0e-14583.49Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        MS MM D+  + AMAEMERK+N LMKVV+ER HEI AL+EQMQ RETAE SQTP+VK  DKG NVVQENQPQQQS S+ASLSVQQLQDMI NSIRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        P +TSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGN KQH+ HFVETCENAGSRGDQLVRQFVRSLKGNAF+WYTDL+PE I+SWEQLEKEFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
        TRRTVSMMELTNTKQ K EPVI+YINRWRALSLD KDRLTELS VEMCTQGMHWGLLYILQGIK RTFEELATRAHDMELSIA+  TKDFL+P+V+K+KK
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMVSYT
        E K AEK+VKST+KESMV  T
Subjt:  ETKSAEKVVKSTAKESMVSYT

XP_031739134.1 uncharacterized protein LOC116402863 [Cucumis sativus]5.1e-14583.49Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        MS MM D+  + AMAEMERK+N LMKVV+ER HEI AL+EQMQ RETAE SQTP+VK  DKG NVVQENQPQQQS S+ASLSVQQLQDMI +SIRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        P +TSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGN KQH+ HFVETCENAGSRGDQLVRQFVRSLKGNAF+WYTDL+PE I+SWEQLEKEFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
        TRRTVSMMELTNTKQ K EPVI+YINRWRALSLD KDRLTELS VEMCTQGMHWGLLYILQGIK RTFEELATRAHDMELSIA+  TKDFL+P+V+K+KK
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMVSYT
        E K AEK+VKSTAKESMV  T
Subjt:  ETKSAEKVVKSTAKESMVSYT

XP_031740568.1 uncharacterized protein LOC116403508 [Cucumis sativus]1.1e-14483.18Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        MS MM D+  + AMAEMERK+N LMKVV+ER HEI AL+EQMQ RETAE SQTP+VK  DKG NVVQENQPQQQS S+ASLSVQQLQDMI +SIRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        P +TSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGN KQH+ HFVETCENAGSRGDQLVRQFVRSLKGNAF+WYTDL+PE I+SWEQLEKEFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
        TRRTVSMMELTNTKQ K EPVI+YINRWRALSLD KDRLTELS VEMCTQGMHWGLLYILQGIK RTFEELATRAHDMELSIA+  TKDFL+P+V+K+KK
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMVSYT
        E K AEK+VKST+KESMV  T
Subjt:  ETKSAEKVVKSTAKESMVSYT

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]3.0e-14583.49Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        MS MM D+  + AMAEMERK+N LMKVV+ER HEI AL+EQMQ RETAE SQTP+VK  DKG NVVQENQPQQQS S+ASLSVQQLQDMI NSIRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        P +TSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGN KQH+ HFVETCENAGSRGDQLVRQFVRSLKGNAF+WYTDL+PE I+SWEQLEKEFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
        TRRTVSMMELTNTKQ K EPVI+YINRWRALSLD KDRLTELS VEMCTQGMHWGLLYILQGIK RTFEELATRAHDMELSIA+  TKDFL+P+V+K+KK
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMVSYT
        E K AEK+VKST+KESMV  T
Subjt:  ETKSAEKVVKSTAKESMVSYT

TrEMBL top hitse value%identityAlignment
A0A5A7SU65 Ty3-gypsy retrotransposon protein1.2e-13679.56Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        +S MMVD+TA+A MA+MERK+NFLMKVVEER HEI AL++QM+  ETAE SQTP+VKATDK  NVVQENQPQQQSVS+ASLSVQQLQDMI NSIRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        PP+TSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGN KQHI HFVETCENAGSRGDQLVRQFVRSLKGNAF+WYTDL+PEVIDSWEQLE EFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
        TR  +SMMELTNTKQ K EPVI+YINRWRALSLD KD+LTELS VEMCTQGMHW LLYILQGIK RTFEELATRAHD+ELSIA    KDFL+ + R +K 
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMV
        E    +K+  +   ESM+
Subjt:  ETKSAEKVVKSTAKESMV

A0A5A7SXL9 Ty3-gypsy retrotransposon protein6.1e-13678.62Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        +S MMVD+TA+A + EMERK+NFLMKV+EER HEI AL++QM+  ET E SQTP+VKATDKG NVVQENQPQQQSVS+ASLSVQQLQDMIANSIRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        PP+TSFMYSK YTKRIDNLRMPLGYQPPKFQQFDG+GN KQHI HFVETCENAGSRGDQLV+QFVRSLKGNAF+WYTDL+PEVID+WEQLE EFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
        TRR VSMMELTNTKQ K EPVI+YINRWRALSLD KD+LTELS VEMCTQGMHW LLYILQGIK RTFEELATRAHDMELSIA +  KDFL+ + R ++ 
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMV
        E    +K+  +   ESM+
Subjt:  ETKSAEKVVKSTAKESMV

A0A5A7T9K9 Ty3-gypsy retrotransposon protein1.6e-14786.16Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        MS MM DITA+AAMAEMERK+NFLMK  EER HEITALREQM+ RET E SQTP+VKATDKG NVVQENQPQQQSVS+ASLSVQQLQDMI N IRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        PP+T+FMYSKPYTKRID+LRMPLGYQPPKFQQFDGKGN KQHI HFVETCENAGSRGDQLVRQFVRSLKGN F+WYTDL+PEVIDSWEQLEKEFLNR  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
         RRTVSMMELTNTKQ K EPVI+YINRWRALSLD KDRLTELS VEMCTQGMHWGLLYILQGIK  TFEELATRAHDMELSIA+  TKDF +P+VRK+KK
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMV
        ETKSAEKVVKS+ KESMV
Subjt:  ETKSAEKVVKSTAKESMV

A0A5A7V8K5 Ty3-gypsy retrotransposon protein3.0e-14385.71Show/hide
Query:  MAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRGPPETSFMYSKPYT
        MAEMERK+NFLM VVEER HEITALR+QM+ RETAE SQTP+VKATDKG NVVQENQPQQQSVS+ASLSVQQLQDMIANSIRAQYRG P+T+FMYSK YT
Subjt:  MAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRGPPETSFMYSKPYT

Query:  KRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCSTRRTVSMMELTNT
        KRIDNL MPLGYQPPKFQQFDGKGN KQHI HFVETCENAGSRGDQLVRQFVRSLKGNAF+WYTDL+P+VIDSWEQLEKEFLNR  STRRT+SMMELTNT
Subjt:  KRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCSTRRTVSMMELTNT

Query:  KQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKKETKSAEKVVKSTA
        KQWK EPVI+YINRWRALSLD KDRLT+LS VEMCTQGMHWGLLYILQGIK RTFEELATRAHDMELSIA+  TKDFL+ +V+K+KKE  S EKVVKS+ 
Subjt:  KQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKKETKSAEKVVKSTA

Query:  KESMVSYT
        KESMV  T
Subjt:  KESMVSYT

A0A5D3CD35 Ty3-gypsy retrotransposon protein3.0e-14383.49Show/hide
Query:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG
        M  MM DITA+AAMA+ME+K+NFLMK +EE  HEITALREQM+ RETAE SQTP+VKATDKG NVVQ+NQPQQQSVSIASLSVQ+LQDMIANSIRAQY G
Subjt:  MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRG

Query:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS
        PP+T+FMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGN KQHI HFVETC NAGSRGDQLVRQFVRSLKGNAF+WYT+L+ EVID+WEQLEKEFL+R  S
Subjt:  PPETSFMYSKPYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCS

Query:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK
         RRTVSMMELTNTKQ K EPVI+YINRWRALSLD KDRLTELS+VEMCTQGMHW LLYILQGIK RTFEE ATRAHDMELSIA+  TKDF +P+VRK+KK
Subjt:  TRRTVSMMELTNTKQWKEEPVINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAE-TKDFLIPKVRKNKK

Query:  ETKSAEKVVKSTAKESMVSYT
        ETKS EKVVKST K SMV  T
Subjt:  ETKSAEKVVKSTAKESMVSYT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCCATGATGGTTGATATAACAGCTAAGGCTGCCATGGCAGAGATGGAGAGGAAAGTTAACTTCCTGATGAAGGTTGTGGAGGAACGATATCATGAAATCACAGC
TTTAAGAGAGCAGATGCAAATTCGTGAAACTGCTGAGTTAAGTCAAACTCCTATTGTTAAAGCTACTGATAAAGGGAATAATGTGGTGCAGGAAAACCAACCACAACAAC
AATCAGTTTCTATTGCTTCTCTCTCAGTTCAGCAGCTACAGGATATGATCGCGAATTCCATTAGAGCTCAGTATAGAGGACCTCCGGAAACTTCTTTCATGTACTCTAAG
CCGTACACCAAGAGAATCGACAACTTGAGAATGCCACTTGGGTACCAACCTCCAAAATTCCAACAATTCGATGGAAAGGGTAACCTAAAGCAGCATATCACCCACTTTGT
TGAAACATGTGAAAATGCAGGATCAAGAGGAGACCAGCTTGTCAGGCAATTCGTTCGAAGCTTGAAAGGAAATGCTTTTAAGTGGTACACCGATCTGCAACCAGAAGTCA
TTGACAGTTGGGAACAGTTAGAAAAGGAGTTCCTCAACCGTTTATGTAGCACCAGACGTACCGTGAGCATGATGGAGCTTACAAATACCAAACAGTGGAAGGAAGAGCCA
GTCATCAATTACATAAACCGATGGAGAGCTCTAAGTCTTGACTACAAAGATCGACTCACCGAACTGTCAACTGTAGAAATGTGCACCCAAGGCATGCATTGGGGACTCCT
CTACATTCTACAAGGAATAAAGCGACGCACATTTGAAGAATTAGCAACTCGCGCTCATGATATGGAATTGAGCATCGCCGCAGAAACTAAGGATTTTCTTATTCCTAAAG
TAAGAAAAAATAAGAAGGAGACGAAGAGTGCTGAAAAGGTAGTGAAGAGCACTGCGAAGGAATCTATGGTCTCGTACACCAAGAGAATCGATAACTTGAGAATGCCACTT
GGGTACCAACCTCCAAAATTCCAGAAATCCTATGGAAAGGGCAACCTAAAGCAACATATCGCCCAATTCGTCGAAACATGTGAAAATGCAGAATCAAGAGGAGACCAACT
TGTCAGGCAATACATTCGAAGCTTGAAAGGAAATGCCTTCGAGTGGTACACCGATCTGGAGCCGAAAGTCATTTACAACTGGGAACAGTTAGAAAAGGTGTTCCTCAACC
GTTTCTATAGCACTAGACGTACCCTAAGCATCACGGAGCTTACAAACACCAAGCAGCGGAAGGGAGAGTCAGTCATCGATTACATAAACCGATGGAGAGCTCTAAGTCTA
GATTGCAAAAACAAGCTCACAGAACTGTCTGCAGTAGAAATGTGCACCCAAGGTATGCACTGGGAACTTCTCTATATTTTACAGGGAATAAAACATCGCACGTTTGAATA
A
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCCATGATGGTTGATATAACAGCTAAGGCTGCCATGGCAGAGATGGAGAGGAAAGTTAACTTCCTGATGAAGGTTGTGGAGGAACGATATCATGAAATCACAGC
TTTAAGAGAGCAGATGCAAATTCGTGAAACTGCTGAGTTAAGTCAAACTCCTATTGTTAAAGCTACTGATAAAGGGAATAATGTGGTGCAGGAAAACCAACCACAACAAC
AATCAGTTTCTATTGCTTCTCTCTCAGTTCAGCAGCTACAGGATATGATCGCGAATTCCATTAGAGCTCAGTATAGAGGACCTCCGGAAACTTCTTTCATGTACTCTAAG
CCGTACACCAAGAGAATCGACAACTTGAGAATGCCACTTGGGTACCAACCTCCAAAATTCCAACAATTCGATGGAAAGGGTAACCTAAAGCAGCATATCACCCACTTTGT
TGAAACATGTGAAAATGCAGGATCAAGAGGAGACCAGCTTGTCAGGCAATTCGTTCGAAGCTTGAAAGGAAATGCTTTTAAGTGGTACACCGATCTGCAACCAGAAGTCA
TTGACAGTTGGGAACAGTTAGAAAAGGAGTTCCTCAACCGTTTATGTAGCACCAGACGTACCGTGAGCATGATGGAGCTTACAAATACCAAACAGTGGAAGGAAGAGCCA
GTCATCAATTACATAAACCGATGGAGAGCTCTAAGTCTTGACTACAAAGATCGACTCACCGAACTGTCAACTGTAGAAATGTGCACCCAAGGCATGCATTGGGGACTCCT
CTACATTCTACAAGGAATAAAGCGACGCACATTTGAAGAATTAGCAACTCGCGCTCATGATATGGAATTGAGCATCGCCGCAGAAACTAAGGATTTTCTTATTCCTAAAG
TAAGAAAAAATAAGAAGGAGACGAAGAGTGCTGAAAAGGTAGTGAAGAGCACTGCGAAGGAATCTATGGTCTCGTACACCAAGAGAATCGATAACTTGAGAATGCCACTT
GGGTACCAACCTCCAAAATTCCAGAAATCCTATGGAAAGGGCAACCTAAAGCAACATATCGCCCAATTCGTCGAAACATGTGAAAATGCAGAATCAAGAGGAGACCAACT
TGTCAGGCAATACATTCGAAGCTTGAAAGGAAATGCCTTCGAGTGGTACACCGATCTGGAGCCGAAAGTCATTTACAACTGGGAACAGTTAGAAAAGGTGTTCCTCAACC
GTTTCTATAGCACTAGACGTACCCTAAGCATCACGGAGCTTACAAACACCAAGCAGCGGAAGGGAGAGTCAGTCATCGATTACATAAACCGATGGAGAGCTCTAAGTCTA
GATTGCAAAAACAAGCTCACAGAACTGTCTGCAGTAGAAATGTGCACCCAAGGTATGCACTGGGAACTTCTCTATATTTTACAGGGAATAAAACATCGCACGTTTGAATA
A
Protein sequenceShow/hide protein sequence
MSAMMVDITAKAAMAEMERKVNFLMKVVEERYHEITALREQMQIRETAELSQTPIVKATDKGNNVVQENQPQQQSVSIASLSVQQLQDMIANSIRAQYRGPPETSFMYSK
PYTKRIDNLRMPLGYQPPKFQQFDGKGNLKQHITHFVETCENAGSRGDQLVRQFVRSLKGNAFKWYTDLQPEVIDSWEQLEKEFLNRLCSTRRTVSMMELTNTKQWKEEP
VINYINRWRALSLDYKDRLTELSTVEMCTQGMHWGLLYILQGIKRRTFEELATRAHDMELSIAAETKDFLIPKVRKNKKETKSAEKVVKSTAKESMVSYTKRIDNLRMPL
GYQPPKFQKSYGKGNLKQHIAQFVETCENAESRGDQLVRQYIRSLKGNAFEWYTDLEPKVIYNWEQLEKVFLNRFYSTRRTLSITELTNTKQRKGESVIDYINRWRALSL
DCKNKLTELSAVEMCTQGMHWELLYILQGIKHRTFE