CuGenDBv2

ClCG02G006140 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG02G006140
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Retrotrans_gag domain-containing protein
Genome location: CG_Chr02:6817791..6820248
RNA-Seq Expression: ClCG02G006140
Synteny: ClCG02G006140
Gene Ontology terms: NA
InterPro domains: NA
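
The genome location above follows a `sequence:start..end` pattern. A minimal Python sketch for splitting it into parts is shown here; the helper name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which the record itself does not state.

```python
import re

def parse_location(location):
    """Split 'seqid:start..end' into (seqid, start, end) with integer coordinates."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])

seqid, start, end = parse_location("CG_Chr02:6817791..6820248")
# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_length = end - start + 1
print(seqid, start, end, span_length)
```

Under that assumption the gene spans 2458 bases on CG_Chr02.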


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAA0035960.1 uncharacterized protein E6C27_scaffold56G001640 [Cucumis melo var. makuwa] | e value: 3.0e-49 | % identity: 47.18
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK+LG +  KE+  L  EV  LRKFVE ELH LR  V   R EC + H++NG  STST+ ++   T  +KVPKP  Y GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP
         LSDKDA F+FRDGLKDWA+IELDRRNV+TLDDAIAA + L D   K K T   EGE   FE  + + +       +E+ G+ P
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP

KAA0035961.1 uncharacterized protein E6C27_scaffold56G001660 [Cucumis melo var. makuwa] | e value: 2.3e-46 | % identity: 49.03
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK LG +  KE+  +  EV  LRKFVE ELH LR  V   R E  + H++NG  STST+ ++   T  +KVPKP TY GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE
         LSDKDA F+FRD LKDWA+IELDRRNV+TLDDAIAA   L D  +K K     EGE
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE

KAA0042140.1 uncharacterized protein E6C27_scaffold67G006290 [Cucumis melo var. makuwa] | e value: 8.6e-41 | % identity: 38.74
Query:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR+   +   +   ++ EEG +S VE+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDRRNVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK

KAA0065760.1 polyprotein [Cucumis melo var. makuwa] | e value: 1.5e-40 | % identity: 39.19
Query:  VATEEGRSSSVERVQVEGPVTQRKRQD------------------DARLP-----------------------TWKSTEITVVTKEMIKELGWTISKEVS
        ++ EEG +S VE+V +EGPVT+ +++                   D RL                          ++ EIT V KEMI+++G T  +E+ 
Subjt:  VATEEGRSSSVERVQVEGPVTQRKRQD------------------DARLP-----------------------TWKSTEITVVTKEMIKELGWTISKEVS

Query:  TLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFLFGLEQYFEALGTSSMM
         L   V  L+ FVEGELH L  K     TR      EC++ H  +   STST  + +  T  +KVPKP  Y G R+ TVV+NFLFGLE+YF ALG     
Subjt:  TLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFLFGLEQYFEALGTSSMM

Query:  ALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKD
        A R+    T ++R       +  Y         SW     E                KLRRLR + SI EY+K+FT LMLEI  L +K+A F F+DGLKD
Subjt:  ALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKD

Query:  WARIELDRRNVQTLDDAIAAVEMLTDFSAKGK----TTNKYEGEVSK
        WA+IELDRRNVQTLDDAIAA E L D+SA+ K       KY G+  K
Subjt:  WARIELDRRNVQTLDDAIAAVEMLTDFSAKGK----TTNKYEGEVSK

TYK18079.1 uncharacterized protein E5676_scaffold306G004150 [Cucumis melo var. makuwa] | e value: 8.6e-41 | % identity: 38.74
Query:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR+   +   +   ++ EEG +S VE+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDRRNVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK

TrEMBL top hits (columns: e value, % identity, alignment)
A0A5A7SY30 Retrotrans_gag domain-containing protein | e value: 1.4e-49 | % identity: 47.18
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK+LG +  KE+  L  EV  LRKFVE ELH LR  V   R EC + H++NG  STST+ ++   T  +KVPKP  Y GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP
         LSDKDA F+FRDGLKDWA+IELDRRNV+TLDDAIAA + L D   K K T   EGE   FE  + + +       +E+ G+ P
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP

A0A5A7T2W8 Retrotrans_gag domain-containing protein | e value: 1.1e-46 | % identity: 49.03
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK LG +  KE+  +  EV  LRKFVE ELH LR  V   R E  + H++NG  STST+ ++   T  +KVPKP TY GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE
         LSDKDA F+FRD LKDWA+IELDRRNV+TLDDAIAA   L D  +K K     EGE
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE

A0A5A7TFP3 Retrotrans_gag domain-containing protein | e value: 4.2e-41 | % identity: 38.74
Query:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR+   +   +   ++ EEG +S VE+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDRRNVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK

A0A5A7VEX8 Polyprotein | e value: 7.1e-41 | % identity: 39.19
Query:  VATEEGRSSSVERVQVEGPVTQRKRQD------------------DARLP-----------------------TWKSTEITVVTKEMIKELGWTISKEVS
        ++ EEG +S VE+V +EGPVT+ +++                   D RL                          ++ EIT V KEMI+++G T  +E+ 
Subjt:  VATEEGRSSSVERVQVEGPVTQRKRQD------------------DARLP-----------------------TWKSTEITVVTKEMIKELGWTISKEVS

Query:  TLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFLFGLEQYFEALGTSSMM
         L   V  L+ FVEGELH L  K     TR      EC++ H  +   STST  + +  T  +KVPKP  Y G R+ TVV+NFLFGLE+YF ALG     
Subjt:  TLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFLFGLEQYFEALGTSSMM

Query:  ALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKD
        A R+    T ++R       +  Y         SW     E                KLRRLR + SI EY+K+FT LMLEI  L +K+A F F+DGLKD
Subjt:  ALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKD

Query:  WARIELDRRNVQTLDDAIAAVEMLTDFSAKGK----TTNKYEGEVSK
        WA+IELDRRNVQTLDDAIAA E L D+SA+ K       KY G+  K
Subjt:  WARIELDRRNVQTLDDAIAAVEMLTDFSAKGK----TTNKYEGEVSK

A0A5D3D3V4 Retrotrans_gag domain-containing protein | e value: 4.2e-41 | % identity: 38.74
Query:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR+   +   +   ++ EEG +S VE+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALL---LHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----TWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDRRNVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCCCAACAAATTGCGCCACAAACGCCTATAACTCGAGGAAAGGCCAAGACGTCCCCAAGCATAGCACACGCAAGTGCCAAGGCACCACACATGGTTGCCAAGGGTGC
CAGGCCACGCCACTGTCTCAAGACACATGGTCGAGTGGCTGCGCAGCCAGCCACAGATGTGCAGCATACATGCGCTCCTGCCAAGATAGGCAGCGCCGTGTGCCCACCCG
CACACGGATGCCAATCATGTGTGCCAAGCACACAACCAAGGCACGCGCTTGTTGATAGGCCTATGCGTGCACCCCGCACATGTACGAGGTGTGCGCCCATTGACTGCTCT
GCAAGCGCACCCCTCGCACAACCTATGCACACGTCTGTGCCACAGTGGGCACAACAACCTATGCGCGTCTGGAAGGATACGCAGCCTGCTAGGCAGTGCCAGCGCCTATG
TCCTGACACTAGAAGGGGCCTGGCTACTCGCCCACGCCCCACACATCCCCGGAAGCCTCTAGACAAACCTCGAAGGCGTCAGACTACTCGAGAAGAGGCTCGATGGTGTT
GGTGCGTCTCAGAAGGAATGTCAGCTAGGCACCCCTATAAATACCCGATGGGGCATCATTTGTACACACATCCGAGAATCAAGCAATCCAAGCGACATTCTTCAAGCTTG
AGCGATTGTCTCCGGTTGGAAGATCCTAGAGCCTCAAAAGGCTTACTTGGTGTGTCTTTTGACAGTAAGATCACCCAAGTGTCGCGCGGTTGGTTAGGTCGAAACACTAA
CCACGTGACAGGTGGTATCAGAGCCTTACTCCTCCATCCTAGATACCCACACACACTTGTATTGTCGATAGTGAGATCGGTGGCCACTGAGGAAGGACGAAGCTCATCGG
TGGAGCGTGTGCAAGTTGAAGGACCGGTGACCCAAAGGAAACGGCAAGATGATGCCCGTCTTCCAACTTGGAAAAGCACCGAGATCACGGTAGTTACCAAGGAGATGATC
AAGGAGCTGGGATGGACAATAAGTAAAGAGGTAAGTACCCTCTTCGATGAAGTAGCCAAACTAAGGAAGTTCGTGGAGGGGGAGCTTCACGAGCTTCGTGGGAAAGTCCA
CAACACACGTAAGGAGTGTCAGGCAAACCATAGTGCTAATGGAGGCACATCCACCAGCACAACATCATCTATTGCCCATGCCACCTGTGGGGTAAAGGTGCCAAAGCCCT
ATACTTATGAAGGTACAAGAAGTGTCACGGTTGTGGAGAACTTCTTGTTCGGCCTAGAGCAATACTTTGAGGCCCTAGGCACGTCGTCGATGATGGCGCTAAGATTGCAA
ATGCTCCTAACTTCCTACGTGAGGCAGCCCAACTATGGTGGCGTAGAAAGCACGCTGAGCGTGAGCTGGACAGATGCAACATTCGAACGTGGGAATAAGTCAAGCAAATT
GAGGCGATTAAGGCAAAGCAGTAGCATCCCCGAGTACATAAAAAAATTCACAATCCTCATGCTGGAAATTGAGGGTCTATCCGACAAAGATGCATTCTTTTATTTCCGCG
ATGGTCTTAAAGATTGGGCGAGGATTGAGCTCGATAGGCGGAATGTGCAGACGCTTGATGATGCCATAGCCGCTGTTGAGATGCTTACTGACTTCTCGGCCAAGGGAAAG
ACGACCAACAAATATGAAGGAGAAGTGTCGAAGTTCGAGGAATCTGATGCGCATATAAGATTGAAGGGAGCCATAGGAATGGAAGAAAGAATGGGAAGGCTGCCGACAAG
AACAGAGGGAATGTCCAAAGAGGAAGTCGCTTAA
mRNA sequence
ATGGCCCAACAAATTGCGCCACAAACGCCTATAACTCGAGGAAAGGCCAAGACGTCCCCAAGCATAGCACACGCAAGTGCCAAGGCACCACACATGGTTGCCAAGGGTGC
CAGGCCACGCCACTGTCTCAAGACACATGGTCGAGTGGCTGCGCAGCCAGCCACAGATGTGCAGCATACATGCGCTCCTGCCAAGATAGGCAGCGCCGTGTGCCCACCCG
CACACGGATGCCAATCATGTGTGCCAAGCACACAACCAAGGCACGCGCTTGTTGATAGGCCTATGCGTGCACCCCGCACATGTACGAGGTGTGCGCCCATTGACTGCTCT
GCAAGCGCACCCCTCGCACAACCTATGCACACGTCTGTGCCACAGTGGGCACAACAACCTATGCGCGTCTGGAAGGATACGCAGCCTGCTAGGCAGTGCCAGCGCCTATG
TCCTGACACTAGAAGGGGCCTGGCTACTCGCCCACGCCCCACACATCCCCGGAAGCCTCTAGACAAACCTCGAAGGCGTCAGACTACTCGAGAAGAGGCTCGATGGTGTT
GGTGCGTCTCAGAAGGAATGTCAGCTAGGCACCCCTATAAATACCCGATGGGGCATCATTTGTACACACATCCGAGAATCAAGCAATCCAAGCGACATTCTTCAAGCTTG
AGCGATTGTCTCCGGTTGGAAGATCCTAGAGCCTCAAAAGGCTTACTTGGTGTGTCTTTTGACAGTAAGATCACCCAAGTGTCGCGCGGTTGGTTAGGTCGAAACACTAA
CCACGTGACAGGTGGTATCAGAGCCTTACTCCTCCATCCTAGATACCCACACACACTTGTATTGTCGATAGTGAGATCGGTGGCCACTGAGGAAGGACGAAGCTCATCGG
TGGAGCGTGTGCAAGTTGAAGGACCGGTGACCCAAAGGAAACGGCAAGATGATGCCCGTCTTCCAACTTGGAAAAGCACCGAGATCACGGTAGTTACCAAGGAGATGATC
AAGGAGCTGGGATGGACAATAAGTAAAGAGGTAAGTACCCTCTTCGATGAAGTAGCCAAACTAAGGAAGTTCGTGGAGGGGGAGCTTCACGAGCTTCGTGGGAAAGTCCA
CAACACACGTAAGGAGTGTCAGGCAAACCATAGTGCTAATGGAGGCACATCCACCAGCACAACATCATCTATTGCCCATGCCACCTGTGGGGTAAAGGTGCCAAAGCCCT
ATACTTATGAAGGTACAAGAAGTGTCACGGTTGTGGAGAACTTCTTGTTCGGCCTAGAGCAATACTTTGAGGCCCTAGGCACGTCGTCGATGATGGCGCTAAGATTGCAA
ATGCTCCTAACTTCCTACGTGAGGCAGCCCAACTATGGTGGCGTAGAAAGCACGCTGAGCGTGAGCTGGACAGATGCAACATTCGAACGTGGGAATAAGTCAAGCAAATT
GAGGCGATTAAGGCAAAGCAGTAGCATCCCCGAGTACATAAAAAAATTCACAATCCTCATGCTGGAAATTGAGGGTCTATCCGACAAAGATGCATTCTTTTATTTCCGCG
ATGGTCTTAAAGATTGGGCGAGGATTGAGCTCGATAGGCGGAATGTGCAGACGCTTGATGATGCCATAGCCGCTGTTGAGATGCTTACTGACTTCTCGGCCAAGGGAAAG
ACGACCAACAAATATGAAGGAGAAGTGTCGAAGTTCGAGGAATCTGATGCGCATATAAGATTGAAGGGAGCCATAGGAATGGAAGAAAGAATGGGAAGGCTGCCGACAAG
AACAGAGGGAATGTCCAAAGAGGAAGTCGCTTAA
Protein sequence
MAQQIAPQTPITRGKAKTSPSIAHASAKAPHMVAKGARPRHCLKTHGRVAAQPATDVQHTCAPAKIGSAVCPPAHGCQSCVPSTQPRHALVDRPMRAPRTCTRCAPIDCS
ASAPLAQPMHTSVPQWAQQPMRVWKDTQPARQCQRLCPDTRRGLATRPRPTHPRKPLDKPRRRQTTREEARWCWCVSEGMSARHPYKYPMGHHLYTHPRIKQSKRHSSSL
SDCLRLEDPRASKGLLGVSFDSKITQVSRGWLGRNTNHVTGGIRALLLHPRYPHTLVLSIVRSVATEEGRSSSVERVQVEGPVTQRKRQDDARLPTWKSTEITVVTKEMI
KELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQ
MLLTSYVRQPNYGGVESTLSVSWTDATFERGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRRNVQTLDDAIAAVEMLTDFSAKGK
TTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLPTRTEGMSKEEVA
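
The CDS and protein sequences listed above can be cross-checked by translation. The sketch below translates only the first 30 bases of the CDS, assuming the standard genetic code (NCBI translation table 1) and reading frame 0; the helper name `translate` is hypothetical and not part of any CuGenDB tooling.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence in frame 0, ignoring any trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 bases of the CDS sequence listed above.
cds_start = "ATGGCCCAACAAATTGCGCCACAAACGCCT"
protein_start = translate(cds_start)
print(protein_start)
```

The result can be compared against the first ten residues of the protein sequence (MAQQIAPQTP). Translating the full CDS the same way would end in `*` for the terminal TAA stop codon.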