; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc02G06595 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc02G06595
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationClcChr02:6432233..6434690
RNA-Seq ExpressionClc02G06595
SyntenyClc02G06595
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032830.1 uncharacterized protein E6C27_scaffold708G00950 [Cucumis melo var. makuwa]5.6e-4038.41Show/hide
Query:  LHPRYPRTLVLSIVRSVATEEGRSSSMERVQVEGPVTQRKRQD------------------DARLP-----------------------AWKSTEITVVT
        + PR  R  +  +   ++ +EG +S +E+V +EGPVT+ +++                   D RL                          ++ EIT V 
Subjt:  LHPRYPRTLVLSIVRSVATEEGRSSSMERVQVEGPVTQRKRQD------------------DARLP-----------------------AWKSTEITVVT

Query:  KEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK-------VHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFL
        KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K       +     EC++ H  +   STST  + +  T  +KVPKP  Y G R+ TVV+NFL
Subjt:  KEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK-------VHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFL

Query:  FGLEQYFEALGTSSMMALRLQMLLTSYVRQPNYGGVESTLSVSWTDATFERGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDW
        FGLE+YF ALG     A R+    T ++R           +  W   + E      KLRRLR + SI +Y+K+FT LMLEI  L +K+A F F+DGLKDW
Subjt:  FGLEQYFEALGTSSMMALRLQMLLTSYVRQPNYGGVESTLSVSWTDATFERGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDW

Query:  ARIELDRQNVQTLDDAIAAVEMLTDFSA
        A+IELDR+NVQTL+DAIAA E L D+SA
Subjt:  ARIELDRQNVQTLDDAIAAVEMLTDFSA

KAA0035960.1 uncharacterized protein E6C27_scaffold56G001640 [Cucumis melo var. makuwa]8.6e-4946.83Show/hide
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK+LG +  KE+  L  EV  LRKFVE ELH LR  V   R EC + H++NG  STST+ ++   T  +KVPKP  Y GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP
         LSDKDA F+FRDGLKDWA+IELDR+NV+TLDDAIAA + L D   K K T   EGE   FE  + + +       +E+ G+ P
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP

KAA0035961.1 uncharacterized protein E6C27_scaffold56G001660 [Cucumis melo var. makuwa]6.8e-4648.64Show/hide
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK LG +  KE+  +  EV  LRKFVE ELH LR  V   R E  + H++NG  STST+ ++   T  +KVPKP TY GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE
         LSDKDA F+FRD LKDWA+IELDR+NV+TLDDAIAA   L D  +K K     EGE
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE

KAA0042140.1 uncharacterized protein E6C27_scaffold67G006290 [Cucumis melo var. makuwa]1.9e-4039.01Show/hide
Query:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR  T V  ++ S+ + EEG +S +E+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDR+NVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK

TYK18079.1 uncharacterized protein E5676_scaffold306G004150 [Cucumis melo var. makuwa]1.9e-4039.01Show/hide
Query:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR  T V  ++ S+ + EEG +S +E+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDR+NVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK

TrEMBL top hitse value%identityAlignment
A0A5A7SPK5 Uncharacterized protein2.7e-4038.41Show/hide
Query:  LHPRYPRTLVLSIVRSVATEEGRSSSMERVQVEGPVTQRKRQD------------------DARLP-----------------------AWKSTEITVVT
        + PR  R  +  +   ++ +EG +S +E+V +EGPVT+ +++                   D RL                          ++ EIT V 
Subjt:  LHPRYPRTLVLSIVRSVATEEGRSSSMERVQVEGPVTQRKRQD------------------DARLP-----------------------AWKSTEITVVT

Query:  KEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK-------VHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFL
        KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K       +     EC++ H  +   STST  + +  T  +KVPKP  Y G R+ TVV+NFL
Subjt:  KEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK-------VHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFL

Query:  FGLEQYFEALGTSSMMALRLQMLLTSYVRQPNYGGVESTLSVSWTDATFERGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDW
        FGLE+YF ALG     A R+    T ++R           +  W   + E      KLRRLR + SI +Y+K+FT LMLEI  L +K+A F F+DGLKDW
Subjt:  FGLEQYFEALGTSSMMALRLQMLLTSYVRQPNYGGVESTLSVSWTDATFERGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDW

Query:  ARIELDRQNVQTLDDAIAAVEMLTDFSA
        A+IELDR+NVQTL+DAIAA E L D+SA
Subjt:  ARIELDRQNVQTLDDAIAAVEMLTDFSA

A0A5A7SY30 Retrotrans_gag domain-containing protein4.2e-4946.83Show/hide
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK+LG +  KE+  L  EV  LRKFVE ELH LR  V   R EC + H++NG  STST+ ++   T  +KVPKP  Y GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP
         LSDKDA F+FRDGLKDWA+IELDR+NV+TLDDAIAA + L D   K K T   EGE   FE  + + +       +E+ G+ P
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLP

A0A5A7T2W8 Retrotrans_gag domain-containing protein3.3e-4648.64Show/hide
Query:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF
        + EI+   KEMIK LG +  KE+  +  EV  LRKFVE ELH LR  V   R E  + H++NG  STST+ ++   T  +KVPKP TY GTR+ T+VENF
Subjt:  STEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENF

Query:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE
        LFGLEQY++ALG     + +A     L  S    + R+      + T+  +W     E                KLRRLRQ  SIP+YIK+FT LMLEIE
Subjt:  LFGLEQYFEALG---TSSMMALRLQMLLTS----YVRQPNYGGVESTLSVSWTDATFERG----------NKSSKLRRLRQSSSIPEYIKKFTILMLEIE

Query:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE
         LSDKDA F+FRD LKDWA+IELDR+NV+TLDDAIAA   L D  +K K     EGE
Subjt:  GLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGKTTNKYEGE

A0A5A7TFP3 Retrotrans_gag domain-containing protein9.3e-4139.01Show/hide
Query:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR  T V  ++ S+ + EEG +S +E+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDR+NVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK

A0A5D3D3V4 Retrotrans_gag domain-containing protein9.3e-4139.01Show/hide
Query:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------
        ++N   GGI   L     PR  T V  ++ S+ + EEG +S +E+V +EGPVT+ RK+Q                  D RL                   
Subjt:  NTNHVTGGIRALLLHPRYPR--TLVLSIVRSV-ATEEGRSSSMERVQVEGPVTQ-RKRQD-----------------DARLP------------------

Query:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP
               ++ EIT V KEMI+++G T  KE+  L   V  L+ FVEGELH+L  K     TR      EC + H  +   STST  + +  T  +KVPKP
Subjt:  -----AWKSTEITVVTKEMIKELGWTISKEVSTLFDEVAKLRKFVEGELHELRGK--VHNTR-----KECQANHSANGGTSTSTTSSIAHATCGVKVPKP

Query:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI
          Y G R+ TVV+NFLFGLE+YF ALG     A R+    T ++R       +  Y         SW     E                KLR LR   SI
Subjt:  YTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQMLLTSYVR-------QPNYGGVESTLSVSWTDATFE----------RGNKSSKLRRLRQSSSI

Query:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK
         +Y+K+FT LMLEI  L +K+A F F+DGLKDWA+IELDR+NVQTLDDAIAA E L D+SA+ K
Subjt:  PEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCCAACAAATTGCGCCACAAACGCCTATAACTCGAGGAAAGGCCAAGACGTCCCCAAGCATAGCACACGCAAGTGCCAAGGCACCACACATGGTTGCCAAGGGTGC
CAGGCCACGCCACTGTCTCAAGACACATGGTCGAGTGGCTGCGCAGCCAGCCACAGATGTGCAGCATACATGCGCTCTTGCCAAGATAGGCAGCGCCGTGTGCCCACCCG
CACACGGATGCCAATCATGTGTGCCAAGCACACAACCAAGGCACGCGCTTGTTGATAGGCCTATGCGTGCACCCCGCACATGTACGAGGTGTGCGCCCATTGACTGCTCT
GCAAGCGCACCCCTCGCACAACCTATGCACACGTCTGTGCCACAGTGGGCACAACAACCTATGCGCGTCTGGAAGGATACGCAGCCTGCTAGGCAGTGCCAGCGCCTATG
TCCTGACACTAGAAGGGGCCTGGCTACTCGCCCACGCCCCACACATCCCCGGAAGCCTCTAGACAAACCTCGAAGGCGTCAGACTACTCGAGAAGAGGCTCGATGGTGTT
GGTGCGTCTCAGAAGGAATGTCAGCTAGGCACCCCTATAAATACCCGATGGGGCATCATTTGTACACACATCCGAGAATCAAGCAATCCAAGCGACATTCTTCAAGCTTG
AGCGATTGTCTCCGGTTGGAAGATCCTAGAGCCTCAAAAGGCTTACTTGGTGTGTCTTTTGACAGTAAGATCACCCAAGTGTCGCGCGGTTGGTTAGGTCGAAACACTAA
CCACGTGACAGGTGGTATCAGAGCCTTACTCCTCCATCCTAGATACCCACGCACACTTGTATTGTCGATAGTGAGATCGGTGGCCACTGAGGAAGGACGAAGCTCATCGA
TGGAGCGTGTGCAAGTTGAAGGACCGGTGACCCAAAGGAAACGGCAAGATGATGCCCGTCTTCCAGCTTGGAAAAGCACCGAGATCACGGTAGTTACCAAGGAGATGATC
AAGGAGCTGGGATGGACAATAAGTAAAGAGGTAAGTACCCTCTTCGATGAAGTAGCCAAACTAAGGAAGTTCGTGGAGGGGGAGCTTCACGAGCTTCGTGGGAAAGTCCA
CAACACACGTAAGGAGTGTCAGGCAAACCATAGTGCTAATGGAGGCACATCCACCAGCACAACATCATCTATTGCCCATGCCACCTGTGGGGTAAAGGTGCCAAAGCCCT
ATACTTATGAAGGTACAAGAAGTGTCACGGTTGTGGAGAACTTCTTGTTCGGCCTAGAGCAATACTTTGAGGCCCTAGGCACGTCGTCGATGATGGCGCTAAGATTGCAA
ATGCTCCTAACTTCCTACGTGAGGCAGCCCAACTATGGTGGCGTAGAAAGCACGCTGAGCGTGAGCTGGACAGATGCAACATTCGAACGTGGGAATAAGTCAAGCAAATT
GAGGCGATTAAGGCAAAGCAGTAGCATCCCCGAGTACATAAAAAAATTCACAATCCTCATGCTGGAAATTGAGGGTCTATCCGACAAAGATGCATTCTTTTATTTCCGCG
ATGGTCTTAAAGATTGGGCGAGGATTGAGCTCGATAGGCAGAATGTGCAGACGCTTGATGATGCCATAGCTGCTGTTGAGATGCTTACTGACTTCTCGGCCAAGGGAAAG
ACGACCAACAAATATGAAGGAGAAGTGTCGAAGTTCGAGGAATCTGATGCGCATATAAGATTGAAGGGAGCCATAGGAATGGAAGAAAGAATGGGAAGGCTGCCGACAAG
AACAGAGGGAATGTCCAAAGAGGAAGTCGCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCCCAACAAATTGCGCCACAAACGCCTATAACTCGAGGAAAGGCCAAGACGTCCCCAAGCATAGCACACGCAAGTGCCAAGGCACCACACATGGTTGCCAAGGGTGC
CAGGCCACGCCACTGTCTCAAGACACATGGTCGAGTGGCTGCGCAGCCAGCCACAGATGTGCAGCATACATGCGCTCTTGCCAAGATAGGCAGCGCCGTGTGCCCACCCG
CACACGGATGCCAATCATGTGTGCCAAGCACACAACCAAGGCACGCGCTTGTTGATAGGCCTATGCGTGCACCCCGCACATGTACGAGGTGTGCGCCCATTGACTGCTCT
GCAAGCGCACCCCTCGCACAACCTATGCACACGTCTGTGCCACAGTGGGCACAACAACCTATGCGCGTCTGGAAGGATACGCAGCCTGCTAGGCAGTGCCAGCGCCTATG
TCCTGACACTAGAAGGGGCCTGGCTACTCGCCCACGCCCCACACATCCCCGGAAGCCTCTAGACAAACCTCGAAGGCGTCAGACTACTCGAGAAGAGGCTCGATGGTGTT
GGTGCGTCTCAGAAGGAATGTCAGCTAGGCACCCCTATAAATACCCGATGGGGCATCATTTGTACACACATCCGAGAATCAAGCAATCCAAGCGACATTCTTCAAGCTTG
AGCGATTGTCTCCGGTTGGAAGATCCTAGAGCCTCAAAAGGCTTACTTGGTGTGTCTTTTGACAGTAAGATCACCCAAGTGTCGCGCGGTTGGTTAGGTCGAAACACTAA
CCACGTGACAGGTGGTATCAGAGCCTTACTCCTCCATCCTAGATACCCACGCACACTTGTATTGTCGATAGTGAGATCGGTGGCCACTGAGGAAGGACGAAGCTCATCGA
TGGAGCGTGTGCAAGTTGAAGGACCGGTGACCCAAAGGAAACGGCAAGATGATGCCCGTCTTCCAGCTTGGAAAAGCACCGAGATCACGGTAGTTACCAAGGAGATGATC
AAGGAGCTGGGATGGACAATAAGTAAAGAGGTAAGTACCCTCTTCGATGAAGTAGCCAAACTAAGGAAGTTCGTGGAGGGGGAGCTTCACGAGCTTCGTGGGAAAGTCCA
CAACACACGTAAGGAGTGTCAGGCAAACCATAGTGCTAATGGAGGCACATCCACCAGCACAACATCATCTATTGCCCATGCCACCTGTGGGGTAAAGGTGCCAAAGCCCT
ATACTTATGAAGGTACAAGAAGTGTCACGGTTGTGGAGAACTTCTTGTTCGGCCTAGAGCAATACTTTGAGGCCCTAGGCACGTCGTCGATGATGGCGCTAAGATTGCAA
ATGCTCCTAACTTCCTACGTGAGGCAGCCCAACTATGGTGGCGTAGAAAGCACGCTGAGCGTGAGCTGGACAGATGCAACATTCGAACGTGGGAATAAGTCAAGCAAATT
GAGGCGATTAAGGCAAAGCAGTAGCATCCCCGAGTACATAAAAAAATTCACAATCCTCATGCTGGAAATTGAGGGTCTATCCGACAAAGATGCATTCTTTTATTTCCGCG
ATGGTCTTAAAGATTGGGCGAGGATTGAGCTCGATAGGCAGAATGTGCAGACGCTTGATGATGCCATAGCTGCTGTTGAGATGCTTACTGACTTCTCGGCCAAGGGAAAG
ACGACCAACAAATATGAAGGAGAAGTGTCGAAGTTCGAGGAATCTGATGCGCATATAAGATTGAAGGGAGCCATAGGAATGGAAGAAAGAATGGGAAGGCTGCCGACAAG
AACAGAGGGAATGTCCAAAGAGGAAGTCGCTTAA
Protein sequenceShow/hide protein sequence
MAQQIAPQTPITRGKAKTSPSIAHASAKAPHMVAKGARPRHCLKTHGRVAAQPATDVQHTCALAKIGSAVCPPAHGCQSCVPSTQPRHALVDRPMRAPRTCTRCAPIDCS
ASAPLAQPMHTSVPQWAQQPMRVWKDTQPARQCQRLCPDTRRGLATRPRPTHPRKPLDKPRRRQTTREEARWCWCVSEGMSARHPYKYPMGHHLYTHPRIKQSKRHSSSL
SDCLRLEDPRASKGLLGVSFDSKITQVSRGWLGRNTNHVTGGIRALLLHPRYPRTLVLSIVRSVATEEGRSSSMERVQVEGPVTQRKRQDDARLPAWKSTEITVVTKEMI
KELGWTISKEVSTLFDEVAKLRKFVEGELHELRGKVHNTRKECQANHSANGGTSTSTTSSIAHATCGVKVPKPYTYEGTRSVTVVENFLFGLEQYFEALGTSSMMALRLQ
MLLTSYVRQPNYGGVESTLSVSWTDATFERGNKSSKLRRLRQSSSIPEYIKKFTILMLEIEGLSDKDAFFYFRDGLKDWARIELDRQNVQTLDDAIAAVEMLTDFSAKGK
TTNKYEGEVSKFEESDAHIRLKGAIGMEERMGRLPTRTEGMSKEEVA