; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G011400 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G011400
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr10:9584349..9585319
RNA-Seq ExpressionCmoCh10G011400
SyntenyCmoCh10G011400
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.7e-10672.57Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA++K K +RGSSSKNK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]1.3e-9869.1Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDVLAKKH+ + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   K 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ
        +  EAN+A +K K  RGSSSK+K GPS     + +KKGKGK    T ++NK +K   KGKC+HC ENGH   NCPKYL +KKAEK  Q
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ

KAA0051437.1 gag/pol protein [Cucumis melo var. makuwa]1.8e-9766.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        + NSIVQLLAS+KLNGDNY  WKSNLNTILV+DDL+FVLTEECP  P SNANRT+R AYDRW+KANDKAR YILA++SDVLAKKH+ +   KEIM+SLK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLR+EAIKYIY   MKEGTSVREHVLDMM+HFN+AE N+  ID  +QVSFI+ SLPKSF  F TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISKK-LLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA +K+  LRGSSSK K+G S+  +  +KKKGK K    T ++NK +KA +K KC+HC +NGHW +NCPKYL +K+AEK  QG
Subjt:  QTGEANVAISKK-LLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

TYK11933.1 gag/pol protein [Cucumis melo var. makuwa]7.5e-9968.06Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DVLAKKH+ +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA +K K  RG SSK+K GPS     + +KKGKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+KKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

TYK31700.1 gag/pol protein [Cucumis melo var. makuwa]2.6e-9969.2Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        + +SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA ++DVLAKKH+ + TAKEIM++LK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVAI+K K  RGSSSK K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +NGHW RNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

TrEMBL top hitse value%identityAlignment
A0A5A7TWX1 Gag/pol protein6.2e-9969.1Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDVLAKKH+ + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   K 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ
        +  EAN+A +K K  RGSSSK+K GPS     + +KKGKGK    T ++NK +K   KGKC+HC ENGH   NCPKYL +KKAEK  Q
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ

A0A5A7UAN2 Gag/pol protein8.9e-9866.78Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        + NSIVQLLAS+KLNGDNY  WKSNLNTILV+DDL+FVLTEECP  P SNANRT+R AYDRW+KANDKAR YILA++SDVLAKKH+ +   KEIM+SLK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLR+EAIKYIY   MKEGTSVREHVLDMM+HFN+AE N+  ID  +QVSFI+ SLPKSF  F TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISKK-LLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA +K+  LRGSSSK K+G S+  +  +KKKGK K    T ++NK +KA +K KC+HC +NGHW +NCPKYL +K+AEK  QG
Subjt:  QTGEANVAISKK-LLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

A0A5D3CJ27 Gag/pol protein3.6e-9968.06Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DVLAKKH+ +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA +K K  RG SSK+K GPS     + +KKGKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+KKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

A0A5D3E7W3 Gag/pol protein1.2e-9969.2Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        + +SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA ++DVLAKKH+ + TAKEIM++LK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVAI+K K  RGSSSK K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +NGHW RNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

E2GK51 Gag/pol protein (Fragment)8.0e-10772.57Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA++K K +RGSSSKNK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.4e-0721.97Show/hide
Query:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLRH
        K NGDN ++TW+  +  +L+   L  VL  +        A        + W   +++A   I   +SD +        TA+ I   L+ ++   + + + 
Subjt:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLRH

Query:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANV--AI
           K +Y   M EGT+   H+                I+E+ +   ++ SLP S+    T ++  K    L  + + L   + +       G+A +    
Subjt:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANV--AI

Query:  SKKLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCP
         +   R S++  +SG           +GK KN+  +  +N         C++CN+ GH+KR+CP
Subjt:  SKKLLRGSSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCP

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTAGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CACTTCTTAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGAAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTAGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CACTTCTTAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCAAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGAAACTAG
Protein sequenceShow/hide protein sequence
MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLR
HEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSK
NKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQGN