; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh18G006060 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh18G006060
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr18:7347775..7348742
RNA-Seq ExpressionCmoCh18G006060
SyntenyCmoCh18G006060
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR036875 - Zinc finger, CCHC-type superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]1.4e-10572.57Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK TQG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]3.7e-9869.12Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDVLAKKH+ + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   K 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EAN+A +K K  RGSSS++K GPS     + +KKGKGK    T ++NK +K   KGKC+HC ENGH   NCPKYL +KKAEK
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEK

TYK01653.1 gag/pol protein [Cucumis melo var. makuwa]2.4e-9768.77Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M NSIVQLLASEKLN +NY  WKSNLNTILV+DDL+FVLTEECP  P SNANR ++ AYDRWIKAN K  VYILAS+SDVLAKKH+ + T KEI++SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGTSVREHVLDMM+HFN+AE N  VIDE  QVSFI+ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EANVA +K K  RGSSS++K GPS   S  MKKKG GK    T ++NKV+K  +KGKC+HC ENGHW +NCPKYLA+KK  +
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK

TYK11933.1 gag/pol protein [Cucumis melo var. makuwa]1.4e-9767.96Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DVLAKKH+ +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EANVA +K K  RG SS++K GPS     + +KKGKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+KKAEK
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK

TYK31700.1 gag/pol protein [Cucumis melo var. makuwa]1.7e-9869.2Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        + +SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA ++DVLAKKH+ + TAKEIM++LK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG
        +  EANVAI+K K  RGSSS+ K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +NGHW RNCPKYLAEKKAEK TQG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG

TrEMBL top hitse value%identityAlignment
A0A5A7TWX1 Gag/pol protein1.8e-9869.12Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDVLAKKH+ + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   K 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EAN+A +K K  RGSSS++K GPS     + +KKGKGK    T ++NK +K   KGKC+HC ENGH   NCPKYL +KKAEK
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEK

A0A5D3BPM3 Gag/pol protein1.2e-9768.77Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M NSIVQLLASEKLN +NY  WKSNLNTILV+DDL+FVLTEECP  P SNANR ++ AYDRWIKAN K  VYILAS+SDVLAKKH+ + T KEI++SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGTSVREHVLDMM+HFN+AE N  VIDE  QVSFI+ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EANVA +K K  RGSSS++K GPS   S  MKKKG GK    T ++NKV+K  +KGKC+HC ENGHW +NCPKYLA+KK  +
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK

A0A5D3CJ27 Gag/pol protein6.8e-9867.96Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DVLAKKH+ +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EANVA +K K  RG SS++K GPS     + +KKGKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+KKAEK
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK

A0A5D3E7W3 Gag/pol protein8.0e-9969.2Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        + +SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA ++DVLAKKH+ + TAKEIM++LK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG
        +  EANVAI+K K  RGSSS+ K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +NGHW RNCPKYLAEKKAEK TQG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG

E2GK51 Gag/pol protein (Fragment)6.8e-10672.57Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK TQG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEK-TQG

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-941.1e-0722.06Show/hide
Query:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLRH
        K NGDN ++TW+  +  +L+   L  VL  +        A        + W   +++A   I   +SD +        TA+ I   L+ ++   + + + 
Subjt:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLRH

Query:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANV--AI
           K +Y   M EGT+   H+                I+E+ +   ++ SLP S+    T ++  K    L  + + L   + +       G+A +    
Subjt:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANV--AI

Query:  SKKLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCP---KYLAEKKAEKTQGN
         +   R S++  +SG           +GK KN+  +  +N         C++CN+ GH+KR+CP   K   E   +K   N
Subjt:  SKKLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCP---KYLAEKKAEKTQGN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCTCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAAGGAAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCTCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTGGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAAGGAAACTAG
Protein sequenceShow/hide protein sequence
MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVLAKKHDVMGTAKEIMESLKGMFGQPSFSLR
HEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSQ
NKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQGN