; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh10G011410 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh10G011410
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGag/pol protein
Genome locationCmo_Chr10:9592135..9593561
RNA-Seq ExpressionCmoCh10G011410
SyntenyCmoCh10G011410
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADJ18449.1 gag/pol protein, partial [Bryonia dioica]2.8e-10271.08Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKH+ + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  ++G
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ
        K  EANVA++K K ++ SSSKNK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++ HWKRNCPKYLAEKKAEK  Q
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-9568.06Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDVLAKKH  + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   + 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ
        K  EAN+A +K K  + SSSK+K GPS     + +KKGKGK    T ++NK +K   KGKC+HC EN H   NCPKYL +KKAEK  Q
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ

TYK01653.1 gag/pol protein [Cucumis melo var. makuwa]2.8e-9467.13Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M NSIVQLLASEKLN +NY  WKSNLNTILV+DDL+FVLTEECP  P SNANR ++ AYDRWIKAN K  VYILAS+SDVLAKKH  + T KEI++SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQP +SLRHEAIKYIY  RMKEGTSVREHVLDMM+HFN+AE N  VIDE  QVSFI+ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   +G
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQR
        K  EANVA +K K  + SSS++K GPS   S  MKKKG GK    T ++NKV+K  +KGKC+HC EN HW +NCPKYLA+KK  +   R
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQR

TYK11933.1 gag/pol protein [Cucumis melo var. makuwa]5.6e-9566.9Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DVLAKKH  +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   +G
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ
        K  EANVA +K K  +  SSK+K GPS     + +KKGKG N  P   K K +  +KGKC+HC EN HW RNCPKYLA+KKAEK  Q
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ

TYK31700.1 gag/pol protein [Cucumis melo var. makuwa]1.5e-9565.59Show/hide
Query:  IRRVVTWESEQRKLQKWIGFLSMTN-SIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASI
        +RRV  W     K++   G  S  N SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA +
Subjt:  IRRVVTWESEQRKLQKWIGFLSMTN-SIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASI

Query:  SDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNK
        +DVLAKKH  + TAKEIM++LK MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NK
Subjt:  SDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNK

Query:  IEYNLTALLNELQTYQSLLTNEGKTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENEHWKRNCPKY
        IE+NLT LLNELQ +Q+L   +GK  EANVAI+K K  + SSSK K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +N HW RNCPKY
Subjt:  IEYNLTALLNELQTYQSLLTNEGKTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENEHWKRNCPKY

Query:  LAEKKAEKTQQ
        LAEKKAEK  Q
Subjt:  LAEKKAEKTQQ

TrEMBL top hitse value%identityAlignment
A0A5A7TWX1 Gag/pol protein7.1e-9668.06Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDVLAKKH  + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   + 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ
        K  EAN+A +K K  + SSSK+K GPS     + +KKGKGK    T ++NK +K   KGKC+HC EN H   NCPKYL +KKAEK  Q
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ

A0A5D3BPM3 Gag/pol protein1.3e-9467.13Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M NSIVQLLASEKLN +NY  WKSNLNTILV+DDL+FVLTEECP  P SNANR ++ AYDRWIKAN K  VYILAS+SDVLAKKH  + T KEI++SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQP +SLRHEAIKYIY  RMKEGTSVREHVLDMM+HFN+AE N  VIDE  QVSFI+ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   +G
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQR
        K  EANVA +K K  + SSS++K GPS   S  MKKKG GK    T ++NKV+K  +KGKC+HC EN HW +NCPKYLA+KK  +   R
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQR

A0A5D3CJ27 Gag/pol protein2.7e-9566.9Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DVLAKKH  +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   +G
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ
        K  EANVA +K K  +  SSK+K GPS     + +KKGKG N  P   K K +  +KGKC+HC EN HW RNCPKYLA+KKAEK  Q
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ

A0A5D3E7W3 Gag/pol protein7.1e-9665.59Show/hide
Query:  IRRVVTWESEQRKLQKWIGFLSMTN-SIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASI
        +RRV  W     K++   G  S  N SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA +
Subjt:  IRRVVTWESEQRKLQKWIGFLSMTN-SIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASI

Query:  SDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNK
        +DVLAKKH  + TAKEIM++LK MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NK
Subjt:  SDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNK

Query:  IEYNLTALLNELQTYQSLLTNEGKTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENEHWKRNCPKY
        IE+NLT LLNELQ +Q+L   +GK  EANVAI+K K  + SSSK K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +N HW RNCPKY
Subjt:  IEYNLTALLNELQTYQSLLTNEGKTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENEHWKRNCPKY

Query:  LAEKKAEKTQQ
        LAEKKAEK  Q
Subjt:  LAEKKAEKTQQ

E2GK51 Gag/pol protein (Fragment)1.3e-10271.08Show/hide
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DVLAKKH+ + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  ++G
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEG

Query:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ
        K  EANVA++K K ++ SSSKNK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++ HWKRNCPKYLAEKKAEK  Q
Subjt:  KTGEANVAISK-KLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQ

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.9e-0622.22Show/hide
Query:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLRH
        K NGDN ++TW+  +  +L+   L  VL  +        A        + W   +++A   I   +SD +        TA+ I   L+ ++   + + + 
Subjt:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARVYILASISDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLRH

Query:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEGKTGEANVAISK
           K +Y   M EGT+   H+                I+E+ +   ++ SLP S+    T ++  K    L  + + L   + +       G+A +   +
Subjt:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNEGKTGEANVAISK

Query:  KLLQESSSKN--KSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQRN
            + SS N  +SG           +GK KN+  +  +N         C++CN+  H+KR+CP    + K E + Q+N
Subjt:  KLLQESSSKN--KSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKAEKTQQRN

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCGCCTGCATGTTGCCCTAGAGCAACTACCCATTCGGAGGGTCGTAACATGGGAGTCGGAACAACGCAAACTCCAGAAATGGATAGGATTTCTTAGCATG
ACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAG
TTTGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAACTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTG
TACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACAATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCT
TTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTG
GCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCTAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAAC
AAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAATGAGGGAAAAACAGGAGAAGCAAATGTTGCTATCTCCAAG
AAATTACTACAAGAATCGTCCTCCAAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAAC
CGCAAGAACAAGGTTCAAAAGGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGAGCACTGGAAGAGAAACTGCCCAAAATACCTTGCAGAGAAGAAAGCC
GAAAAGACACAACAAAGAAACTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTCCGCCTGCATGTTGCCCTAGAGCAACTACCCATTCGGAGGGTCGTAACATGGGAGTCGGAACAACGCAAACTCCAGAAATGGATAGGATTTCTTAGCATG
ACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGCGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAG
TTTGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAACTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTG
TACATTCTAGCCAGCATATCTGATGTTTTGGCTAAGAAACACAATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCT
TTCTCCCTTAGACATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTG
GCAGAAGAAAATGAAGCTGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCTAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAAC
AAAATAGAATATAACTTGACTGCTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACGAATGAGGGAAAAACAGGAGAAGCAAATGTTGCTATCTCCAAG
AAATTACTACAAGAATCGTCCTCCAAAAATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAAC
CGCAAGAACAAGGTTCAAAAGGCAGATAAAGGAAAATGTTTCCATTGCAACGAAAACGAGCACTGGAAGAGAAACTGCCCAAAATACCTTGCAGAGAAGAAAGCC
GAAAAGACACAACAAAGAAACTAG
Protein sequenceShow/hide protein sequence
MFRLHVALEQLPIRRVVTWESEQRKLQKWIGFLSMTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTTRDAYDRWIKANDKARV
YILASISDVLAKKHNVMGTAKEIMESLKGMFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMN
KIEYNLTALLNELQTYQSLLTNEGKTGEANVAISKKLLQESSSKNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENEHWKRNCPKYLAEKKA
EKTQQRN