CuGenDBv2

CmoCh20G009680 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh20G009680
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Gag/pol protein
Genome location: Cmo_Chr20:5352989..5354031
RNA-Seq Expression: CmoCh20G009680
Synteny: CmoCh20G009680
Gene Ontology terms:
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR036875 - Zinc finger, CCHC-type superfamily
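The genome location field can be split into a sequence name and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for gene records, but not stated on this page); the function name `parse_location` is hypothetical.

```python
import re


def parse_location(text):
    """Split a 'seqid:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the
    span length is end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


seqid, start, end, span = parse_location("Cmo_Chr20:5352989..5354031")
print(seqid, start, end, span)
```

Under the inclusive-coordinate assumption the gene spans 1043 bases on Cmo_Chr20.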


Homology
GenBank top hits (e value, % identity, alignment)
ADJ18449.1 gag/pol protein, partial [Bryonia dioica] (e value: 8.2e-106; % identity: 71.88)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DV AKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

KAA0048103.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.8e-98; % identity: 68.4)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDV AKKH+ + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   K 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ
        +  EAN+A +K K  RGSSS++K GPS     + +KKGKGK    T ++NK +K   KGKC+HC ENGH   NCPKYL +KKAEK  Q
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ

TYK01653.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.4e-97; % identity: 68.42)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M NSIVQLLASEKLN +NY  WKSNLNTILV+DDL+FVLTEECP  P SNANR ++ AYDRWIKAN K  VYILAS+SDV AKKH+ + T KEI++SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGTSVREHVLDMM+HFN+AE N  VIDE  QVSFI+ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EANVA +K K  RGSSS++K GPS   S  MKKKG GK    T ++NKV+K  +KGKC+HC ENGHW +NCPKYLA+KK  +
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK

TYK11933.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 2.8e-98; % identity: 67.36)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DV AKKH+ +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA +K K  RG SS++K GPS     + +KKGKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+KKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

TYK31700.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 1.3e-98; % identity: 68.51)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        + +SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA ++DV AKKH+ + TAKEIM++LK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVAI+K K  RGSSS+ K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +NGHW RNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TWX1 Gag/pol protein (e value: 2.3e-98; % identity: 68.4)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M + IVQLLASEKLN DNYTTWKSNLNTILV+DDL+FVLTEECP  P SNANRT+R+AYDRWIKAN+KARVYILAS+SDV AKKH+ + TAKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRH+AIKYIY  RMKEGTS+REHVL MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TNV +NKIE+NLT LLNELQ +Q+L   K 
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ
        +  EAN+A +K K  RGSSS++K GPS     + +KKGKGK    T ++NK +K   KGKC+HC ENGH   NCPKYL +KKAEK  Q
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKAD-KGKCFHCNENGHWKRNCPKYLAEKKAEKTQQ

A0A5D3BPM3 Gag/pol protein (e value: 2.6e-97; % identity: 68.42)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M NSIVQLLASEKLN +NY  WKSNLNTILV+DDL+FVLTEECP  P SNANR ++ AYDRWIKAN K  VYILAS+SDV AKKH+ + T KEI++SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGTSVREHVLDMM+HFN+AE N  VIDE  QVSFI+ SL KSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK
        +  EANVA +K K  RGSSS++K GPS   S  MKKKG GK    T ++NKV+K  +KGKC+HC ENGHW +NCPKYLA+KK  +
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQK-ADKGKCFHCNENGHWKRNCPKYLAEKKAEK

A0A5D3CJ27 Gag/pol protein (e value: 1.4e-98; % identity: 67.36)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M +SIVQLLASEKLNGDNY  WKSNLNTILV++DL+FVLTEECPP P SNANRT+R+AYDRWIKAN+KA VYIL S+ DV AKKH+ +  AKEIM+SLKG
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQ  +SL+HEAIKYIY  RMKEGT VREHVL+MM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  ++KIE+NLT LLNELQ++Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA +K K  RG SS++K GPS     + +KKGKG N  P   K K +  +KGKC+HC ENGHW RNCPKYLA+KKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

A0A5D3E7W3 Gag/pol protein (e value: 6.2e-99; % identity: 68.51)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        + +SIVQLLASEKLNGDNY  WKSNLNTILV+DDL+FVLTEE P  P SNANRT+R AYDRW+KAN+KARVYILA ++DV AKKH+ + TAKEIM++LK 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQP +SLRHEAIKYIY  RMKEGT VREHVLDMM+ FN+AE N+  IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L   KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVAI+K K  RGSSS+ K+GP   K  + KKKGKGK    T ++NK +KA +KGKC+H  +NGHW RNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKA-DKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

E2GK51 Gag/pol protein (Fragment) (e value: 4.0e-106; % identity: 71.88)
Query:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG
        M  SIVQLLASEKLNGDNY+ WKSNLNTILV+DDL+FVLTEECP  P  NANRT R+AYDRW+KANDKARVYILAS++DV AKKHD + TAK IM+SL+ 
Subjt:  MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKG

Query:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG
        MFGQPS+SLRHEAIK+IY  RMKEGTSVREHVLDMM+HFN+AE N   IDE +QVSFI+ SLPKSF  F+TN  +NKIE+NLT LLNELQ +Q+L  +KG
Subjt:  MFGQPSFSLRHEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKG

Query:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG
        +  EANVA++K K +RGSSS+NK GPS ++   MKKKGKG  K P   K K + ADKGKCFHCN++GHWKRNCPKYLAEKKAEK  QG
Subjt:  QTGEANVAISK-KLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQG

SwissProt top hits (e value, % identity, alignment)
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 1.8e-07; % identity: 21.97)
Query:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKGMFGQPSFSLRH
        K NGDN ++TW+  +  +L+   L  VL  +        A        + W   +++A   I   +SD          TA+ I   L+ ++   + + + 
Subjt:  KLNGDN-YTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKGMFGQPSFSLRH

Query:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANV--AI
           K +Y   M EGT+   H+                I+E+ +   ++ SLP S+    T ++  K    L  + + L   + +       G+A +    
Subjt:  EAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANV--AI

Query:  SKKLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCP
         +   R S++  +SG           +GK KN+  +  +N         C++CN+ GH+KR+CP
Subjt:  SKKLLRGSSSQNKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCP

Arabidopsis top hits (e value, % identity, alignment)
No hits found
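
The % identity values in the homology tables can in principle be recomputed from the middle line of each alignment block. The following is a minimal sketch, assuming the usual BLAST-style convention in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `identity_stats` and the toy input are hypothetical and are not taken from this record.

```python
def identity_stats(midline_blocks, aligned_length):
    """Count identities and positives in BLAST-style midline text.

    midline_blocks: the middle line of each alignment block, in order.
    aligned_length: total number of alignment columns (gaps included).
    It is passed explicitly because trailing spaces on a midline are
    easily lost when a page is copied as text.
    """
    midline = "".join(midline_blocks)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100.0 * identities / aligned_length, 2),
    }


# Toy two-block alignment with 12 columns in total.
stats = identity_stats(["M  SIV", "QL+AS "], aligned_length=12)
print(stats)
```

Applied to a real hit, the midline blocks would be the unlabeled lines between each `Query:` and `Subjt:` pair, with the leading indentation removed.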

Sequences
CDS sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGTGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTCGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGAAACTAG
mRNA sequence
ATGACAAACTCAATAGTACAATTACTCGCTTCTGAGAAATTAAACGGTGACAACTACACAACTTGGAAATCAAACCTAAACACAATACTGGTCATTGATGATTTAAAGTT
TGTTTTAACTGAGGAGTGTCCTCCAAACCCAAACTCAAATGCAAACCGAACAGCTCGGGATGCATATGACAGATGGATAAAGGCAAATGACAAAGCCCGAGTGTACATTC
TAGCCAGCATATCTGATGTTTTCGCTAAGAAACACGATGTTATGGGTACTGCTAAAGAGATTATGGAATCTCTAAAAGGGATGTTTGGACAACCGTCCTTCTCCCTTAGA
CATGAAGCCATAAAATACATTTACAACTGCCGTATGAAAGAAGGGACCTCAGTTAGAGAACATGTCCTGGACATGATGGTCCATTTCAATGTGGCAGAAGAAAATGAAGC
TGTCATTGATGAGAAGAGTCAAGTCAGTTTTATCATGATGTCTCTTCCGAAGAGCTTCTTCCAGTTCCGCACCAATGTGGTTATGAACAAAATAGAATATAACTTGACTG
CTCTTCTCAATGAGCTACAGACTTATCAGTCCCTCTTAACAAACAAGGGACAAACAGGAGAAGCAAATGTTGCCATCTCCAAGAAATTACTACGAGGATCGTCCTCCCAA
AATAAGTCTGGACCTTCAACTTCTAAAAGTGTTTTGATGAAGAAGAAGGGAAAAGGGAAAAATAAGATTCCTACTAACCGCAAGAACAAGGTTCAAAAAGCAGATAAAGG
AAAATGTTTCCATTGCAACGAAAACGGGCACTGGAAGAGAAATTGCCCAAAATACCTTGCAGAGAAGAAAGCCGAAAAGACACAACAAGGAAACTAGCTCCTGGAGAATG
CTTGCGGACGGCGAGATAACACTCAGGGTTGGAACAGGAGAGGTTGTCTCAGCAAGATC
Protein sequence
MTNSIVQLLASEKLNGDNYTTWKSNLNTILVIDDLKFVLTEECPPNPNSNANRTARDAYDRWIKANDKARVYILASISDVFAKKHDVMGTAKEIMESLKGMFGQPSFSLR
HEAIKYIYNCRMKEGTSVREHVLDMMVHFNVAEENEAVIDEKSQVSFIMMSLPKSFFQFRTNVVMNKIEYNLTALLNELQTYQSLLTNKGQTGEANVAISKKLLRGSSSQ
NKSGPSTSKSVLMKKKGKGKNKIPTNRKNKVQKADKGKCFHCNENGHWKRNCPKYLAEKKAEKTQQGN
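
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the helper name `translate` is hypothetical, and only the first and last few codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered
# TTT, TTC, TTA, TTG, TCT, ... with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS string, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases and last 12 bases of the CDS sequence listed above.
print(translate("ATGACAAACTCAATAGTACAA"))  # MTNSIVQ
print(translate("CAAGGAAACTAG"))           # QGN
```

The two fragments reproduce the start (`MTNSIVQ`) and end (`QGN`) of the listed protein sequence. Passing the full CDS, with its line breaks, should work the same way because whitespace is removed before translation.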