CuGenDBv2

CmaCh17G004880 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh17G004880
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Retrovirus-related Pol polyprotein from transposon TNT 1-94
Genome location: Cma_Chr17:3029820..3031821
RNA-Seq Expression: CmaCh17G004880
Synteny: CmaCh17G004880
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
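
The genome location in this record uses a `sequence:start..end` form. As a minimal sketch, assuming the coordinates are 1-based and inclusive (a common convention for this notation, but not stated on this page), the length of the genomic span can be computed as below. The function name `parse_location` is hypothetical and only serves this example.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


seqid, start, end = parse_location("Cma_Chr17:3029820..3031821")
# Assumption: coordinates are 1-based and inclusive, so both end points count.
span_length = end - start + 1
print(seqid, span_length)  # Cma_Chr17 2002
```

Under that assumption the gene spans 2002 bp on Cma_Chr17.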


Homology
GenBank top hits (e-value, % identity, alignment)
XP_016740232.1 uncharacterized protein LOC107949990 [Gossypium hirsutum]; e-value 4.2e-88; identity 52.84%
Query:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE
        M+VLNLIRD EL+KMKESESVKEYSDR L+IANKVRL GS L+DSRIVEKLLVTVPEKFEATITTLENTKDLS+ISL ELLNALQAQEQRRSMRQ+ V+E
Subjt:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE

Query:  GALLVKHQDSNRYKNNKKFKNQSTNGD-PFTNYQKTKGGGFKKSYPPCRHCEKKVVICKAKDQVKEVDAQVVDQEEEEEDQLFMVTSSSGKESSESWLID
        GAL  KHQD+NRYK  K  KN   NG+  F+NYQK KG   KK+YPP +HC KK             +AQV DQ  EEED+LF+VT  S +ESSESWLID
Subjt:  GALLVKHQDSNRYKNNKKFKNQSTNGD-PFTNYQKTKGGGFKKSYPPCRHCEKKVVICKAKDQVKEVDAQVVDQEEEEEDQLFMVTSSSGKESSESWLID

Query:  SGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEE--------LVP------LRFGT----------------------KDL-----
        SGCTNHMTYDKE FEELR TE KRV+I NGEHL VKGK T+AI SYE          VP      L  G                       KDL     
Subjt:  SGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEE--------LVP------LRFGT----------------------KDL-----

Query:  ----------------------------------------------DTFIIEV-------------KRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRV
                                                      + FI+EV             KRDKLDKK+  GIF+G ST+S AYRVFQ    R+
Subjt:  ----------------------------------------------DTFIIEV-------------KRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRV

Query:  IVSQDVHFGSMLEDSEDERQDA
        IVS+DVHF   +ED +   +D+
Subjt:  IVSQDVHFGSMLEDSEDERQDA

XP_022930156.1 uncharacterized protein LOC111436667 [Cucurbita moschata]; e-value 1.2e-87; identity 73.73%
Query:  KMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLEGALLVKHQDSNRY
        KMKES+SVK+YSDRLLNIANKVRL GS+LNDSRIVEKLL ++PEKFEATITTLENTKDLSKISL ELLNALQAQ+QRR MRQEGV EG + VK+QD++RY
Subjt:  KMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLEGALLVKHQDSNRY

Query:  KNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEEEEDQLFMVTSSSG
        KN K FKNQSTNGD  TNYQKTKGGG KK YPPCRHCEKK                         VICK K QVKEVDA V+DQ  EEEDQLF+VT    
Subjt:  KNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEEEEDQLFMVTSSSG

Query:  KESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITS
        KESSE WLIDSGCTNHMTYD E FEELRDTE KRVRIDNGEHL V GK TVAI S
Subjt:  KESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITS

XP_022959005.1 uncharacterized protein LOC111460124 [Cucurbita moschata]; e-value 7.6e-114; identity 47.36%
Query:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE
        MK LNLIRDFEL+KMK+SESVKEYS+RLLNIANKVRL GS+LNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISL ELLNALQAQEQ+RSMRQEGV+E
Subjt:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE

Query:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE
        GAL VKHQD++RYKNNK FKNQ T GD   NYQKTKGGGFKKSYP CRHCEKK                         VICK KD VKEVDAQVVDQ EE
Subjt:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE

Query:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEEL----------------------------
        EEDQL MVTSSS KESSESWLIDSGCTNHMTYDKESFEELRDTE KRVRI NGEHLEVKGK TVAITSYEE                             
Subjt:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEEL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------VPLRF---------------GTKDLD------------------
                                                                +P RF                TKD+                   
Subjt:  --------------------------------------------------------VPLRF---------------GTKDLD------------------

Query:  --------TFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRVIVSQDVH--------------------------FGSMLE--------DSEDE
                T+I +VKRDKLDKKSEVGIFVG STISKAYRVFQPH SRVIVS+DVH                          FG MLE        DSEDE
Subjt:  --------TFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRVIVSQDVH--------------------------FGSMLE--------DSEDE

Query:  RQDALVDDAPVKGGAFGDRGKTKSG
        RQDALVDDAPV+GGAFGDR K   G
Subjt:  RQDALVDDAPVKGGAFGDRGKTKSG

XP_022963821.1 uncharacterized protein LOC111464007 [Cucurbita moschata]; e-value 1.9e-96; identity 80.23%
Query:  MKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLEGALLVKHQDSNRYK
        MKESESVKEYSDRL +IANKVRL GS+LNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISL ELLNALQAQEQRRSMRQEGV+EGAL VKHQDS+RYK
Subjt:  MKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLEGALLVKHQDSNRYK

Query:  NNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQ-EEEEEDQLFMVTSSSG
        NNK FKNQ T GD   NYQKTK GGFKKSYPPCRH E K                         VICKAKD VKEVDAQVVDQ EEEEEDQL MVTSSS 
Subjt:  NNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQ-EEEEEDQLFMVTSSSG

Query:  KESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIDNGEHLEVKGKSTVAITSYE
        KES ESWLIDSGCTNHMTYDKESFEELRDT EDKRVRI NGEH+EVKGK TVAITSYE
Subjt:  KESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIDNGEHLEVKGKSTVAITSYE

XP_022974382.1 uncharacterized protein LOC111473053 [Cucurbita maxima]; e-value 7.9e-103; identity 59.85%
Query:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE
        MK LNLIRDFEL+KM ES+SVKEYS++LL+IANKVRL GSVLNDS IVEKLLV VPE FE TITTLENTKDLSKISL ELLNALQAQEQRRSMRQEGV+E
Subjt:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE

Query:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE
        GAL VKHQD+ RYKN K FKNQSTNGDP TN+QKTKGG FKKSYPPCRHCEKK                         VICK++DQVKEVDAQVVDQ EE
Subjt:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE

Query:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEELV--------------PLRFGT-------
        EEDQLFMV SSS KESSESWLI+SGCTNHMTY+KE FE+LRD EDKRVRI NGE LEVKGK TVAITSYE+                PL  G        
Subjt:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEELV--------------PLRFGT-------

Query:  ---------------KDLDTFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQ-----PHISRVIVSQD--------VHFGSMLEDSEDER--------Q
                       KDL    +E K   L+  + +  F       K  R F      P  S+ +V +D          F  MLE+SEDER        Q
Subjt:  ---------------KDLDTFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQ-----PHISRVIVSQD--------VHFGSMLEDSEDER--------Q

Query:  DALVDDAPVKG
        D LVDDAPV G
Subjt:  DALVDDAPVKG
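
The five GenBank hits above can be ranked in two ways, by e-value or by percent identity. The sketch below is illustrative only: it reads each hit's fused trailing number as an e-value followed by a percent identity, and the tuple layout is an assumption made for this example.

```python
# (accession, e-value, percent identity) as listed for the GenBank hits above.
hits = [
    ("XP_016740232.1", 4.2e-88, 52.84),
    ("XP_022930156.1", 1.2e-87, 73.73),
    ("XP_022959005.1", 7.6e-114, 47.36),
    ("XP_022963821.1", 1.9e-96, 80.23),
    ("XP_022974382.1", 7.9e-103, 59.85),
]

# Smaller e-values indicate more significant alignments.
by_evalue = sorted(hits, key=lambda hit: hit[1])
# Higher percent identity means more identical aligned positions.
by_identity = sorted(hits, key=lambda hit: hit[2], reverse=True)

print(by_evalue[0][0])    # XP_022959005.1
print(by_identity[0][0])  # XP_022963821.1
```

The two orderings differ: the hit with the smallest e-value (XP_022959005.1) has the lowest listed identity, and its alignment is also the one with the longest gapped stretches.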

TrEMBL top hits (e-value, % identity, alignment)
A0A1U8KFZ5 uncharacterized protein LOC107915223; e-value 2.6e-91; identity 68.14%
Query:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE
        MKVLNLIRDFEL+KMKESESVKEYSDRLL+IANKVRL GS LNDSRIVEK+LVT+PEK EATITTLENTKDLSKISL ELLNALQAQ QRRSMRQEGV+E
Subjt:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE

Query:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFT-NYQKTKGGGFKKSYPPCRHCEK------------------------KVVICKAKDQVKEVDAQVVDQEE
        GALLVKHQD+NRYK  K F+NQST+ +  + NYQK+K GG KKSYPP  HCEK                        K V+CK K QV+EVDAQV DQ  
Subjt:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFT-NYQKTKGGGFKKSYPPCRHCEK------------------------KVVICKAKDQVKEVDAQVVDQEE

Query:  EEEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEELVPLRFGTKDLDTFIIEVKRDKL
        EEED+LF+VT  SG+ESSE WLIDSGCTNHMTYDKE FEELR+TE K VRI+N E+LEVKGK  VAITSY+       GTK +   +   K  K+
Subjt:  EEEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEELVPLRFGTKDLDTFIIEVKRDKL

A0A1U8NMJ8 uncharacterized protein LOC107949990; e-value 2.0e-88; identity 52.84%
Query:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE
        M+VLNLIRD EL+KMKESESVKEYSDR L+IANKVRL GS L+DSRIVEKLLVTVPEKFEATITTLENTKDLS+ISL ELLNALQAQEQRRSMRQ+ V+E
Subjt:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE

Query:  GALLVKHQDSNRYKNNKKFKNQSTNGD-PFTNYQKTKGGGFKKSYPPCRHCEKKVVICKAKDQVKEVDAQVVDQEEEEEDQLFMVTSSSGKESSESWLID
        GAL  KHQD+NRYK  K  KN   NG+  F+NYQK KG   KK+YPP +HC KK             +AQV DQ  EEED+LF+VT  S +ESSESWLID
Subjt:  GALLVKHQDSNRYKNNKKFKNQSTNGD-PFTNYQKTKGGGFKKSYPPCRHCEKKVVICKAKDQVKEVDAQVVDQEEEEEDQLFMVTSSSGKESSESWLID

Query:  SGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEE--------LVP------LRFGT----------------------KDL-----
        SGCTNHMTYDKE FEELR TE KRV+I NGEHL VKGK T+AI SYE          VP      L  G                       KDL     
Subjt:  SGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEE--------LVP------LRFGT----------------------KDL-----

Query:  ----------------------------------------------DTFIIEV-------------KRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRV
                                                      + FI+EV             KRDKLDKK+  GIF+G ST+S AYRVFQ    R+
Subjt:  ----------------------------------------------DTFIIEV-------------KRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRV

Query:  IVSQDVHFGSMLEDSEDERQDA
        IVS+DVHF   +ED +   +D+
Subjt:  IVSQDVHFGSMLEDSEDERQDA

A0A6J1H529 uncharacterized protein LOC111460124; e-value 3.7e-114; identity 47.36%
Query:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE
        MK LNLIRDFEL+KMK+SESVKEYS+RLLNIANKVRL GS+LNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISL ELLNALQAQEQ+RSMRQEGV+E
Subjt:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE

Query:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE
        GAL VKHQD++RYKNNK FKNQ T GD   NYQKTKGGGFKKSYP CRHCEKK                         VICK KD VKEVDAQVVDQ EE
Subjt:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE

Query:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEEL----------------------------
        EEDQL MVTSSS KESSESWLIDSGCTNHMTYDKESFEELRDTE KRVRI NGEHLEVKGK TVAITSYEE                             
Subjt:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEEL----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------------------VPLRF---------------GTKDLD------------------
                                                                +P RF                TKD+                   
Subjt:  --------------------------------------------------------VPLRF---------------GTKDLD------------------

Query:  --------TFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRVIVSQDVH--------------------------FGSMLE--------DSEDE
                T+I +VKRDKLDKKSEVGIFVG STISKAYRVFQPH SRVIVS+DVH                          FG MLE        DSEDE
Subjt:  --------TFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRVIVSQDVH--------------------------FGSMLE--------DSEDE

Query:  RQDALVDDAPVKGGAFGDRGKTKSG
        RQDALVDDAPV+GGAFGDR K   G
Subjt:  RQDALVDDAPVKGGAFGDRGKTKSG

A0A6J1HH50 uncharacterized protein LOC111464007; e-value 9.1e-97; identity 80.23%
Query:  MKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLEGALLVKHQDSNRYK
        MKESESVKEYSDRL +IANKVRL GS+LNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISL ELLNALQAQEQRRSMRQEGV+EGAL VKHQDS+RYK
Subjt:  MKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLEGALLVKHQDSNRYK

Query:  NNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQ-EEEEEDQLFMVTSSSG
        NNK FKNQ T GD   NYQKTK GGFKKSYPPCRH E K                         VICKAKD VKEVDAQVVDQ EEEEEDQL MVTSSS 
Subjt:  NNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQ-EEEEEDQLFMVTSSSG

Query:  KESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIDNGEHLEVKGKSTVAITSYE
        KES ESWLIDSGCTNHMTYDKESFEELRDT EDKRVRI NGEH+EVKGK TVAITSYE
Subjt:  KESSESWLIDSGCTNHMTYDKESFEELRDT-EDKRVRIDNGEHLEVKGKSTVAITSYE

A0A6J1IA47 uncharacterized protein LOC111473053; e-value 3.8e-103; identity 59.85%
Query:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE
        MK LNLIRDFEL+KM ES+SVKEYS++LL+IANKVRL GSVLNDS IVEKLLV VPE FE TITTLENTKDLSKISL ELLNALQAQEQRRSMRQEGV+E
Subjt:  MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLE

Query:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE
        GAL VKHQD+ RYKN K FKNQSTNGDP TN+QKTKGG FKKSYPPCRHCEKK                         VICK++DQVKEVDAQVVDQ EE
Subjt:  GALLVKHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKK------------------------VVICKAKDQVKEVDAQVVDQEEE

Query:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEELV--------------PLRFGT-------
        EEDQLFMV SSS KESSESWLI+SGCTNHMTY+KE FE+LRD EDKRVRI NGE LEVKGK TVAITSYE+                PL  G        
Subjt:  EEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDKESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEELV--------------PLRFGT-------

Query:  ---------------KDLDTFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQ-----PHISRVIVSQD--------VHFGSMLEDSEDER--------Q
                       KDL    +E K   L+  + +  F       K  R F      P  S+ +V +D          F  MLE+SEDER        Q
Subjt:  ---------------KDLDTFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQ-----PHISRVIVSQD--------VHFGSMLEDSEDER--------Q

Query:  DALVDDAPVKG
        D LVDDAPV G
Subjt:  DALVDDAPVKG

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAAAGTCCTAAATTTGATTAGGGATTTCGAGTTGGAGAAGATGAAGGAGTCTGAATCGGTGAAAGAGTACTCTGACAGACTTCTCAACATTGCCAACAAGGTA
AGATTGTTTGGTTCTGTATTAAATGATTCCAGGATCGTTGAAAAGTTGCTAGTTACTGTTCCAGAGAAGTTTGAAGCCACCATTACTACTCTGGAGAACACCAAA
GACTTGTCAAAGATTTCTCTTATAGAGCTCTTGAATGCTTTACAAGCACAAGAGCAAAGGAGGTCTATGAGGCAAGAAGGGGTGCTTGAAGGTGCCTTACTTGTT
AAGCATCAAGACAGCAACAGGTATAAAAACAACAAAAAATTCAAAAACCAATCGACGAATGGAGATCCATTTACCAATTACCAGAAGACAAAAGGAGGAGGTTTC
AAAAAATCCTATCCACCTTGCCGCCATTGTGAGAAGAAAGTTGTGATCTGCAAAGCCAAAGATCAGGTGAAAGAAGTAGATGCACAGGTAGTTGATCAAGAAGAA
GAAGAAGAAGATCAATTGTTTATGGTCACTTCTTCCTCAGGAAAAGAATCAAGCGAGAGCTGGTTGATTGACAGTGGGTGCACAAATCACATGACATATGACAAG
GAGTCTTTTGAGGAATTAAGAGACACCGAAGATAAGAGAGTGAGGATTGACAATGGTGAACACTTGGAAGTCAAGGGAAAAAGCACAGTAGCTATAACAAGTTAT
GAAGAGTTAGTGCCATTGAGATTTGGCACAAAAGACTTGGACACTTTCATCATCGAGGTCAAGCGTGATAAGCTTGACAAAAAGTCAGAAGTTGGCATCTTTGTT
GGGAATAGCACTATATCCAAAGCTTACAGAGTTTTTCAACCACACATTAGTCGTGTTATTGTGAGCCAAGATGTTCATTTTGGTAGCATGTTAGAAGATTCTGAA
GATGAACGACAAGATGCTTTGGTTGATGATGCACCTGTCAAAGGAGGAGCTTTTGGTGATCGAGGAAAAACAAAATCTGGGAGTTGGTTGGTTGACCTCTAA
mRNA sequence
ATGAAAGTCCTAAATTTGATTAGGGATTTCGAGTTGGAGAAGATGAAGGAGTCTGAATCGGTGAAAGAGTACTCTGACAGACTTCTCAACATTGCCAACAAGGTA
AGATTGTTTGGTTCTGTATTAAATGATTCCAGGATCGTTGAAAAGTTGCTAGTTACTGTTCCAGAGAAGTTTGAAGCCACCATTACTACTCTGGAGAACACCAAA
GACTTGTCAAAGATTTCTCTTATAGAGCTCTTGAATGCTTTACAAGCACAAGAGCAAAGGAGGTCTATGAGGCAAGAAGGGGTGCTTGAAGGTGCCTTACTTGTT
AAGCATCAAGACAGCAACAGGTATAAAAACAACAAAAAATTCAAAAACCAATCGACGAATGGAGATCCATTTACCAATTACCAGAAGACAAAAGGAGGAGGTTTC
AAAAAATCCTATCCACCTTGCCGCCATTGTGAGAAGAAAGTTGTGATCTGCAAAGCCAAAGATCAGGTGAAAGAAGTAGATGCACAGGTAGTTGATCAAGAAGAA
GAAGAAGAAGATCAATTGTTTATGGTCACTTCTTCCTCAGGAAAAGAATCAAGCGAGAGCTGGTTGATTGACAGTGGGTGCACAAATCACATGACATATGACAAG
GAGTCTTTTGAGGAATTAAGAGACACCGAAGATAAGAGAGTGAGGATTGACAATGGTGAACACTTGGAAGTCAAGGGAAAAAGCACAGTAGCTATAACAAGTTAT
GAAGAGTTAGTGCCATTGAGATTTGGCACAAAAGACTTGGACACTTTCATCATCGAGGTCAAGCGTGATAAGCTTGACAAAAAGTCAGAAGTTGGCATCTTTGTT
GGGAATAGCACTATATCCAAAGCTTACAGAGTTTTTCAACCACACATTAGTCGTGTTATTGTGAGCCAAGATGTTCATTTTGGTAGCATGTTAGAAGATTCTGAA
GATGAACGACAAGATGCTTTGGTTGATGATGCACCTGTCAAAGGAGGAGCTTTTGGTGATCGAGGAAAAACAAAATCTGGGAGTTGGTTGGTTGACCTCTAA
Protein sequence
MKVLNLIRDFELEKMKESESVKEYSDRLLNIANKVRLFGSVLNDSRIVEKLLVTVPEKFEATITTLENTKDLSKISLIELLNALQAQEQRRSMRQEGVLEGALLV
KHQDSNRYKNNKKFKNQSTNGDPFTNYQKTKGGGFKKSYPPCRHCEKKVVICKAKDQVKEVDAQVVDQEEEEEDQLFMVTSSSGKESSESWLIDSGCTNHMTYDK
ESFEELRDTEDKRVRIDNGEHLEVKGKSTVAITSYEELVPLRFGTKDLDTFIIEVKRDKLDKKSEVGIFVGNSTISKAYRVFQPHISRVIVSQDVHFGSMLEDSE
DERQDALVDDAPVKGGAFGDRGKTKSGSWLVDL
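
The protein sequence corresponds to a translation of the CDS. As a minimal sketch, assuming the standard genetic code (the page does not say which translation table was used), the first and last codons of the CDS can be checked against the ends of the protein. The helper `translate` and the table construction are illustrative, not part of the database.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3].upper()]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 and last 27 nucleotides of the CDS sequence listed above.
print(translate("ATGAAAGTCCTAAATTTGATTAGGGATTTC"))  # MKVLNLIRDF
print(translate("TCTGGGAGTTGGTTGGTTGACCTCTAA"))     # SGSWLVDL
```

Both fragments agree with the start (MKVLNLIRDF) and end (SGSWLVDL) of the protein sequence, and the CDS ends in the stop codon TAA.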