CuGenDBv2

CmoCh09G012390 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh09G012390
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: Reverse transcriptase
Genome location: Cmo_Chr09:10638698..10640050
RNA-Seq Expression: CmoCh09G012390
Synteny: CmoCh09G012390
Gene Ontology terms:
GO:0006278 - RNA-dependent DNA biosynthetic process (biological process)
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0090305 - nucleic acid phosphodiester bond hydrolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0003964 - RNA-directed DNA polymerase activity (molecular function)
GO:0004190 - aspartic-type endopeptidase activity (molecular function)
GO:0004519 - endonuclease activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR041577 - Reverse transcriptase/retrotransposon-derived protein, RNase H-like domain
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily
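
The genome location listed above can be converted into a feature length with a few lines of Python. This is only a minimal sketch: it assumes the coordinates are 1-based and inclusive, which is common for annotation files but is not stated on this page, and `parse_location` and `span_length` are hypothetical helper names rather than part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("Cmo_Chr09:10638698..10640050")
print(chrom, span_length(start, end))
```

Under that assumption the span is 1353 bp, the same length as the CDS listed under Sequences (450 codons plus a stop codon), which would be consistent with a single-exon gene.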


Homology
GenBank top hits (e-value, % identity, alignment)
XP_022150099.1 uncharacterized protein LOC111018360 [Momordica charantia] (e-value: 7.9e-196; identity: 75.95%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKC+++   + TV+  SIKQPG +RMISA+QLK+GL REEPTFM IP++E+      +P EI+ V+  Y DIM +SLP+TLPPRRGIDHEIEL+P  
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAP+GAPVLFQKKKDGTLRLCIDYRALNKVTV NKYPL II+DLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVM FGLTNAPATF  +MN VF+EYLDQFV+V LDDIVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I F
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HVI  GQI MD+DK+KAIQEW+VPTSV ++RSFLGLANYYRRF+EGFSRRATP+TELLKK   W WS + Q AFEDLK  M +GP   L DVTK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR
        ET+ASD+ALG VL+Q+  PI YESRKLN+AERRYTVSEKEMLA+VHCLR
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR

XP_022975516.1 uncharacterized protein LOC111474945, partial [Cucurbita maxima] (e-value: 1.1e-192; identity: 79.56%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKCLVI +RN TVIPASIKQP                               TTEET+P+EI EVLN YADIM ESLPQTLPPRRGIDHEIEL+P V
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTV NKYPL IISDLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVMPFGLTNAPATF  LMN VFYEYLDQFVIV LDDIVVYSTTLEEHKVHLKLVFDKLRQNQL                
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
              CGQI MDSDKIKAIQEWKVPTSVSD+RSFLGLANYYRRFVEGFSRRA PLTELLKKDH WSWS+ CQMAFEDLKTTMTRGP   LVDVTK FEI
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV
        ET+ASDFALG VLIQEG PIA+ESRKLNDAERRYTVSEKEMLA+VHCLRV
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV

XP_023524533.1 uncharacterized protein LOC111788429 [Cucurbita pepo subsp. pepo] (e-value: 4.4e-231; identity: 90%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKCLVI +RN TVIPASIKQPGNLRMISAIQLKRGLAREEPTFM IPL+EE TTEET+P+EIKEVL++Y DIM ESLPQTLPPRRGIDHEIELLP V
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTV NKYPL IISDLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RI EGDE KTT +TRYGAFEFLVMPFGLTNAPATFY LMN VFYEYLDQFVIV LDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HV+ CGQI MDSDKIKAIQEWKVPTSVS++RSFLGLANYYRRFVEGFSRRA PLTELLKKDHPWSWSN CQMAFE+LKTTMTRGP   LVDVTK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV
        ET+ASDFALG VLIQEG PIAYESRKLNDAERRYTVSEKEMLA+VHCLRV
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV

XP_023526180.1 uncharacterized protein LOC111789739 [Cucurbita pepo subsp. pepo] (e-value: 1.9e-202; identity: 89.25%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKCLVI +RN  VIPASIKQPGNLRMISAIQLKRGLAREEPTFM IPL+EE TTEET+P+EIKEVL+SY DIM ESLPQTLPPRRGIDHEIELLP V
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTV NKYPL IISDLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVMPFGLTNAPATF  LMN VFYEYLDQFVIV LDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI+F
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HV+ CGQI MDSDKIKAIQEWKVPTSVS++RSFLGLANYYRRFVEGFSRRA PLTELLKKDHPWSWSN CQMAFE+LKTTMTRGP   LVD+TK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

XP_023537907.1 uncharacterized protein LOC111798805 [Cucurbita pepo subsp. pepo] (e-value: 6.4e-230; identity: 89.78%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKCLVI +RN TVIPASIKQPGNLRMISAIQLKRGLAREEPTFM IPL+EE TTEET+P+EIK+VL+SY DIM ESLPQTLPPRRGIDHEIELLP V
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTV NKYPL IISDLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVMPFGLTNAPATF  LMN VFYEYLDQFVIV LDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCI+F
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HV+ CGQI MDSDKIKAIQEWKVPTSVS++RSFLGLANYYRRFVEGFSRRA PLTELLKKDHPWSWSN CQMAFE+LKTTMTRGP   LVDVTK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV
        ET+ASDFALG VLIQEG PIAYESRKLNDAERRYTVSEKEMLA+VHCLRV
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV

TrEMBL top hits (e-value, % identity, alignment)
A0A5D3C4R1 Reverse transcriptase (e-value: 1.4e-187; identity: 72.61%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKCLVI     +V+   ++QP  L+MISA+QLK+GL+R+EPTFM IPL     + ET+P EI  VL  Y D+M +SLP++LPPRR IDHEIEL+P  
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAPYGAPVLFQ+KKDG+LRLCIDYRALNK+TV NKYPL II+DLFD+LHGAKYF+KLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVMPFGLTNAPATF  LMN VF+EYLD+FV+V LDDIVVYSTT+EEH+ HL+ VF KL++NQLYVK+EKC+FAQ  INF
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HVI CG+IGM+  KI AI++W +P SVS++RSFLGLANYYRRFVEGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP   + DVTK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR
        ET+ASD+ALG VL+Q G PIAYESRKLN AERRYTVSEKEMLA+VHCLR
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR

A0A5D3C9P8 Reverse transcriptase (e-value: 1.1e-187; identity: 72.83%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKCLVI     +V+   ++QP  L+MISA+QLK+GL+R+EPTFM IPL     + ET+P EI  VL  Y D+M +SLP++LPPRR IDHEIEL+P  
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAPYGAPVLFQKKKDG+LRLCIDYRALNK+TV NKYPL II+DLFD+LHGAKYF+KLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVMPFGLTNAPATF  LMN VF+EYLD+FV+V LDDIVVYSTT+EEH+ HL+ VF KL++NQLYVK+EKC+FAQ  INF
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HVI CG+IGM+  KI AI++W +P SVS++RSFLGLANYYRRFVEGFS+RA+PLTELLKKD  W+W  +CQ AF+ LK  +  GP   + DVTK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR
        ET+ASD+ALG VL+Q G PIAYESRKLN AERRYTVSEKEMLA+VHCLR
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR

A0A6J1D906 Reverse transcriptase (e-value: 3.8e-196; identity: 75.95%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKC+++   + TV+  SIKQPG +RMISA+QLK+GL REEPTFM IP++E+      +P EI+ V+  Y DIM +SLP+TLPPRRGIDHEIEL+P  
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAP+GAPVLFQKKKDGTLRLCIDYRALNKVTV NKYPL II+DLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVM FGLTNAPATF  +MN VF+EYLDQFV+V LDDIVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I F
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HVI  GQI MD+DK+KAIQEW+VPTSV ++RSFLGLANYYRRF+EGFSRRATP+TELLKK   W WS + Q AFEDLK  M +GP   L DVTK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR
        ET+ASD+ALG VL+Q+  PI YESRKLN+AERRYTVSEKEMLA+VHCLR
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR

A0A6J1DK29 uncharacterized protein LOC111021829 (e-value: 6.9e-190; identity: 74.83%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKC+++   + TV+ ASIKQPG +RMISA+QLK+GL REEPTF     M+ V T + +P EI+ V+  Y DIM +SLP+TLPPRRGIDHEIEL+P  
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP +LAELRKQLDEL   GFI PAKAP+GA VLFQKKKDGTLRLCIDYRALNKVTV NKYPL II+DLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVMPFGLTNAPATF  +MN VF+EYLDQFV+V LDDIVVYS TL+EH++HL+LVFDKLRQNQLYVKKEKCAFAQ  I F
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
        L HVI  GQI MD+DK+K IQEW+VPTSV+++RSFLGLANYYRRF+EGFSRRATP+TELL K     WS + Q AFEDLK  M +GP   L DVTK FE+
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR
        ET+ASD+ALG VL+Q+  PIAYESRKLN+AERRYTVSEKEMLA+VHCLR
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLR

A0A6J1IEF9 uncharacterized protein LOC111474945 (e-value: 5.1e-193; identity: 79.56%)
Query:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV
        MPLAKCLVI +RN TVIPASIKQP                               TTEET+P+EI EVLN YADIM ESLPQTLPPRRGIDHEIEL+P V
Subjt:  MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEV

Query:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV
        KPPAKNAY MAP ELAELRKQLDELL AGFI PAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTV NKYPL IISDLFDQLHGAKYFTKLD+RSGYYQV
Subjt:  KPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQV

Query:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF
        RIAEGDE KTT +TRYGAFEFLVMPFGLTNAPATF  LMN VFYEYLDQFVIV LDDIVVYSTTLEEHKVHLKLVFDKLRQNQL                
Subjt:  RIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINF

Query:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI
              CGQI MDSDKIKAIQEWKVPTSVSD+RSFLGLANYYRRFVEGFSRRA PLTELLKKDH WSWS+ CQMAFEDLKTTMTRGP   LVDVTK FEI
Subjt:  LEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEI

Query:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV
        ET+ASDFALG VLIQEG PIA+ESRKLNDAERRYTVSEKEMLA+VHCLRV
Subjt:  ETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKEMLALVHCLRV

SwissProt top hits (e-value, % identity, alignment)
P0CT34 Transposon Tf2-1 polyprotein (e-value: 2.0e-69; identity: 34.77%)
Query:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP
        ++  + T   +  +  +  E  +PD  KE  +  A+  +E LP+   P +G++ E+EL  E  + P +N Y + P ++  +  ++++ L +G I  +KA 
Subjt:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP

Query:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM
           PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  ++ G+  FTKLD++S Y+ +R+ +GDE K  F    G FE+LVMP+G++ APA F  
Subjt:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM

Query:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG
         +N +  E  +  V+  +DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+ + IS        + I  + +WK P +  ++R FLG
Subjt:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG

Query:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER
          NY R+F+   S+   PL  LLKKD  W W+     A E++K  +   P     D +K   +ET+ASD A+G VL Q+       P+ Y S K++ A+ 
Subjt:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER

Query:  RYTVSEKEMLALVHCLR
         Y+VS+KEMLA++  L+
Subjt:  RYTVSEKEMLALVHCLR

P0CT35 Transposon Tf2-2 polyprotein (e-value: 2.0e-69; identity: 34.77%)
Query:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP
        ++  + T   +  +  +  E  +PD  KE  +  A+  +E LP+   P +G++ E+EL  E  + P +N Y + P ++  +  ++++ L +G I  +KA 
Subjt:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP

Query:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM
           PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  ++ G+  FTKLD++S Y+ +R+ +GDE K  F    G FE+LVMP+G++ APA F  
Subjt:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM

Query:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG
         +N +  E  +  V+  +DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+ + IS        + I  + +WK P +  ++R FLG
Subjt:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG

Query:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER
          NY R+F+   S+   PL  LLKKD  W W+     A E++K  +   P     D +K   +ET+ASD A+G VL Q+       P+ Y S K++ A+ 
Subjt:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER

Query:  RYTVSEKEMLALVHCLR
         Y+VS+KEMLA++  L+
Subjt:  RYTVSEKEMLALVHCLR

P0CT36 Transposon Tf2-3 polyprotein (e-value: 2.0e-69; identity: 34.77%)
Query:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP
        ++  + T   +  +  +  E  +PD  KE  +  A+  +E LP+   P +G++ E+EL  E  + P +N Y + P ++  +  ++++ L +G I  +KA 
Subjt:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP

Query:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM
           PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  ++ G+  FTKLD++S Y+ +R+ +GDE K  F    G FE+LVMP+G++ APA F  
Subjt:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM

Query:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG
         +N +  E  +  V+  +DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+ + IS        + I  + +WK P +  ++R FLG
Subjt:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG

Query:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER
          NY R+F+   S+   PL  LLKKD  W W+     A E++K  +   P     D +K   +ET+ASD A+G VL Q+       P+ Y S K++ A+ 
Subjt:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER

Query:  RYTVSEKEMLALVHCLR
         Y+VS+KEMLA++  L+
Subjt:  RYTVSEKEMLALVHCLR

P0CT37 Transposon Tf2-4 polyprotein (e-value: 2.0e-69; identity: 34.77%)
Query:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP
        ++  + T   +  +  +  E  +PD  KE  +  A+  +E LP+   P +G++ E+EL  E  + P +N Y + P ++  +  ++++ L +G I  +KA 
Subjt:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP

Query:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM
           PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  ++ G+  FTKLD++S Y+ +R+ +GDE K  F    G FE+LVMP+G++ APA F  
Subjt:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM

Query:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG
         +N +  E  +  V+  +DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+ + IS        + I  + +WK P +  ++R FLG
Subjt:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG

Query:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER
          NY R+F+   S+   PL  LLKKD  W W+     A E++K  +   P     D +K   +ET+ASD A+G VL Q+       P+ Y S K++ A+ 
Subjt:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER

Query:  RYTVSEKEMLALVHCLR
         Y+VS+KEMLA++  L+
Subjt:  RYTVSEKEMLALVHCLR

P0CT41 Transposon Tf2-12 polyprotein (e-value: 2.0e-69; identity: 34.77%)
Query:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP
        ++  + T   +  +  +  E  +PD  KE  +  A+  +E LP+   P +G++ E+EL  E  + P +N Y + P ++  +  ++++ L +G I  +KA 
Subjt:  LAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPE-VKPPAKNAYWMAPLELAELRKQLDELLTAGFICPAKAP

Query:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM
           PV+F  KK+GTLR+ +DY+ LNK    N YPL +I  L  ++ G+  FTKLD++S Y+ +R+ +GDE K  F    G FE+LVMP+G++ APA F  
Subjt:  YGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFEFLVMPFGLTNAPATFYM

Query:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG
         +N +  E  +  V+  +DDI+++S +  EH  H+K V  KL+   L + + KC F Q+ + F+ + IS        + I  + +WK P +  ++R FLG
Subjt:  LMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLG

Query:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER
          NY R+F+   S+   PL  LLKKD  W W+     A E++K  +   P     D +K   +ET+ASD A+G VL Q+       P+ Y S K++ A+ 
Subjt:  LANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEG-----RPIAYESRKLNDAER

Query:  RYTVSEKEMLALVHCLR
         Y+VS+KEMLA++  L+
Subjt:  RYTVSEKEMLALVHCLR

Arabidopsis top hits (e-value, % identity, alignment)
ATMG00860.1 DNA/RNA polymerases superfamily protein (e-value: 1.2e-19; identity: 38.46%)
Query:  HLKLVFDKLRQNQLYVKKEKCAFAQTCINFL--EHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSW
        HL +V     Q+Q Y  ++KCAF Q  I +L   H+IS   +  D  K++A+  W  P + +++R FLGL  YYRRFV+ + +   PLTELLKK +   W
Subjt:  HLKLVFDKLRQNQLYVKKEKCAFAQTCINFL--EHVISCGQIGMDSDKIKAIQEWKVPTSVSDVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSW

Query:  SNKCQMAFEDLKTTMTRGPAFELVDVTKSF
        +    +AF+ LK  +T  P   L D+   F
Subjt:  SNKCQMAFEDLKTTMTRGPAFELVDVTKSF
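
In the alignment blocks above, the middle line follows the usual BLAST convention: a residue letter marks an identical position, `+` marks a conservative substitution, and a blank marks a mismatch or gap. The Python sketch below shows one way a per-block identity could be tallied from that line. `midline_stats` is a hypothetical helper written for illustration, and the example strings are toy data, not a hit from this page.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one alignment block.

    The midline is padded to the query width because trailing blanks
    are often lost when a page is copied as plain text.
    """
    width = len(query)
    padded = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in padded)   # letters mark identities
    positives = identical + padded.count("+")        # '+' marks conservative changes
    return {
        "columns": width,
        "identical": identical,
        "positives": positives,
        "pct_identity": round(100 * identical / width, 2),
    }


# Toy block: five identical columns and one conservative substitution.
print(midline_stats("MPLAKC", "MP+AKC"))
```

The %identity reported for a hit is computed over the whole alignment, so per-block tallies would need to be summed across all blocks before comparing with the listed value.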


Sequences
CDS sequence
ATGCCACTAGCGAAATGCTTGGTGATCATTGAACGCAATTCCACAGTAATACCTGCAAGCATCAAACAGCCAGGTAATCTTAGAATGATTTCGGCCATACAATTGAAAAG
AGGGCTCGCACGAGAGGAACCTACATTTATGACTATACCACTGATGGAAGAAGTGACTACTGAGGAAACTATCCCAGACGAAATCAAGGAGGTATTAAACAGTTACGCTG
ATATAATGTCAGAGAGCCTACCACAAACATTACCACCCCGTCGAGGCATTGATCACGAAATTGAACTCCTTCCCGAGGTTAAACCCCCAGCGAAGAATGCATACTGGATG
GCTCCCCTTGAGCTAGCCGAATTGAGGAAACAACTAGATGAGTTGTTGACGGCAGGATTCATTTGCCCGGCAAAAGCACCTTACGGAGCCCCCGTACTATTTCAGAAAAA
GAAGGATGGGACGTTGCGCCTGTGCATAGATTATAGAGCCTTAAACAAGGTGACGGTACACAACAAGTACCCACTGCTGATAATATCTGACTTGTTCGACCAACTTCATG
GGGCCAAATACTTCACGAAGTTGGACATACGATCAGGGTACTACCAAGTACGTATCGCCGAGGGGGACGAGTCCAAGACGACGTTCATGACAAGATATGGGGCCTTCGAG
TTCCTGGTGATGCCCTTTGGCTTGACAAACGCCCCAGCTACGTTTTACATGTTAATGAACCTGGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATCCCTCGATGA
CATAGTTGTATACAGCACAACCTTAGAGGAACACAAGGTACACTTGAAGCTTGTATTTGACAAGCTGCGGCAGAATCAGTTGTACGTCAAGAAAGAAAAATGTGCCTTCG
CACAGACATGCATCAACTTCCTCGAACATGTCATCAGTTGTGGACAGATTGGGATGGATAGCGATAAGATAAAAGCTATCCAAGAGTGGAAAGTCCCTACTTCCGTATCC
GATGTGCGATCCTTCTTAGGATTAGCAAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCACCCCCTTAACGGAGCTCCTGAAGAAAGACCACCCTTGGTC
GTGGTCAAATAAATGTCAAATGGCCTTTGAAGATCTGAAGACAACCATGACGAGGGGTCCTGCCTTCGAGTTAGTAGATGTTACAAAGTCATTTGAAATAGAAACAAATG
CTTCTGACTTTGCCCTAGGCGATGTCCTTATTCAAGAAGGGCGCCCCATCGCTTATGAAAGTCGAAAGCTCAATGATGCCGAACGGAGATACACTGTCTCCGAAAAGGAA
ATGCTAGCTTTAGTTCATTGCCTACGAGTCTAG
mRNA sequence
ATGCCACTAGCGAAATGCTTGGTGATCATTGAACGCAATTCCACAGTAATACCTGCAAGCATCAAACAGCCAGGTAATCTTAGAATGATTTCGGCCATACAATTGAAAAG
AGGGCTCGCACGAGAGGAACCTACATTTATGACTATACCACTGATGGAAGAAGTGACTACTGAGGAAACTATCCCAGACGAAATCAAGGAGGTATTAAACAGTTACGCTG
ATATAATGTCAGAGAGCCTACCACAAACATTACCACCCCGTCGAGGCATTGATCACGAAATTGAACTCCTTCCCGAGGTTAAACCCCCAGCGAAGAATGCATACTGGATG
GCTCCCCTTGAGCTAGCCGAATTGAGGAAACAACTAGATGAGTTGTTGACGGCAGGATTCATTTGCCCGGCAAAAGCACCTTACGGAGCCCCCGTACTATTTCAGAAAAA
GAAGGATGGGACGTTGCGCCTGTGCATAGATTATAGAGCCTTAAACAAGGTGACGGTACACAACAAGTACCCACTGCTGATAATATCTGACTTGTTCGACCAACTTCATG
GGGCCAAATACTTCACGAAGTTGGACATACGATCAGGGTACTACCAAGTACGTATCGCCGAGGGGGACGAGTCCAAGACGACGTTCATGACAAGATATGGGGCCTTCGAG
TTCCTGGTGATGCCCTTTGGCTTGACAAACGCCCCAGCTACGTTTTACATGTTAATGAACCTGGTTTTCTACGAATACTTGGATCAGTTTGTCATAGTATCCCTCGATGA
CATAGTTGTATACAGCACAACCTTAGAGGAACACAAGGTACACTTGAAGCTTGTATTTGACAAGCTGCGGCAGAATCAGTTGTACGTCAAGAAAGAAAAATGTGCCTTCG
CACAGACATGCATCAACTTCCTCGAACATGTCATCAGTTGTGGACAGATTGGGATGGATAGCGATAAGATAAAAGCTATCCAAGAGTGGAAAGTCCCTACTTCCGTATCC
GATGTGCGATCCTTCTTAGGATTAGCAAACTACTATAGGCGGTTCGTCGAAGGGTTTTCACGACGAGCCACCCCCTTAACGGAGCTCCTGAAGAAAGACCACCCTTGGTC
GTGGTCAAATAAATGTCAAATGGCCTTTGAAGATCTGAAGACAACCATGACGAGGGGTCCTGCCTTCGAGTTAGTAGATGTTACAAAGTCATTTGAAATAGAAACAAATG
CTTCTGACTTTGCCCTAGGCGATGTCCTTATTCAAGAAGGGCGCCCCATCGCTTATGAAAGTCGAAAGCTCAATGATGCCGAACGGAGATACACTGTCTCCGAAAAGGAA
ATGCTAGCTTTAGTTCATTGCCTACGAGTCTAG
Protein sequence
MPLAKCLVIIERNSTVIPASIKQPGNLRMISAIQLKRGLAREEPTFMTIPLMEEVTTEETIPDEIKEVLNSYADIMSESLPQTLPPRRGIDHEIELLPEVKPPAKNAYWM
APLELAELRKQLDELLTAGFICPAKAPYGAPVLFQKKKDGTLRLCIDYRALNKVTVHNKYPLLIISDLFDQLHGAKYFTKLDIRSGYYQVRIAEGDESKTTFMTRYGAFE
FLVMPFGLTNAPATFYMLMNLVFYEYLDQFVIVSLDDIVVYSTTLEEHKVHLKLVFDKLRQNQLYVKKEKCAFAQTCINFLEHVISCGQIGMDSDKIKAIQEWKVPTSVS
DVRSFLGLANYYRRFVEGFSRRATPLTELLKKDHPWSWSNKCQMAFEDLKTTMTRGPAFELVDVTKSFEIETNASDFALGDVLIQEGRPIAYESRKLNDAERRYTVSEKE
MLALVHCLRV
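
As a quick consistency check between the CDS and protein listings, the final line of the CDS can be translated with the standard genetic code and compared with the final line of the protein. The Python sketch below is illustrative only: `translate` is a hypothetical helper, and the codon table is built from the conventional TCAG ordering of the standard code. The twelve preceding CDS lines total 1320 nt, a multiple of three, so the last line starts in frame.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' denotes a stop codon."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


last_cds_line = "ATGCTAGCTTTAGTTCATTGCCTACGAGTCTAG"
print(translate(last_cds_line))  # MLALVHCLRV*
```

The translation ends in a stop codon and matches the last ten residues of the protein sequence listed above.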