; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI03G29270 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI03G29270
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationChr3:26866266..26867890
RNA-Seq ExpressionCSPI03G29270
SyntenyCSPI03G29270
Gene Ontology termsGO:0005975 - carbohydrate metabolic process (biological process)
GO:0006488 - dolichol-linked oligosaccharide biosynthetic process (biological process)
GO:0015074 - DNA integration (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003676 - nucleic acid binding (molecular function)
GO:0030246 - carbohydrate binding (molecular function)
GO:0047938 - glucose-6-phosphate 1-epimerase activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR021109 - Aspartic peptidase domain superfamily
IPR043128 - Reverse transcriptase/Diguanylate cyclase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050511.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]9.9e-17366.81Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTE-QRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYG
        EL MLVV     E EI+EE+  + + E+  ++V   E   +ELS+NS+VGL+NPGTMK KGK+ G EV++LIDCGATHNFI+E LV  L +  + T NYG
Subjt:  ELLMLVVMGENVEYEIIEEDNTE-QRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYG

Query:  AILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRG
         ILGS TA+KGKGVC+ +E+ L  W+V D FL L+LGGVD IL MQWL+SLG+TEVDWK L+LTF HQGKKVVIRGDPS TKARV+LKNLMKS+G +D+G
Subjt:  AILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRG

Query:  FLVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSP
        FLVECR +E     E E   +      E +A +L+RF  VFEWP TLPPQR I+HHI+LK G DPVNVRPYRYA+ QK EMERLV+EML S +IR S SP
Subjt:  FLVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSP

Query:  YSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQS
        YSSPVLLVRK+DGSWRFCVDYRALNNVT+PDKFPI VIEELFDEL  A++F+KIDL+ GYHQIRMC +DIEKTAFRTHEGH EF+VMPFGLTNA STFQ+
Subjt:  YSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQS

Query:  LMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ
        LMN +F+ YLR+FVLVFFDDILVYS+G+E+HL H++ VL +L++ ELY N +KCSFA+
Subjt:  LMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ

TYJ96875.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-17267.25Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA
        EL M VV   N E EI+EE  T+  EL T+EV  +    VELSINS+VGL++PGTMK +G LQG+EV++LIDCGATHNF+SE LV  LQ+  K T++YG 
Subjt:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA

Query:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF
        ILGS TAI+GKG+CE+IE+ + DW V ++FL LELGGVD IL MQWLYSLG+T  DWKNL LTF    KK+ I+GDPS TKARV+LKNL+K++ E D G+
Subjt:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF

Query:  LVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPY
        L+ECR++    +       +E   IEE +  +L +F+D+FEWPE LPP+R IEH IHLK+GT+PVNVRPYRYAY QK EMERLV EMLAS +IR S SPY
Subjt:  LVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPY

Query:  SSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL
        SSPVLLV+KKDGSWRFCVDYRALNNVTVPDKFPI V+EELFDEL  A++FTKIDL+ GYHQIRM   DIEKTAFRTHEGH EF+VMPFGLTNA +TFQSL
Subjt:  SSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL

Query:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF
        MN+IFR YLR+FVLVFFDDIL+YS+ LEDHL H++ V  VLRK+EL+AN+KKCSF
Subjt:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF

TYK20792.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]3.4e-17365.8Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA
        EL M VV  E  EYEI+EE+N E+REL+ IE+ E+  TVVELSINS+VGL++PGTMK +GKL G EV++LIDCGATHNF+SE LVK+L +  K TS+YG 
Subjt:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA

Query:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF
        ILGS  A++GKGVCE +E+ +G W++V++FL LELGGVD IL MQWLYSLG+T VDWKNL +TF   GK+V I+GDPS TKAR++LK L+K++ ++D G+
Subjt:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF

Query:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS
        L+ECR+L+ +   E E S      + ++V     + V+++F DVFEWPE LPP+R IEHHIHLK+GT+P+NVRPYRY + QK EME+LV+EML S VIR 
Subjt:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS

Query:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS
        STSPYSSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPI V+EELFDEL  AT+F+KIDL+ GYHQIRM  +D+EKTAFRTHEGH EF+VMPFGLTNA +
Subjt:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS

Query:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ
        TFQ+LMN IF+ +LRKFVLVFFDDILVYS+  E+H  HMK VL +LR+NELYAN+KKC FAQ
Subjt:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ

TYK20833.1 Ty3/gypsy retrotransposon protein [Cucumis melo var. makuwa]1.3e-17265.58Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA
        EL M VV  +  EYEI+EE+N E+REL+ IE+ E+  TVVELSINS+VGL++PGTMK +GKL G EV+VLIDCGATHNF+SE LVK+L +  K TS+YG 
Subjt:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA

Query:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF
        ILGS  A++GKGVCE +E+ +G W++V++FL LELGGVD IL MQWLYSLG+T VDWKNL +TF   GK+V I+GDPS TKAR++LK L+K++ ++D G+
Subjt:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF

Query:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS
        L+ECR+L+ +   E E S+     + ++V     + V+++F DVF+WPE LPP+R IEHHIHLK+GT+P+NVRPYRY + QK EME+LV EML S VIR 
Subjt:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS

Query:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS
        STSPYSSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPI V+EELFDEL  AT+F+KIDL+ GYHQIRM  +D+EKTAFRTHEGH EF+VMPFGLTNA +
Subjt:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS

Query:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ
        TFQ+LMN IF+ +LRKFVLVFFDDILVYS+  E+H  HMK VL +LR+NELYAN+KKC FAQ
Subjt:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ

XP_031745972.1 uncharacterized protein LOC116406393 [Cucumis sativus]1.7e-18570.04Show/hide
Query:  LMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAIL
        L + V+ E+ EYEI+EE   ++ ELN +E+  E+Q +VELSINS+VGL+NPGTMK +GK++ REVI+LIDCGATHNFIS+ +V+EL + TK TS+YG IL
Subjt:  LMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAIL

Query:  GSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGFLV
        GS  A+KGKG+CE IE+ L  W+V   FL LELGGVD +LEMQWLYSLG+TEVDWKNL +TF H GKKV I+GDPS TKA V LKN++KS+ + D+GFL+
Subjt:  GSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGFLV

Query:  ECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSS
        ECRA+E      E++  +EVL ++E+V+ VLK+FEDVF WPETLPP+R IEHHI+LK+GTDPVNVRPYRY YQQK EMERLVEEML+S VIR S SPYSS
Subjt:  ECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSS

Query:  PVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMN
        PVLLVRKKDGSWRFCVDYR LN+VT+PDKFPI VIEELFDEL+ A  F+KIDL+ GYHQIRM + DIEKTAFRTHEGH EF+VMPFGLTNA STFQSLMN
Subjt:  PVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMN

Query:  TIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFA
        T+F+ YLRKF+LVFFDDIL+YSK LE HL H+   LE+LR+NELYAN+KKCSFA
Subjt:  TIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFA

TrEMBL top hitse value%identityAlignment
A0A5A7UAE4 Ty3/gypsy retrotransposon protein4.8e-17366.81Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTE-QRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYG
        EL MLVV     E EI+EE+  + + E+  ++V   E   +ELS+NS+VGL+NPGTMK KGK+ G EV++LIDCGATHNFI+E LV  L +  + T NYG
Subjt:  ELLMLVVMGENVEYEIIEEDNTE-QRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYG

Query:  AILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRG
         ILGS TA+KGKGVC+ +E+ L  W+V D FL L+LGGVD IL MQWL+SLG+TEVDWK L+LTF HQGKKVVIRGDPS TKARV+LKNLMKS+G +D+G
Subjt:  AILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRG

Query:  FLVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSP
        FLVECR +E     E E   +      E +A +L+RF  VFEWP TLPPQR I+HHI+LK G DPVNVRPYRYA+ QK EMERLV+EML S +IR S SP
Subjt:  FLVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSP

Query:  YSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQS
        YSSPVLLVRK+DGSWRFCVDYRALNNVT+PDKFPI VIEELFDEL  A++F+KIDL+ GYHQIRMC +DIEKTAFRTHEGH EF+VMPFGLTNA STFQ+
Subjt:  YSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQS

Query:  LMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ
        LMN +F+ YLR+FVLVFFDDILVYS+G+E+HL H++ VL +L++ ELY N +KCSFA+
Subjt:  LMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ

A0A5A7V5H5 Ty3/gypsy retrotransposon protein6.2e-17367.25Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA
        EL M VV   N E EI+EE  T+  EL T+EV  +    VELSINS+VGL++PGTMK +G LQG+EV++LIDCGATHNF+SE LV  LQ+  K T++YG 
Subjt:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA

Query:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF
        ILGS TAI+GKG+CE+IE+ + DW V ++FL LELGGVD IL MQWLYSLG+T  DWKNL LTF    KK+ I+GDPS TKARV+LKNL+K++ E D G+
Subjt:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF

Query:  LVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPY
        L+ECR++    +       +E   IEE +  +L +F+D+FEWPE LPP+R IEH IHLK+GT+PVNVRPYRYAY QK EMERLV EMLAS +IR S SPY
Subjt:  LVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPY

Query:  SSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL
        SSPVLLV+KKDGSWRFCVDYRALNNVTVPDKFPI V+EELFDEL  A++FTKIDL+ GYHQIRM   DIEKTAFRTHEGH EF+VMPFGLTNA +TFQSL
Subjt:  SSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL

Query:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF
        MN+IFR YLR+FVLVFFDDIL+YS+ LEDHL H++ V  VLRK+EL+AN+KKCSF
Subjt:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF

A0A5D3BEL2 Ty3/gypsy retrotransposon protein6.2e-17367.25Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA
        EL M VV   N E EI+EE  T+  EL T+EV  +    VELSINS+VGL++PGTMK +G LQG+EV++LIDCGATHNF+SE LV  LQ+  K T++YG 
Subjt:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA

Query:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF
        ILGS TAI+GKG+CE+IE+ + DW V ++FL LELGGVD IL MQWLYSLG+T  DWKNL LTF    KK+ I+GDPS TKARV+LKNL+K++ E D G+
Subjt:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF

Query:  LVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPY
        L+ECR++    +       +E   IEE +  +L +F+D+FEWPE LPP+R IEH IHLK+GT+PVNVRPYRYAY QK EMERLV EMLAS +IR S SPY
Subjt:  LVECRALERRESLEEEDSFDEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPY

Query:  SSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL
        SSPVLLV+KKDGSWRFCVDYRALNNVTVPDKFPI V+EELFDEL  A++FTKIDL+ GYHQIRM   DIEKTAFRTHEGH EF+VMPFGLTNA +TFQSL
Subjt:  SSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL

Query:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF
        MN+IFR YLR+FVLVFFDDIL+YS+ LEDHL H++ V  VLRK+EL+AN+KKCSF
Subjt:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF

A0A5D3DB20 Ty3/gypsy retrotransposon protein1.6e-17365.8Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA
        EL M VV  E  EYEI+EE+N E+REL+ IE+ E+  TVVELSINS+VGL++PGTMK +GKL G EV++LIDCGATHNF+SE LVK+L +  K TS+YG 
Subjt:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA

Query:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF
        ILGS  A++GKGVCE +E+ +G W++V++FL LELGGVD IL MQWLYSLG+T VDWKNL +TF   GK+V I+GDPS TKAR++LK L+K++ ++D G+
Subjt:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF

Query:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS
        L+ECR+L+ +   E E S      + ++V     + V+++F DVFEWPE LPP+R IEHHIHLK+GT+P+NVRPYRY + QK EME+LV+EML S VIR 
Subjt:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS

Query:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS
        STSPYSSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPI V+EELFDEL  AT+F+KIDL+ GYHQIRM  +D+EKTAFRTHEGH EF+VMPFGLTNA +
Subjt:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS

Query:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ
        TFQ+LMN IF+ +LRKFVLVFFDDILVYS+  E+H  HMK VL +LR+NELYAN+KKC FAQ
Subjt:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ

A0A5D3DT65 Ty3/gypsy retrotransposon protein6.2e-17365.58Show/hide
Query:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA
        EL M VV  +  EYEI+EE+N E+REL+ IE+ E+  TVVELSINS+VGL++PGTMK +GKL G EV+VLIDCGATHNF+SE LVK+L +  K TS+YG 
Subjt:  ELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGA

Query:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF
        ILGS  A++GKGVCE +E+ +G W++V++FL LELGGVD IL MQWLYSLG+T VDWKNL +TF   GK+V I+GDPS TKAR++LK L+K++ ++D G+
Subjt:  ILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGF

Query:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS
        L+ECR+L+ +   E E S+     + ++V     + V+++F DVF+WPE LPP+R IEHHIHLK+GT+P+NVRPYRY + QK EME+LV EML S VIR 
Subjt:  LVECRALERRESLEEEDSFDEVLTIEESV-----AVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRS

Query:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS
        STSPYSSPVLLV+KKDGSWRFCVDYRA+NN T+PDKFPI V+EELFDEL  AT+F+KIDL+ GYHQIRM  +D+EKTAFRTHEGH EF+VMPFGLTNA +
Subjt:  STSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAAS

Query:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ
        TFQ+LMN IF+ +LRKFVLVFFDDILVYS+  E+H  HMK VL +LR+NELYAN+KKC FAQ
Subjt:  TFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQ

SwissProt top hitse value%identityAlignment
P04323 Retrovirus-related Pol polyprotein from transposon 17.62.3e-3937.12Show/hide
Query:  LERRESLEEEDSFDEVLTIEESVAVVLKRFEDV-FEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVL
        L R E L  E+        ++ +  +L+++ D+ +   + L      +H I+ K      +   Y  AY+Q  E+E  +++ML   +IR+S SPY+SP+ 
Subjt:  LERRESLEEEDSFDEVLTIEESVAVVLKRFEDV-FEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVL

Query:  LVRKKDGS-----WRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL
        +V KK  +     +R  +DYR LN +TV D+ PI  ++E+  +L     FT IDL  G+HQI M  + + KTAF T  GH E++ MPFGL NA +TFQ  
Subjt:  LVRKKDGS-----WRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSL

Query:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQFEWTTWG
        MN I R  L K  LV+ DDI+V+S  L++HL  +  V E L K  L     KC F + E T  G
Subjt:  MNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSFAQFEWTTWG

P20825 Retrovirus-related Pol polyprotein from transposon 2973.5e-4029.48Show/hide
Query:  QGREVIVLIDCGATHNFISEGLVKELQINTK---ITSNYGAILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKN
        +GR    L+D G+T N I+E +      N++   +TSN    L     +    +          ++  + F          +L  + L     + +++KN
Subjt:  QGREVIVLIDCGATHNFISEGLVKELQINTK---ITSNYGAILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKN

Query:  LILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGFLVECRALE-RRESLEEEDSFDEVLTIEESVAVVLKRFEDV-FEWPETLPPQRLIEHHIH
          +T   Q  K++     S+    + ++   +S    D+  + +    + R + L +E++F         +  +L +F ++ ++  E L     I+H ++
Subjt:  LILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGFLVECRALE-RRESLEEEDSFDEVLTIEESVAVVLKRFEDV-FEWPETLPPQRLIEHHIH

Query:  LKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKD-----GSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTK
            + P+  + Y  A   + E+E  V+EML   +IR S SPY+SP  +V KK        +R  +DYR LN +T+PD++PI  ++E+  +L +   FT 
Subjt:  LKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKD-----GSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFTK

Query:  IDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKK
        IDL  G+HQI M  + I KTAF T  GH E++ MPFGL NA +TFQ  MN I R  L K  LV+ DDI+++S  L +HLN ++ V   L    L     K
Subjt:  IDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKKK

Query:  CSFAQFE
        C F + E
Subjt:  CSFAQFE

Q7LHG5 Transposon Ty3-I Gag-Pol polyprotein6.9e-4445.19Show/hide
Query:  IEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFT
        ++H I +K G     ++PY    + + E+ ++V+++L +K I  S SP SSPV+LV KKDG++R CVDYR LN  T+ D FP+  I+ L   +  A +FT
Subjt:  IEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFT

Query:  KIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKK
         +DL  GYHQI M   D  KTAF T  G  E+ VMPFGL NA STF   M   FR    +FV V+ DDIL++S+  E+H  H+  VLE L+   L   KK
Subjt:  KIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKK

Query:  KCSFAQFE
        KC FA  E
Subjt:  KCSFAQFE

Q8I7P9 Retrovirus-related Pol polyprotein from transposon opus6.9e-3636.44Show/hide
Query:  EESVAVVLKRFEDVFEWP-ETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKK-----DGSWRFCVD
        +E +  +L  F  +FE P   +  +  ++  I      DP+  + Y Y    + E+ER ++E+L   +IR S SPY+SP+ +V KK     +  +R  VD
Subjt:  EESVAVVLKRFEDVFEWP-ETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKK-----DGSWRFCVD

Query:  YRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDD
        ++ LN VT+PD +PI  I      L  A  FT +DL  G+HQI M   DI KTAF T  G  EF+ +PFGL NA + FQ +++ I R ++ K   V+ DD
Subjt:  YRALNNVTVPDKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDD

Query:  ILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF
        I+V+S+  + H  +++ VL  L K  L  N +K  F
Subjt:  ILVYSKGLEDHLNHMKAVLEVLRKNELYANKKKCSF

Q99315 Transposon Ty3-G Gag-Pol polyprotein6.9e-4445.19Show/hide
Query:  IEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFT
        ++H I +K G     ++PY    + + E+ ++V+++L +K I  S SP SSPV+LV KKDG++R CVDYR LN  T+ D FP+  I+ L   +  A +FT
Subjt:  IEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVPDKFPIQVIEELFDELHEATMFT

Query:  KIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKK
         +DL  GYHQI M   D  KTAF T  G  E+ VMPFGL NA STF   M   FR    +FV V+ DDIL++S+  E+H  H+  VLE L+   L   KK
Subjt:  KIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLEVLRKNELYANKK

Query:  KCSFAQFE
        KC FA  E
Subjt:  KCSFAQFE

Arabidopsis top hitse value%identityAlignment
AT3G29750.1 Eukaryotic aspartyl protease family protein3.7e-1630.46Show/hide
Query:  LNTIEVTEEEQTVVELSINSMV-GLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAILGSRTAIKGKGVCEAIEIMLGDWR
        +N +E  E++   +   +  +V  L+    M+  G +   +V+V ID GAT NFI   L   L++ T IT+    +LG R  I+  G C  I + + +  
Subjt:  LNTIEVTEEEQTVVELSINSMV-GLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAILGSRTAIKGKGVCEAIEIMLGDWR

Query:  VVDEFLSLELG--GVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEED
        + + FL L+L    VD IL  +WL  LG T V+W+N   +F+H  + + +  +  + + +V  K  MKS  E++
Subjt:  VVDEFLSLELG--GVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEED

AT3G30770.1 Eukaryotic aspartyl protease family protein1.8e-1028.57Show/hide
Query:  EEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSL
        E+ +T+ ++   S    +    M+  G +   +V+V+ID GAT+NFIS+ L   L++ T  T+    +LG R  I+  G C  I +++ +  + + FL L
Subjt:  EEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAILGSRTAIKGKGVCEAIEIMLGDWRVVDEFLSL

Query:  EL--GGVDAI--------LEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDR
        +L    VD I        LE QWL         W N   +F H  + V +     + + +V  K  MKS  E+++
Subjt:  EL--GGVDAI--------LEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDR

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding6.5e-0527.14Show/hide
Query:  GTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAILGSRTAIKGKGVCEAIEIMLGDWRVVDEFL--SLELGGVDAILEMQWLYSLG
        GT+K + +L G EV     C     F    L +E+  + ++ S+             K  C+ I + + D  +V+++    L+   VD IL  +WL  LG
Subjt:  GTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAILGSRTAIKGKGVCEAIEIMLGDWRVVDEFL--SLELGGVDAILEMQWLYSLG

Query:  ITEVDWKNLILTFTHQGKKVVI---RGDPSQTKARVNLKN
         TEV+W+N   +F H    V +     D  Q   RV +++
Subjt:  ITEVDWKNLILTFTHQGKKVVI---RGDPSQTKARVNLKN

ATMG00850.1 DNA/RNA polymerases superfamily protein1.0e-0553.85Show/hide
Query:  QKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKDGSW
        ++T ++  + EML +++I+ S SPYSSPVLLV+KKDG W
Subjt:  QKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKDGSW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAATTACTTATGCTGGTAGTAATGGGAGAGAATGTTGAGTATGAAATCATTGAAGAAGATAACACTGAACAGAGGGAGTTGAACACGATTGAGGTTACAGAAGAAGA
ACAAACAGTGGTAGAATTGTCAATAAATTCAATGGTGGGGCTATCCAATCCGGGAACGATGAAGGGGAAGGGAAAGTTACAAGGAAGAGAAGTCATTGTATTGATAGATT
GTGGAGCTACACACAATTTTATCTCAGAAGGTTTAGTTAAGGAGTTACAGATTAATACCAAGATTACCTCAAATTATGGGGCCATTTTGGGTTCGAGAACAGCTATTAAA
GGAAAAGGAGTTTGTGAAGCCATTGAAATAATGTTAGGAGACTGGAGGGTGGTTGATGAATTCTTATCTTTGGAGTTAGGAGGGGTGGATGCAATACTGGAAATGCAATG
GTTGTATTCTTTAGGTATAACCGAGGTGGATTGGAAGAATTTGATCTTGACATTTACACATCAAGGGAAGAAGGTGGTTATAAGGGGGGATCCAAGCCAAACAAAGGCAA
GGGTCAACTTAAAAAATCTCATGAAATCCTATGGAGAAGAGGATCGGGGATTTTTAGTAGAGTGTCGTGCATTGGAAAGGAGAGAGTCATTGGAAGAGGAAGATTCGTTT
GATGAAGTGTTGACTATAGAAGAATCTGTAGCAGTGGTGTTGAAAAGATTTGAGGACGTCTTTGAATGGCCCGAGACATTACCTCCACAAAGATTGATAGAACATCATAT
CCATCTTAAAAAGGGAACTGACCCAGTAAATGTTCGTCCTTATCGGTATGCATATCAACAAAAAACAGAGATGGAAAGATTAGTGGAAGAGATGCTAGCATCAAAGGTAA
TAAGGTCGAGTACAAGCCCTTATTCCAGTCCCGTATTGCTGGTGAGAAAGAAGGATGGGAGCTGGCGTTTTTGTGTAGATTACAGGGCTCTAAATAATGTAACTGTACCA
GATAAGTTTCCAATACAAGTGATTGAGGAGTTGTTTGATGAGTTACATGAAGCTACTATGTTCACTAAGATAGATCTTAGGTTAGGATATCATCAAATTAGGATGTGTGC
AGATGATATTGAAAAGACAGCTTTCCGAACCCATGAGGGTCACTGTGAATTTATGGTGATGCCATTTGGATTGACAAACGCTGCGTCCACTTTTCAGTCACTAATGAATA
CTATATTCAGGTCATACCTTCGAAAGTTTGTCTTAGTATTTTTTGATGATATTCTGGTTTATAGCAAAGGATTGGAAGATCATTTGAACCATATGAAAGCAGTGTTAGAA
GTATTGAGGAAGAATGAATTATATGCGAATAAGAAGAAGTGTAGTTTTGCTCAGTTTGAGTGGACTACCTGGGGCACATTATTTCGGGAAATGGAGTTGAAGTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAATTACTTATGCTGGTAGTAATGGGAGAGAATGTTGAGTATGAAATCATTGAAGAAGATAACACTGAACAGAGGGAGTTGAACACGATTGAGGTTACAGAAGAAGA
ACAAACAGTGGTAGAATTGTCAATAAATTCAATGGTGGGGCTATCCAATCCGGGAACGATGAAGGGGAAGGGAAAGTTACAAGGAAGAGAAGTCATTGTATTGATAGATT
GTGGAGCTACACACAATTTTATCTCAGAAGGTTTAGTTAAGGAGTTACAGATTAATACCAAGATTACCTCAAATTATGGGGCCATTTTGGGTTCGAGAACAGCTATTAAA
GGAAAAGGAGTTTGTGAAGCCATTGAAATAATGTTAGGAGACTGGAGGGTGGTTGATGAATTCTTATCTTTGGAGTTAGGAGGGGTGGATGCAATACTGGAAATGCAATG
GTTGTATTCTTTAGGTATAACCGAGGTGGATTGGAAGAATTTGATCTTGACATTTACACATCAAGGGAAGAAGGTGGTTATAAGGGGGGATCCAAGCCAAACAAAGGCAA
GGGTCAACTTAAAAAATCTCATGAAATCCTATGGAGAAGAGGATCGGGGATTTTTAGTAGAGTGTCGTGCATTGGAAAGGAGAGAGTCATTGGAAGAGGAAGATTCGTTT
GATGAAGTGTTGACTATAGAAGAATCTGTAGCAGTGGTGTTGAAAAGATTTGAGGACGTCTTTGAATGGCCCGAGACATTACCTCCACAAAGATTGATAGAACATCATAT
CCATCTTAAAAAGGGAACTGACCCAGTAAATGTTCGTCCTTATCGGTATGCATATCAACAAAAAACAGAGATGGAAAGATTAGTGGAAGAGATGCTAGCATCAAAGGTAA
TAAGGTCGAGTACAAGCCCTTATTCCAGTCCCGTATTGCTGGTGAGAAAGAAGGATGGGAGCTGGCGTTTTTGTGTAGATTACAGGGCTCTAAATAATGTAACTGTACCA
GATAAGTTTCCAATACAAGTGATTGAGGAGTTGTTTGATGAGTTACATGAAGCTACTATGTTCACTAAGATAGATCTTAGGTTAGGATATCATCAAATTAGGATGTGTGC
AGATGATATTGAAAAGACAGCTTTCCGAACCCATGAGGGTCACTGTGAATTTATGGTGATGCCATTTGGATTGACAAACGCTGCGTCCACTTTTCAGTCACTAATGAATA
CTATATTCAGGTCATACCTTCGAAAGTTTGTCTTAGTATTTTTTGATGATATTCTGGTTTATAGCAAAGGATTGGAAGATCATTTGAACCATATGAAAGCAGTGTTAGAA
GTATTGAGGAAGAATGAATTATATGCGAATAAGAAGAAGTGTAGTTTTGCTCAGTTTGAGTGGACTACCTGGGGCACATTATTTCGGGAAATGGAGTTGAAGTGAATCCT
GAGAAGATCAGGGCTATAAAGGAGTGGCCAGTTCCAGTTAATGTAAGAGAGGTTCGAGGATTTCTTGGGTTGACTGAATATTATCACAATTTTGTTCAAAATTATGGAAC
AATTGCTGCTCCTCTAACACAGTTGTTGAAGATATGAGGGTTTAAATGGACAGAGGAAACACAAGGGGGGAAACACAAGGGGCTT
Protein sequenceShow/hide protein sequence
MELLMLVVMGENVEYEIIEEDNTEQRELNTIEVTEEEQTVVELSINSMVGLSNPGTMKGKGKLQGREVIVLIDCGATHNFISEGLVKELQINTKITSNYGAILGSRTAIK
GKGVCEAIEIMLGDWRVVDEFLSLELGGVDAILEMQWLYSLGITEVDWKNLILTFTHQGKKVVIRGDPSQTKARVNLKNLMKSYGEEDRGFLVECRALERRESLEEEDSF
DEVLTIEESVAVVLKRFEDVFEWPETLPPQRLIEHHIHLKKGTDPVNVRPYRYAYQQKTEMERLVEEMLASKVIRSSTSPYSSPVLLVRKKDGSWRFCVDYRALNNVTVP
DKFPIQVIEELFDELHEATMFTKIDLRLGYHQIRMCADDIEKTAFRTHEGHCEFMVMPFGLTNAASTFQSLMNTIFRSYLRKFVLVFFDDILVYSKGLEDHLNHMKAVLE
VLRKNELYANKKKCSFAQFEWTTWGTLFREMELK