CuGenDBv2

Sed0023069 (gene) of Chayote v1 genome

Gene ID: Sed0023069
Organism: Sechium edule (Chayote v1)
Description: Reverse transcriptase domain-containing protein
Genome location: LG09:35383803..35400892
RNA-Seq Expression: Sed0023069
Synteny: Sed0023069
Gene Ontology terms:
  GO:0048583 - regulation of response to stimulus (biological process)
  GO:0090304 - nucleic acid metabolic process (biological process)
  GO:0110165 - cellular anatomical structure (cellular component)
  GO:0003824 - catalytic activity (molecular function)
InterPro domains: NA
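
The genome location string above can be split into a sequence name and a coordinate pair when the record is processed by a script. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into (seqid, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("LG09:35383803..35400892")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 17090 bases on LG09.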


Homology
GenBank top hits | e-value | % identity
BBN67784.1 VIRB2-interacting protein 2 [Prunus dulcis] | 2.7e-123 | 46.3
Query:  QEDEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNT
        Q+++K   +  SS    +V + EIE E I+FF  LY+   E     +  +W+ I+ E+A  L+ PF E E+  A+   G +K+PG DGF+   ++  W+ 
Subjt:  QEDEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNT

Query:  LKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKR
        +K D+M+V  DFF  GIINA  NET+ICLIPKK E+  V+DFRPISL+  +YK++++VL+ RL+ VL   IS  Q+AFV+GR ILDA LIANEV++E +R
Subjt:  LKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKR

Query:  KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIH
         N+ G+  K+D+EKA+D V+W F+D++++ KGFG +WR WI GC+ +ANFS++ING+PRGK R +RGLRQGDPLSPFLF L+MD  SR+M +  +  + H
Subjt:  KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIH

Query:  GLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWE
        GL  G  +  ++HLQF DDT+   +   +  + L QI+  F  VSG+ IN +K  L+G+N+D+  ++ ++  +GC +G WP  YLGLPL  NP +  FW+
Subjt:  GLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWE

Query:  PMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        P+++KVE ++  WK   +SKGGRLT+I+A L ++P Y+MS+F++P  V   +E+++R+FL
Subjt:  PMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

BBN69746.1 VIRB2-interacting protein 2 [Prunus dulcis] | 3.9e-122 | 47.18
Query:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI
        +V + EIE E I+FF  LY+   E     +  +W+ I+ E+A  L+ PF E E+  A+   G +K+PG DGF+   ++  W+ +K D+M+V  DFF  GI
Subjt:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI

Query:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD
        INA  NET+ICLIPKK E+  V+DFRPISL+  +YK++++VL+ RL+ VL   IS  Q+AFV+GR ILDA LIANEV++E +R N+ G+  K+D+EKA+D
Subjt:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD

Query:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG
         V+W F+D++++ KGFG +WR WI GC+ +ANFS++ING+PRGK R +RGLRQGDPLSPFLF L+MD  SR+M +  +  + HGL  G  +  ++HLQF 
Subjt:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG

Query:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH
        DDT+   +   +  + L QI+  F  VSG+ IN +K  L+G+N+D+  ++ ++  +GC +G WP  YLGLPL  NP +  FW+P+++KVE ++  WK   
Subjt:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH

Query:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        +SKGGRLT+I+A L ++P Y+MS+F++P  V   +E+++R+FL
Subjt:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

VVA13439.1 Hypothetical predicted protein, partial [Prunus dulcis] | 3.9e-122 | 47.18
Query:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI
        +V + EIE E I+FF  LY+   E     +  +W+ I+ E+A  L+ PF E E+  A+   G +K+PG DGF+   ++  W+ +K D+M+V  DFF  GI
Subjt:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI

Query:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD
        INA  NET+ICLIPKK E+  V+DFRPISL+  +YK++++VL+ RL+ VL   IS  Q+AFV+GR ILDA LIANEV++E +R N+ G+  K+D+EKA+D
Subjt:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD

Query:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG
         V+W F+D++++ KGFG +WR WI GC+ +ANFS++ING+PRGK R +RGLRQGDPLSPFLF L+MD  SR+M +  +  + HGL  G  +  ++HLQF 
Subjt:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG

Query:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH
        DDT+   +   +  + L QI+  F  VSG+ IN +K  L+G+N+D+  ++ ++  +GC +G WP  YLGLPL  NP +  FW+P+++KVE ++  WK   
Subjt:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH

Query:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        +SKGGRLT+I+A L ++P Y+MS+F++P  V   +E+++R+FL
Subjt:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

VVA21938.1 Hypothetical predicted protein, partial [Prunus dulcis] | 3.9e-122 | 47.18
Query:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI
        +V + EIE E I+FF  LY+   E     +  +W+ I+ E+A  L+ PF E E+  A+   G +K+PG DGF+   ++  W+ +K D+M+V  DFF  GI
Subjt:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI

Query:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD
        INA  NET+ICLIPKK E+  V+DFRPISL+  +YK++++VL+ RL+ VL   IS  Q+AFV+GR ILDA LIANEV++E +R N+ G+  K+D+EKA+D
Subjt:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD

Query:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG
         V+W F+D++++ KGFG +WR WI GC+ +ANFS++ING+PRGK R +RGLRQGDPLSPFLF L+MD  SR+M +  +  + HGL  G  +  ++HLQF 
Subjt:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG

Query:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH
        DDT+   +   +  + L QI+  F  VSG+ IN +K  L+G+N+D+  ++ ++  +GC +G WP  YLGLPL  NP +  FW+P+++KVE ++  WK   
Subjt:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH

Query:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        +SKGGRLT+I+A L ++P Y+MS+F++P  V   +E+++R+FL
Subjt:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

VVA41200.1 PREDICTED: RNA-directed DNA polymerase, partial [Prunus dulcis] | 3.9e-122 | 47.18
Query:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI
        +V + EIE E I+FF  LY+   E     +  +W+ I+ E+A  L+ PF E E+  A+   G +K+PG DGF+   ++  W+ +K D+M+V  DFF  GI
Subjt:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI

Query:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD
        INA  NET+ICLIPKK E+  V+DFRPISL+  +YK++++VL+ RL+ VL   IS  Q+AFV+GR ILDA LIANEV++E +R N+ G+  K+D+EKA+D
Subjt:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD

Query:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG
         V+W F+D++++ KGFG +WR WI GC+ +ANFS++ING+PRGK R +RGLRQGDPLSPFLF L+MD  SR+M +  +  + HGL  G  +  ++HLQF 
Subjt:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG

Query:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH
        DDT+   +   +  + L QI+  F  VSG+ IN +K  L+G+N+D+  ++ ++  +GC +G WP  YLGLPL  NP +  FW+P+++KVE ++  WK   
Subjt:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH

Query:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        +SKGGRLT+I+A L ++P Y+MS+F++P  V   +E+++R+FL
Subjt:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

TrEMBL top hits | e-value | % identity
A0A5H2XKI7 VIRB2-interacting protein 2 | 1.3e-123 | 46.3
Query:  QEDEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNT
        Q+++K   +  SS    +V + EIE E I+FF  LY+   E     +  +W+ I+ E+A  L+ PF E E+  A+   G +K+PG DGF+   ++  W+ 
Subjt:  QEDEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNT

Query:  LKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKR
        +K D+M+V  DFF  GIINA  NET+ICLIPKK E+  V+DFRPISL+  +YK++++VL+ RL+ VL   IS  Q+AFV+GR ILDA LIANEV++E +R
Subjt:  LKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKR

Query:  KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIH
         N+ G+  K+D+EKA+D V+W F+D++++ KGFG +WR WI GC+ +ANFS++ING+PRGK R +RGLRQGDPLSPFLF L+MD  SR+M +  +  + H
Subjt:  KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIH

Query:  GLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWE
        GL  G  +  ++HLQF DDT+   +   +  + L QI+  F  VSG+ IN +K  L+G+N+D+  ++ ++  +GC +G WP  YLGLPL  NP +  FW+
Subjt:  GLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWE

Query:  PMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        P+++KVE ++  WK   +SKGGRLT+I+A L ++P Y+MS+F++P  V   +E+++R+FL
Subjt:  PMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

M5WKV4 Reverse transcriptase domain-containing protein (Fragment) | 8.4e-123 | 47.14
Query:  LQILSSNDSSLVD-DTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIM
        ++ L   D  +++ D +IE E I FF  LY+         +  +W PI+  +A  LE PF   E+  A+ + G +K+PG DGF+  F++  W  +K D+M
Subjt:  LQILSSNDSSLVD-DTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIM

Query:  RVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGI
        +V QDFF++GI+N   NET+ICLIPKK  +  V D+RPISL+  +YK+I++VL+ RL+ VL + IS SQ AFV+ R ILDA+L+ANEV++E +++ RKG+
Subjt:  RVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGI

Query:  FIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGT
          K+D EKA+D V+W+F+D ++  KGFG KWR WI GC+ S NFSI+INGKPRGK R +RGLRQGDPLSPFLF L+ D  SR++    + + +HG+  G 
Subjt:  FIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGT

Query:  KVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKV
            ++HLQF DDT+ L D   +    L Q++  F +VSG+ IN  K+ +LG+N     L+ ++  +GC++G WP  YLGLPL  NP +  FW P++ KV
Subjt:  KVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKV

Query:  EKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        EK++  WK   +SKGGRLTLI+A L+++P+Y+MS+FKMP  V A +E+++RNFL
Subjt:  EKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

M5X4S0 Reverse transcriptase domain-containing protein (Fragment) | 6.5e-123 | 47.36
Query:  LQILSSNDSSLVD-DTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIM
        ++ L   D  +++ D  IE E I FF  LY+         +  +W PI+  +A  LE PF   E+  A+   G +K+PG DGF+  F++  W  +K D+M
Subjt:  LQILSSNDSSLVD-DTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIM

Query:  RVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGI
        +V QDFF++GI+N   NET+ICLIPKK  +  V D+RPISL+  +YK+I++VL+ RL+ VL + IS SQ AFV+ R ILDA+L+ANEV++E +++ RKG+
Subjt:  RVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGI

Query:  FIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGT
          K+D EKA+D V+W+F+D +M  KGFG KWR WI GC+ S NFSI+INGKPRGK R +RGLRQGDPLSPFLF L+ D  SR++    + + +HG+  G 
Subjt:  FIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGT

Query:  KVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKV
            ++HLQF DDT+ L D   +    L Q++  F  VSG+ IN  K+ +LG+N     L+ ++  +GC++G WP  YLGLPL  NP +  FW P+++KV
Subjt:  KVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKV

Query:  EKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        EK++  WK   +SKGGRLTLI+A L+++P+Y+MS+FKMP  V A +E+++RNFL
Subjt:  EKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

M5XHS0 Reverse transcriptase domain-containing protein (Fragment) | 1.4e-122 | 47.14
Query:  LQILSSNDSSLVD-DTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIM
        ++ L   D  +++ D  IE E I FF  LY+         +  +W PI+  +A  LE PF   E+  A+ + G +K+PG DGF+  F++  W  +K D+M
Subjt:  LQILSSNDSSLVD-DTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIM

Query:  RVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGI
        +V QDFF++GI+N   NET+ICLIPKK  +  V D+RPISL+  +YK+I++VL  RL+ VL + IS SQ AFV+ R ILDA+L+ANEV++E +++ RKG+
Subjt:  RVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGI

Query:  FIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGT
          K+D EKA+D V+W+F+D +++ KGFG KWR WI GC+ S NFSI+INGKPRGK R +RGLRQGDPLSPFLF L+ D  SR++    + + +HG+  G 
Subjt:  FIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGT

Query:  KVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKV
            ++HLQF DDT+ L D   +    L Q++  F +VSG+ IN  K+ +LG+N     L+ ++  +GC++G WP  YLGLPL  NP +  FW P++ KV
Subjt:  KVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKV

Query:  EKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        EK++  WK   +SKGGRLTLI+A L+++P+Y+MS+FKMP  V A +E+++RNFL
Subjt:  EKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

M5XUF8 Reverse transcriptase domain-containing protein (Fragment) | 2.0e-124 | 48.31
Query:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI
        +V++ EIE E I+FF  LY+   E     +  +W+ I+ E+A  LE PF E E+  A+   G +K+PG DGF+   ++  W  +K D+M+V  DFF  GI
Subjt:  LVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGI

Query:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD
        INA  NET+ICLIPKK E+  V+DFRPISL+  +YK++++VL+ RL+ VL   IS  Q+AFV+GR ILDA LIANEV++E +R N+ G+  K+D+EKA+D
Subjt:  INASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFD

Query:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG
         V+W F+D++++ KGFG +WR WI GC+ +ANFS++ING+PRGKIR +RGLRQGDPLSPFLF L+MD  SR+M +  +  + HGL  G  +  ++HLQF 
Subjt:  TVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFG

Query:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH
        DDT+   +   +  + L QI+  F  VSG+ IN +K  L+G+N+D+  L+ L+  +GC++G+WP +YLGLPL  NP +  FW+P+++KVE ++  WK   
Subjt:  DDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQH

Query:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
        +SKGGRLT+I+A L ++P Y+MSVF++P  V   +E+++R+FL
Subjt:  ISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

SwissProt top hits | e-value | % identity
O00370 LINE-1 retrotransposable element ORF2 protein | 8.9e-45 | 27.22
Query:  KIAKSNGWNLGM--KIRSCFTDMWLQEDEKVLLQILSSNDSSL-VDDTEIEGEFISFFNKLYTKR----DEPRTLPQIQDWSPINFEQAHHLESPFTETE
        KI +S  W      KI      +  ++ EK  +  + ++   +  D TEI+     ++  LY  +    +E  T         +N E+   L  P T +E
Subjt:  KIAKSNGWNLGM--KIRSCFTDMWLQEDEKVLLQILSSNDSSL-VDDTEIEGEFISFFNKLYTKR----DEPRTLPQIQDWSPINFEQAHHLESPFTETE

Query:  IWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKK-VEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPH
        I   IN L T K+PG DGFTAEFY++Y   L   ++++FQ   K GI+  S  E  I LIPK   +     +FRPISL+    KI+ ++L+ R+++ +  
Subjt:  IWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKK-VEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPH

Query:  IISDSQAAFVEGRNILDAILIANEVIDEWKR-KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGL
        +I   Q  F+ G      I  +  VI    R K++  + I +D EKAFD +   F+ K +   G    +   I         +II+NG+         G 
Subjt:  IISDSQAAFVEGRNILDAILIANEVIDEWKR-KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGL

Query:  RQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDL
        RQG PLSP LF ++++  +R + +E E   I G+ +G +   L+   F DD ++  +  + +   L ++++ F +VSG  IN  K++    N +      
Subjt:  RQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDL

Query:  LSKQYGCKLGSWPTTYLGLPLISNPNS--NVFWEPMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSV--FKMPSRVVATLERMIRNFL
        +  +    + S    YLG+ L  +        ++P+L+++++    WKN   S  GR+ ++K  +     Y  +    K+P      LE+    F+
Subjt:  LSKQYGCKLGSWPTTYLGLPLISNPNS--NVFWEPMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSV--FKMPSRVVATLERMIRNFL

P08548 LINE-1 reverse transcriptase homolog | 9.5e-39 | 26.22
Query:  EKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDW------SPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKY
        + ++  I + ND    D +EI+     ++ KLY+ + E   L +I  +        ++ ++   L  P + +EI   I  L   K+PG DGFT+EFY+ +
Subjt:  EKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDW------SPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKY

Query:  WNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKK-VEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGR----NILDAILIAN
           L   ++ +FQ+  K GI+  +  E  I LIPK   +     ++RPISL+    KI+ ++L+ R+++ +  II   Q  F+ G     NI  +I +  
Subjt:  WNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKK-VEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGR----NILDAILIAN

Query:  EVIDEWKRKNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSE
         +    K KN+  + + +D EKAFD +   F+ + +   G    +   I    S    +II+NG          G RQG PLSP LF ++M+  +  + E
Subjt:  EVIDEWKRKNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSE

Query:  EVEKHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKT-ELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLIS
        E     I G++IG++   L+   F DD ++  +   D+   L +++  +  VSG  IN +K+   +  N ++A+   +       +      YLG+ L  
Subjt:  EVEKHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKT-ELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLIS

Query:  NPNS--NVFWEPMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSV--FKMPSRVVATLERMIRNFL
        +        +E + +++ + +  WKN   S  GR+ ++K ++     Y  +    K P      LE++I +F+
Subjt:  NPNS--NVFWEPMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSV--FKMPSRVVATLERMIRNFL

P11369 LINE-1 retrotransposable element ORF2 protein | 4.7e-38 | 26.33
Query:  DEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKR----DEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYW
        D+ ++ +I +       D  EI+    SF+ +LY+ +    DE            +N +Q  HL SP +  EI   IN L T K+PG DGF+AEFY+ + 
Subjt:  DEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKR----DEPRTLPQIQDWSPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYW

Query:  NTLKFDIMRVFQDFFKNGIINASLNETYICLIPK-KVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGR----NILDAILIANE
          L   + ++F      G +  S  E  I LIPK + +   + +FRPISL+    KI+ ++L+ R++  +  II   Q  F+ G     NI  +I + + 
Subjt:  NTLKFDIMRVFQDFFKNGIINASLNETYICLIPK-KVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGR----NILDAILIANE

Query:  VIDEWKRKNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEE
        +    K K++  + I LD EKAFD +   F+ K++   G    +   I    S    +I +NG+    I    G RQG PLSP+LF ++++  +R + ++
Subjt:  VIDEWKRKNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEE

Query:  VEKHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNP
         E   I G+ IG +   ++ L   DD ++      ++   L  ++ +F +V G  IN NK+       ++     + +     + +    YLG+ L    
Subjt:  VEKHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPTTYLGLPLISNP

Query:  NS--NVFWEPMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSV--FKMPSRVVATLERMIRNFL
            +  ++ + +++++ +  WK+   S  GR+ ++K  +     Y  +    K+P++    LE  I  F+
Subjt:  NS--NVFWEPMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSV--FKMPSRVVATLERMIRNFL

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.0e-40 | 27.41
Query:  DEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRD-EPRTLPQIQDWSPINFE-QAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNT
        + K +  + + + + L D   I     SF+  L++     P    ++ D  P+  E +   LE+P T  E+  A+  +  NK+PG DG T EF++ +W+T
Subjt:  DEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRD-EPRTLPQIQDWSPINFE-QAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNT

Query:  LKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKR
        L  D  RV  + FK G +  S     + L+PKK + + + ++RP+SL+   YKI+A+ +S RLK VL  +I   Q+  V GR I D + +  +++   +R
Subjt:  LKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKR

Query:  KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIH
              F+ LD EKAFD VD  +L   +    FG ++  ++    +SA   + IN      +   RG+RQG PLS  L+ L ++ F  ++     + ++ 
Subjt:  KNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIH

Query:  GLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPT---TYLGLPLISN--PNS
        GL +      +    + DD +L++   VD ++   +    +   S   IN++K+   GL      +D L   +  +  SW +    YLG+ L +   P S
Subjt:  GLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWPT---TYLGLPLISN--PNS

Query:  NVFWEPMLQKVEKKILSWKN--QHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
          F E + + V  ++  WK   + +S  GR  +I   +A+   Y +         +A ++R + +FL
Subjt:  NVFWEPMLQKVEKKILSWKN--QHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM | 4.6e-17 | 22.53
Query:  WLQEDEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLE---SPFTETEIWDAINKLGTNKTPGQDGFTAEFYK
        W +   + +  +L+  D S++   EI    + ++ ++ T+       P     S    +  H LE   S  TE ++    +++  + +PG DG T +  +
Subjt:  WLQEDEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDWSPINFEQAHHLE---SPFTETEIWDAINKLGTNKTPGQDGFTAEFYK

Query:  KYWNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVI
        +  + +   ++R+       G +  S+       IPK V AK   DFRPIS+   + + +  +L+ RL   +       Q  F+      D   I + V+
Subjt:  KYWNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQRLKRVLPHIISDSQAAFVEGRNILDAILIANEVI

Query:  DEWKRKNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVE
            +  R      LD+ KAFD++    +   +   G    +  ++         S+  +G    +    RG++QGDPLSP LF L+MD   R +  E+ 
Subjt:  DEWKRKNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVE

Query:  KHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGL
             G  +G  + +     F DD +L ++  +  +  L      F  + GL +N +K   +G+
Subjt:  KHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGL

Arabidopsis top hits | e-value | % identity
AT1G43760.1 DNAse I-like superfamily protein | 7.1e-13 | 29.34
Query:  FTDMWLQEDEKVLLQILSSNDSSLVDD-TEIEGEFISFFNKLYTKRDE---PRTLPQIQDWSPI--NFEQAHHLESPFTETEIWDAINKLGTNKTPGQDG
        F  + L    K L++ L  +D   V++ T+++   ++++  L     +   P ++ +I+D  P   N   A  L +  ++ EI  A+  +  NK PG D 
Subjt:  FTDMWLQEDEKVLLQILSSNDSSLVDD-TEIEGEFISFFNKLYTKRDE---PRTLPQIQDWSPI--NFEQAHHLESPFTETEIWDAINKLGTNKTPGQDG

Query:  FTAEFYKKYWNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKII
        FTAEF+ + W  +K   +   ++FF+ G +    N T I LIPK      ++ FRP+S    +YKII
Subjt:  FTAEFYKKYWNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 4.7e-09 | 34.18
Query:  QRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKN--RKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKW
        +RLK ++ ++I  +QA+F+ GR   D I+   E +   +RK   +  + +KLD+EKA+D + WD+L+  ++  GF   W
Subjt:  QRLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKN--RKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 1.6e-12 | 45.59
Query:  IINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFGDDT
        IING P+G +  +RGLRQGDPLSP+LFIL  +  S +     E+ ++ G+ +    P + HL F DDT
Subjt:  IINGKPRGKIRETRGLRQGDPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFGDDT
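
The % identity values in the hit tables above relate to the middle line of each alignment block. As a rough illustration, assuming the usual BLAST-style convention in which a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap, the counts can be recovered as in the sketch below. The function name `midline_stats` is hypothetical, and the sketch takes the middle-line length as the alignment length, which assumes the line was copied without losing trailing spaces.

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives on a BLAST-style alignment middle line."""
    length = len(midline)
    if length == 0:
        raise ValueError("empty middle line")
    identical = sum(ch.isalpha() for ch in midline)  # letters = identical residues
    positives = identical + midline.count("+")       # '+' = conservative substitution
    return {
        "length": length,
        "identical": identical,
        "positives": positives,
        "percent_identity": 100.0 * identical / length,
        "percent_positives": 100.0 * positives / length,
    }


# First 20 columns of the middle line of the first GenBank alignment block.
segment = "Q+++K" + " " * 3 + "+" + " " * 2 + "SS" + " " * 4 + "+V" + " "
print(midline_stats(segment))
```

To reproduce a table value, the middle lines of all blocks belonging to one hit would be concatenated before counting; short segments such as the one shown can differ considerably from the whole-alignment figure.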


Sequences
CDS sequence
ATGATATCGAGGAAAGGGATCGATTGCATCCTATCCAAATCTGCCAAAGGAATTCTATCAAAGCACAATTGCTTGACATTGCAGTCAAAGAAGAGACCTTATGGAGGCAA
AATTGCAAAGTCAAATGGATGGAATTTGGGGATGAAAATTCGAAGTTGTTTCACCGATATGTGGCTGCAAGAAGACGAAAAAGTACTATTACAGATTCTCTCTAGCAATG
ACAGTAGTTTGGTTGATGATACAGAGATTGAAGGTGAATTCATTAGCTTCTTTAACAAGCTATACACTAAGAGAGATGAGCCGAGGACCCTCCCTCAAATTCAAGACTGG
AGTCCGATTAACTTTGAGCAAGCTCACCATCTAGAATCACCATTCACCGAAACAGAAATCTGGGATGCCATCAACAAATTGGGAACCAACAAAACTCCAGGGCAAGATGG
ATTCACTGCTGAGTTCTATAAAAAATATTGGAACACCCTTAAGTTTGATATAATGAGGGTGTTCCAAGATTTTTTTAAGAACGGAATTATCAACGCAAGCCTCAATGAGA
CTTATATCTGCCTCATTCCCAAAAAAGTAGAAGCTAAATGTGTTAATGACTTTAGACCCATCAGTTTGATCCCCTGTATGTACAAGATTATTGCTAGAGTATTATCTCAA
AGACTAAAAAGAGTCTTACCTCACATTATCTCTGATTCTCAAGCTGCATTTGTGGAGGGAAGGAACATTTTAGATGCCATCCTTATTGCTAACGAAGTTATTGACGAGTG
GAAAAGAAAGAATAGAAAAGGAATATTCATCAAATTAGATATTGAAAAAGCTTTTGACACGGTGGATTGGGACTTTCTTGACAAAATAATGATGATAAAAGGCTTTGGCA
CAAAATGGAGACTTTGGATACATGGTTGTATCTCATCGGCGAATTTTTCCATCATTATTAATGGAAAACCGAGGGGAAAAATAAGGGAAACAAGAGGGCTTAGACAAGGA
GATCCTCTTTCTCCTTTTCTATTTATTCTGATCATGGATTGTTTTAGTAGAATGATGTCAGAGGAAGTCGAGAAGCATAAAATTCATGGGCTTTATATTGGTACCAAGGT
TCCTAGCCTTACTCATCTACAGTTTGGAGATGATACTCTCCTTTTGTCAGACTATAATGTGGATGCCATTGATGCACTCTTTCAAATAGTTGCAACTTTTGAACAAGTAT
CAGGGCTTAACATCAACTTTAACAAAACAGAGTTACTTGGGCTGAACATTGACGAAGCTGACCTGGATTTGCTTTCTAAACAATATGGATGTAAGCTTGGCTCATGGCCA
ACAACCTATTTGGGCCTTCCTCTTATTAGTAATCCTAATTCCAATGTTTTTTGGGAACCAATGCTACAGAAAGTGGAAAAGAAGATCTTATCTTGGAAGAATCAACATAT
ATCGAAGGGTGGGAGGTTGACTCTCATCAAAGCCACTCTTGCTAATCTCCCCACATACTTTATGTCGGTGTTTAAAATGCCTAGTAGAGTTGTAGCTACCTTGGAAAGAA
TGATTAGAAACTTCCTCTAG
mRNA sequence
AAAAAAGCTCCGGTAATTCAATAATCTCACTCAAAGTATCACATTTCTTCCATGCAACCTATCAACACGACACGTGTCATTCACTCCTTCTCCGTTCTTCACCCTCCGTG
GATGCTCTGTTCCAAATCCAAAACCCTCAGCTCCATCATCCATGGCGGCGAAGCGGCCATGTCCGTCGCGAAACCCTAGAATCCAAGTCCAGCCATGGACCTTCTTCTCG
ACGCCATCTTAGGCCTTTCCGACTCCATCTCTATCGACCTCTCATTCGAGCGCCCTCAGAATGGGCGCCTTCTTGCTCCAGGTTGCGAGACGATCCGATCGCTGCCTCGC
ATAACTCCGTCGTTTGGGCTCTTCCTCCTGATATCACTATCAAAGTTTTTTCCATGCTTGACACTCATAGCCTTTGCTATGCAATAGCTACTTCAATTTTTCACAAGTGT
GCAATGGATCCTTCCTGCTATGCCAACATTGACTTGACAACAGTTTTACCAAGAGTTAATAATGCAGTGGTTTCTACGAAGATTCACCGAGCTGGAAACTCTCTTCAATT
TCCTGCATTTCTCTGCCCCTCTCCAGTTCGTTCATTGGCGCTTCTCTCTTCCGTCCCTTCGATTTTCATTGTTATTCCTGACTGAAATGGTTAGGGGAGGATTGATCCGG
CAGTTGCTGAGGAAATTTCCATCTCAATCTACATGAAGTTGAAGAGGATTTTAGTTTTTTTTAGTTACATGCTGGTGGAGAAGAATTATTGGTGGCAAATATAGTACCCA
ACTACTTGGAAATCCACTATGAGAACCACATCATCTCCCCCTTCTCCAACCTCTGTTGCTCCACCATGATACCATCGCTACAACATGATATCTACAAAATACCCAAGGAA
ATCTACCATCGAAAGAAAAGACTTCATCATCGACATTGACTTACATTACAGAGGCAGCAGAGTAAAGATCACAGAAGCAACCAAGGACAAGTCATTCTCTGTTTCCATAC
AGTGGTCCTCATTAACATGGATCCTCAAAAGTCTGTATTCACTCTTAGAGACCCCATTGAACCACAAGTTCTTCTTCGAAACAAGGATCGACGATTACTCCCTTTGGTTG
GCAAAGACCAGAAATAAAAAGGGATACATTGCAGAACTCACCAAGGAGAACAAAAATGGCACGATATGCAAGCTGGTTATCCCTATCGGAACAGAGAGGATGGGTTGGTT
CAGTTTTTACACTTTATTACAAGGTTCCCCAGCACCAGAAGGTCCCAACAAGAATATTAATGCCTCTCTTCCCACACAAAAGCCATCATTTACTCTTAACAGCGGTCCAG
AGTTCACCAGACCACCAAACTTCATTTATGGACACAATAATAAAGTGTAGCCGAAGGAAGTTCCTACTACAAAACCTCAGGCGTTGTCCTATAAGTCTGCACTTATTAAG
GGTAACAAAGTAGAACCGATCATCAGACCAGACGCCAACAAAGAATACAGAGCTGTTATTACAGCTAGAGAAGTCAGTAAAAACAAGATGGGAAATCAACAGAACACAAA
CCTCTGGAGATCGCAAAAAAGGAAAAATAACAAGAAGAAGAAAGAAGAGGAACAGAGGAATAAGCCACTTGTTTATAGGCCTAAAAATATTGCAGAACCACAAATAATCT
CTACTAAGAAGCATATAGAAGAGTGGTGCGACCCATCGATATTCTGCAAAACAATCGAGTATGAATATTATTCAGATGACCCCTTCATAGCAGCTGAATTCGGTACTCCC
CCGGTGACAGAAAAACAGGGTTACTCGATGAAGAAGACGACCGGTAGAGACTCTACGATGAACCTCAACGATTGCGCTCTGAAAGACGAGACATGTGGGCGGCTTAGTCT
AGCAGTACTCAGTTCCTCAACAGACAACTCAGAGGTTTTTTACACAAAATCAGCCATTGAGGGGCCGGGATTTCTTCTGGTAACAGGTACACCCTCTTATCCGTCACATC
TCATCCTTCCCTCCAATCCTTGGATACTTTATCCTCTGGACAACACCTTACCCCCCATCTTACGACACCTAACCTGCTATCCGAACCAATTACCCTTTTTCCAAGCCTAA
TCTCCCAATCCACATCAACCAAAAAGTCTCCTTTTTTGTACCTGGAACTAAAGAGAATACCAAACTCACTTCTATCTCTGAATCCGACATCTATTCTTCACCATATTCAT
CAACCTCTCCATCTTCCCATGTTTCACCAAAAACTCAATCCCATGAACCAACACCATTACAAATTTCCTACCCGACTCCTCTCTCATCCCTCTCTACCTTAGAAGATTCT
GCTACCTATTACCAGAAGAAAGCTCTCAAGTTCGATCCAAACATTCATCCATCATTATACCCCTTGGCTAAGGAATATGGGTCTGGGAATCCTTCCCCTGCCTACCAAAG
TGACAAAGCAGATGAAAGAGGCTAGGAAGAACAATAAATTATATAGAGAATTAGTTGGTCTGACATCATCGGTTAACTATGACCAACATCACAGAGACATCTCTATGGGT
GGAACAAAGGTGCCTGGATGAAATTCCTTTCCTGGAATGTGAGAGGCCTTGGCTACAAAGATAAAAGGGCTATAGTGAAAAACATGATTCTTAAACATAATCCGGCAGTG
GTCATTCTCCAAGAAACCAAGTCCCATTCAATTGACAACTGGTTCATCAAATCTATTTGGAGCTCACGAGATATAGCTTTGTCATCTCTGGATTCGGTGGGATCCGCGGG
TGGAATCATTATCATGTGGAACAACAACTCTATTGAGATCAATGACATCAAAAGAGGTAATTATACTCTTTCTATTAACATTAAATTGGCAGATGGCTTTCTTTTTTGGA
TTACAGGAGTCTATGGCCCTTCTAGCACGACACACTCACCAGAATTTTGGGATGAGCTTAAAGAACTGGCTGTTTACTGTTCGTTCGGATGGATTATTTGCGGTGATTTT
AATACTATCAGATGGACACACGAAAGATCAGTCCAGGGACATATTACTCATGATATGAGAAGCTTTAATGGCTTCATTGAACAACAGGTTCTTATAGACATCCCCTTTCG
AATGGCTTATGCACCTGGTCTGATTTCAGAGCTTCCCCCACTTTGTCTAAGCTTGACAGGTTCCTAATCACAGAAAGTATACAAGGCCGGTTTAAAGACATTATTGTTAG
CAGACTGGATAGACCTACCTTTGATCATTTCCCTCTGCAGATGGTGATCGGTAAATCAAAGTGGGGTCCTACCCCATTCAGATTCCATAATATGTGGATGGATCATAAGG
ACTTCAAACCAATGATGGACTATTGGTGGACAAACACTCCCATGAGAGGATGGCCTGGCCATGCTTTTATTCAAAAGCTTAAGAGCTTCAAGAGTATGATCAAACTTTGG
AACAAGGAAGTCTTTGGCAATGTGACAGAAAAGAGAAACCAATTGAGCCTTGAACTTGCTATACTTGATGATATCGAGGAAAGGGATCGATTGCATCCTATCCAAATCTG
CCAAAGGAATTCTATCAAAGCACAATTGCTTGACATTGCAGTCAAAGAAGAGACCTTATGGAGGCAAAATTGCAAAGTCAAATGGATGGAATTTGGGGATGAAAATTCGA
AGTTGTTTCACCGATATGTGGCTGCAAGAAGACGAAAAAGTACTATTACAGATTCTCTCTAGCAATGACAGTAGTTTGGTTGATGATACAGAGATTGAAGGTGAATTCAT
TAGCTTCTTTAACAAGCTATACACTAAGAGAGATGAGCCGAGGACCCTCCCTCAAATTCAAGACTGGAGTCCGATTAACTTTGAGCAAGCTCACCATCTAGAATCACCAT
TCACCGAAACAGAAATCTGGGATGCCATCAACAAATTGGGAACCAACAAAACTCCAGGGCAAGATGGATTCACTGCTGAGTTCTATAAAAAATATTGGAACACCCTTAAG
TTTGATATAATGAGGGTGTTCCAAGATTTTTTTAAGAACGGAATTATCAACGCAAGCCTCAATGAGACTTATATCTGCCTCATTCCCAAAAAAGTAGAAGCTAAATGTGT
TAATGACTTTAGACCCATCAGTTTGATCCCCTGTATGTACAAGATTATTGCTAGAGTATTATCTCAAAGACTAAAAAGAGTCTTACCTCACATTATCTCTGATTCTCAAG
CTGCATTTGTGGAGGGAAGGAACATTTTAGATGCCATCCTTATTGCTAACGAAGTTATTGACGAGTGGAAAAGAAAGAATAGAAAAGGAATATTCATCAAATTAGATATT
GAAAAAGCTTTTGACACGGTGGATTGGGACTTTCTTGACAAAATAATGATGATAAAAGGCTTTGGCACAAAATGGAGACTTTGGATACATGGTTGTATCTCATCGGCGAA
TTTTTCCATCATTATTAATGGAAAACCGAGGGGAAAAATAAGGGAAACAAGAGGGCTTAGACAAGGAGATCCTCTTTCTCCTTTTCTATTTATTCTGATCATGGATTGTT
TTAGTAGAATGATGTCAGAGGAAGTCGAGAAGCATAAAATTCATGGGCTTTATATTGGTACCAAGGTTCCTAGCCTTACTCATCTACAGTTTGGAGATGATACTCTCCTT
TTGTCAGACTATAATGTGGATGCCATTGATGCACTCTTTCAAATAGTTGCAACTTTTGAACAAGTATCAGGGCTTAACATCAACTTTAACAAAACAGAGTTACTTGGGCT
GAACATTGACGAAGCTGACCTGGATTTGCTTTCTAAACAATATGGATGTAAGCTTGGCTCATGGCCAACAACCTATTTGGGCCTTCCTCTTATTAGTAATCCTAATTCCA
ATGTTTTTTGGGAACCAATGCTACAGAAAGTGGAAAAGAAGATCTTATCTTGGAAGAATCAACATATATCGAAGGGTGGGAGGTTGACTCTCATCAAAGCCACTCTTGCT
AATCTCCCCACATACTTTATGTCGGTGTTTAAAATGCCTAGTAGAGTTGTAGCTACCTTGGAAAGAATGATTAGAAACTTCCTCTAGAAGGGGTGTATTGATGTCAAAGG
GATGCATCTCGTTAAATGGGATACTATTACCCTCCCACATTTACAAGTGGGCCTTGATATTGACAAAATCAAACCCAAGAACGAGGCCCTTCTTACAAAGTGGATTTAGA
GATATTATGTGGAAGAACAAGCTCTATGGAGGAAAGTGATTGATGACAAGTATGGCACATGTCAATTTAGCAATAGGTCGAGAAGGACAACATTGGCTTCGACTAAAGGT
CCATGGAAGCCCCATGATTCCGGATCGAATCTCTTATAAAGTGAAGAATGGTGAGAGCACTCTTTTTTGGAAGGATAAATGGCTTAATGATTCTCCTCTCAACCATGATT
TCTCTCTTCTTTTCCACATCTCCAAAGAAAAAAATATGACTGTTAGTAAGGCATGGGAGGTTGAAAGGAAAATCTGGAACTTGAGGCTTAGAAGAAACCTGAAAGATGAT
GAACTAGATGAGTTTTGTTCTCTCCAGAATCAACTTGATGGAGTGACTCTGGCCGAGGGTAGGGACTGCTGCATTTGGAACCTGGAACCTAGTGGAAAGTTCACTGTCCA
TTCCTTACTCAAAGATCTTAAAGCCATTGCATCTGCTACAATACCTGATTTTGGCAGATTTATAGTTCTATTTGGAAAGGGAAATGCCCGAAGAAGGTGCAATTCTTCCT
TTGGGAATTGAGTCATGGGGCCATCAACACCAATGATCGGCTCCAACGTAGACTTCCACGTATGACTCTCTCTCCACAATGGTGTACCATCTGTTATAAATCCACCGAAA
CTCAACCACATCTACTGATACCTTGTACTCATGCAAATGCATTCTGGGATATATTCGGGGGGCATTCTCTTGGTATTTGGTCATGCCTAATGATCTTCACGAGCTCCTTA
TATTATCCCTTTCGAACCACCCATTCAAAGAAGAAAAGAAAAAATCATTATGGGAAAACTTTATCGAAGCATATTGTTGGAATATTTGGATCGAGCGGAATCGCAGGATC
TTCAAAGGAACATCTAATCACTTCGACTGGTTCATAGACAATATTACTCATATGGTGGTAACTTGGTGTAAAATGTCTCCATTATTTCATGATTATAGTTACGATATTCT
TATTAATAGTTGGAGATCTTTATTATAAAGAACTATGTATTTTGTAATTTCATATTATCAATGAAATTGTTTCTTATCCCAAAAACAGTTACACATTTTGACTATTCACT
ACCCAAGTAGCCAACAAGAGCGTAGCTTCAGTTACATGATGCATAACCTAGTAATCAAAAGGTTACGGGTTCAAACCTCCCACCCCCAAATGTTGTTGAATCAAAAGATT
ATTCACTACCCTAGTATTGCTGTGAAATCCATCCGAACCACAATTCTTATTCGGTTGACTGTCTATGCGAGATCTCTTAAGCTTGGTATAGTTCCTGGCCCAACTGGATC
TGGGATCTTGCCAACCATTGGTTTTCGGGAACTCTGTAGATATTACTAGCTTCTCATGGAATGACAAAAGATCCAGACAAGGGAAGGAGTCATCAGTTCTCACAAGATCC
TGCTCAAGCTCTTTGGGAGGGGGGAGTAGCTTTACTGGGAAGATCTTAAGAACGTCGCATCTTTACAATATTGAAAGAATGGACAATCATTCACTTCGTGTTGCTTTATC
TGCTTGCCCGTCACTCCTTGATCTGGAAATCGTGGGCCTGATCATTGCAATTGAGCAAGTAATGAAAGAAAGGCCTGATGTATGCTTGCTGGCTGATTTTCCATCAGAAG
GAAGTTACTTCGAAATTGAACAGATGCTGGATAGCGAATTAAACAGCAATGTTAGTCTGCCATCACAGCTGAGCAGTCAAACATTTAATTCAATGTTTATAAGTTGTTCA
GAGAGCAGCTATAACAGCGATCAAGGTAGTGACGATGAGGATGGTCGAGATGCCAGCTATGCTATATTCGGGGAGAGTTCAGATGAGGTGGATTATCTTGCCCTGTAGTC
GAAGAGCTGATAGTTGCAGTAACCAATTGAATTCGGGAGGATACTTCGACGCACAAGCTGTGGTGCGCATTTTGTGCCGGCATGAGGCGATTTGGTGCTTCGGTCGATTC
ACCGAAGATTGGAAAACATTGAATCAGTAGCCTGGTTGACTTGATATCTCCATTTAATTGCTGAGATACAAACCAAAAGCTATATCCAGGTTACCTTCTTCCTTCCAACC
CTTTCTCTAGAAGAGGAAATACGAAAATAAGAGGAAAGAGTGGAGATTTAGTAGTCCCTATCCCCTTTCCTCGTTGTTATATCGTACATGGTCGTTCACATCGTCGGTTC
ATAGAGAGCTCTAGTTCCTGATGATGTTAATCTGCCATGAATGTAAATTTTGTTTTGCAGCGCATTTGTACCAAAAGATCCAACTCAATAGGAGCTGGAAGTTACTCTTC
TAGTTTTTTTTAGTACATATTGGGAGAGGGAGGACAAATACTGATTATCGGGATCGAACCTGTGATGCTGGTGATGAACTGGTTTATAACCAAACAATGAGACCATTGCG
CTACTGACCCATTCAGTAAGTTATCTTCTAGTTGATAACATTTTGTATGCATCAGCAATTCAAATTATTATAGCATTGTTTCATGAATGCAGTTATTATTACTCTCCTTT
TTT
Protein sequence
MISRKGIDCILSKSAKGILSKHNCLTLQSKKRPYGGKIAKSNGWNLGMKIRSCFTDMWLQEDEKVLLQILSSNDSSLVDDTEIEGEFISFFNKLYTKRDEPRTLPQIQDW
SPINFEQAHHLESPFTETEIWDAINKLGTNKTPGQDGFTAEFYKKYWNTLKFDIMRVFQDFFKNGIINASLNETYICLIPKKVEAKCVNDFRPISLIPCMYKIIARVLSQ
RLKRVLPHIISDSQAAFVEGRNILDAILIANEVIDEWKRKNRKGIFIKLDIEKAFDTVDWDFLDKIMMIKGFGTKWRLWIHGCISSANFSIIINGKPRGKIRETRGLRQG
DPLSPFLFILIMDCFSRMMSEEVEKHKIHGLYIGTKVPSLTHLQFGDDTLLLSDYNVDAIDALFQIVATFEQVSGLNINFNKTELLGLNIDEADLDLLSKQYGCKLGSWP
TTYLGLPLISNPNSNVFWEPMLQKVEKKILSWKNQHISKGGRLTLIKATLANLPTYFMSVFKMPSRVVATLERMIRNFL
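
The protein sequence above is expected to be the translation of the CDS sequence listed earlier. A minimal sketch for checking this, assuming the standard nuclear genetic code (the page does not name the translation table), is given below. The function name `translate` is a hypothetical helper; the two example strings are the first seven codons and the last four codons of the CDS as listed.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = False) -> str:
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    seq = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):  # ignore a trailing partial codon
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGATATCGAGGAAAGGGATC"))        # start of the CDS
print(translate("AACTTCCTCTAG", to_stop=True))   # end of the CDS, stop codon dropped
```

Pasting the full CDS (line breaks are ignored) and calling `translate(cds, to_stop=True)` should give a string that can be compared directly with the protein sequence listed above.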