; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI04G12010 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI04G12010
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr4:10334440..10338517
RNA-Seq ExpressionCSPI04G12010
SyntenyCSPI04G12010
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]1.7e-14450.88Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSIMING LEGFFHGRKG+RQ +PLSPF FVMVM+V SRML  PPQ F+FHQ CEKV+LT  TFADDLMIFC A+  S+SF++ET+++FGEL GL+ANL
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGLI--VRKLLGLLLTWVLPLVTSLFVILVFLSSL---EDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD
         K   F+ G    V   L   + ++L  +   ++ L  L+      DC  LI+ I   +      V+   GR QLVRSV RSLQVYWASVF+LP  V   
Subjt:  DKTLFFLWGLI--VRKLLGLLLTWVLPLVTSLFVILVFLSSL---EDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD

Query:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLL--------------LVKSGSL--------------AILRKRDIL
        VDKILR+YLWR                          +RDG S NI STLKILWLL              ++K  SL              AILRKRD L
Subjt:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLL--------------LVKSGSL--------------AILRKRDIL

Query:  KAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIAS
        K HV +EVG+G  CRVWLDPW+QG  I++Q GERV+YDA SRR+ARL +F+  DG W+WP VS+ L+D+WD +Q VR  LS+ DRWVWVPG    FSIAS
Subjt:  KAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIAS

Query:  TWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-----------------------------------
          +TIRP   RV W GLLWGGGN+PKHSFCAWL I+++LGTRDRL RWD S+P+S ILC G  ES                                   
Subjt:  TWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-----------------------------------

Query:  ---C---IGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVHGLI
           C   IG  VR+KLWR+L CAT YFIW++ NHRLHGG  R L++IFQ I +CI+AR  SW +  H LI
Subjt:  ---C---IGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVHGLI

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]5.4e-14654.9Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSIMING LEGFF+GRKGLRQ DPLSPFLFVMVMEVLSRML   PQ+F+FH  CEKV+LTH TFADDLMIFC A+  S+SFI+E +++FGE SGLFAN 
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGL---IVRKLLGLL-LTWVLPLVTSLFVILVFLSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDV
         K+  F+ G+       L   +  +W  P   S      +     DC  LI+ I   +      V+   GRLQLVRSVLRSLQVYWASVF+LP  V  +V
Subjt:  DKTLFFLWGL---IVRKLLGLL-LTWVLPLVTSLFVILVFLSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDV

Query:  DKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSLAIL-RKRDILKAHVKMEVGNGRRCRVWLDPWI--QGGL
        DKILR+YLWRGKEEGRGG KVAW +V LPF+EGGL IRDG S NIA+TLKI   LL   GSL +   +  ILK     +V + R  R W    I  +   
Subjt:  DKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSLAIL-RKRDILKAHVKMEVGNGRRCRVWLDPWI--QGGL

Query:  IIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPK
        +    GERV+YDA SRR+A+L DF+D +G W WP VSL L+D+W+ +Q V   LS+ D WVWVPG    FSIAS WE I P   RV W GLLWGGGNIPK
Subjt:  IIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPK

Query:  HSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES--------------------------------------C---IGKGVRKKLWRLLWCATIY
        HSFCAWLAI+DRL TRDRL RWD SIPLS ILC G  ES                                      C   IGKGVR+KLWR+LWCATIY
Subjt:  HSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES--------------------------------------C---IGKGVRKKLWRLLWCATIY

Query:  FIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVH
        FIW +RNHRLHGG  R+ +++F +I + I+ARA SW +  H
Subjt:  FIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVH

KAA0062318.1 uncharacterized protein E6C27_scaffold154G00690 [Cucumis melo var. makuwa]9.8e-14054.17Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSIMING LEGFFHGRKG+RQ DPLS FLFVMVMEVLSRML   PQ+F FH  CEKV+LTH TFADDLMIFC AN  S+ FI+E +++FGELSGLFAN 
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGLIVRKL--LGLLLTWVLPLVTSLFVILVFLSS---LEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD
         K+  F+ G+       L   + +V   ++  ++ L  L+      D   LI+ I   +      V+   GRLQLV SVLRS QVYWASVF+LP  V  +
Subjt:  DKTLFFLWGLIVRKL--LGLLLTWVLPLVTSLFVILVFLSS---LEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD

Query:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDI
        VDKILR+YLWRGKEEGRGG KVAW +V LPF+EGGL IRDG S NIASTLKILWL+L  SGSL                             AILRKR+ 
Subjt:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDI

Query:  LKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIA
        LK  V+M+VGNG   RVWLDPW+  G I++Q GERV+YDA SRR ARL DF+D DG W WP VSL L+D+W+ +Q V   LS+ D WVWVPG    FSIA
Subjt:  LKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIA

Query:  STWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGIL-CGGNYESCIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGG
        S WE +RP   RV W GLLWGGGNI KH FCAWLAI+DRLGT DRL RWD S+P+  IL   G++   IG G+       L C                 
Subjt:  STWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGIL-CGGNYESCIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGG

Query:  AVRELMVIFQIIRSCIKARAASWSDGVH
          R+ +V+F +I S I+ARA SW    H
Subjt:  AVRELMVIFQIIRSCIKARAASWSDGVH

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]3.2e-18366.06Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSI+ING LEGFFHGRKGLRQ DPLS FLFVMVMEVLSRML  PPQNF+FHQFCEKV+LTH TFADDLMIFC A+NYSMSFIKETIKRFGELSGLFANL
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGLIVRKLLGLLLTWVLPL----VTSLFVILVF--LSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQR
         K+  FL G+   K   L       +    V  L + L+F  L S  DC  LI+ I   +      V+   GRLQLVRSVLRSLQVYWASVFMLPMKV R
Subjt:  DKTLFFLWGLIVRKLLGLLLTWVLPL----VTSLFVILVF--LSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQR

Query:  DVDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSLAILRKRDILKAHVKMEVGNGRRCRVWLD--------
        DVDKILR+YLWRGKEEGRGGAKVAWDEV LPFDEGGL IRDGSS NIASTLKILWLLLVKSGSL +        A V+  +  GR    W+D        
Subjt:  DVDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSLAILRKRDILKAHVKMEVGNGRRCRVWLD--------

Query:  ----------------------PWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFS
                               WIQGG IIQQFGERVIYDAGSRRDARLVDFM RDG WRWPLVSL LMDIWD IQGVR S S+EDRWVWVPGS DSFS
Subjt:  ----------------------PWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFS

Query:  IASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES--------------------------------
        IAS WETIRPHSSRVGWSGLLW  GNIPKHSF AWLAIRDRLGTRDRLS+WDRSIPLS +LCGGNYES                                
Subjt:  IASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES--------------------------------

Query:  ------C---IGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIF
              C   IGKGVR+KLW LLWCATIYFIW++RNH LHGGAVRE M+ F
Subjt:  ------C---IGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIF

XP_031745730.1 uncharacterized protein LOC116406187 [Cucumis sativus]2.5e-16772.3Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSIMING LEGFFHGRKGLRQ DPLSPFLFVMVMEVLSRML +PPQNF+FHQFCEKVRLTH TF DDLMIFCTA+N+SMSF KETIKRFGELSGLFANL
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGLIVRKLLGLLLTWVLPLVTSLFVILVFLSSLEDCGALIEIILFSVLLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYL
         K+  FL G+   K   L       +   LFVIL FLSSLEDCGALI I LFSVL VVFGL                          V RDVDKILRAYL
Subjt:  DKTLFFLWGLIVRKLLGLLLTWVLPLVTSLFVILVFLSSLEDCGALIEIILFSVLLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYL

Query:  WRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDILKAHVKMEV
        WRGK+EGRG AKVAWDEV LPFDEGGLDIRDGSS NIASTLKILWLLLVKSGSL                              ILRK+DILKAHVKMEV
Subjt:  WRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDILKAHVKMEV

Query:  GNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPH
        GNGR+ RVWL PWIQGG IIQQFGERVIYDAGSR DARL+DFM RDG WRWPLV L LMDIWD +QGVR S S+EDRWVWVPGSHDSFSI S WETIRPH
Subjt:  GNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPH

Query:  SSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSI
        SSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTR    R +R I
Subjt:  SSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSI

TrEMBL top hitse value%identityAlignment
A0A5A7T440 Zf-RVT domain-containing protein2.2e-12160.11Show/hide
Query:  GRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-------
        GRLQLVRSVLRSLQVYWASVF+LP  V  +VDKILR+YLW+GKEEGRGG KVAW +V LPF+EGGL IRDG S NIA+TLKILWL+L  SGSL       
Subjt:  GRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-------

Query:  ----------------------AILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDI
                              AILRKR+ LK HV+M+VGNG RCRVWLDPW+QGG I++Q GERV+YDA SRR+ARL DF+D +G W WP VSL L+D+
Subjt:  ----------------------AILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDI

Query:  WDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDR----LGTRDRLSRWDRSIPLSGILCGGNYES
        W+ +Q V   LS+ D WVWVPG    FSIAS WE I P  SRV W GLLW GGNIPKHSFCAWLAI+DR    + +  R+  W   + LS I   G    
Subjt:  WDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDR----LGTRDRLSRWDRSIPLSGILCGGNYES

Query:  CIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVH
         IGKGVR+KLWR+LWCATIYFIW +RNHRLHGG  R+ +++F +I + I+ARA SW +  H
Subjt:  CIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVH

A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein8.4e-14550.88Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSIMING LEGFFHGRKG+RQ +PLSPF FVMVM+V SRML  PPQ F+FHQ CEKV+LT  TFADDLMIFC A+  S+SF++ET+++FGEL GL+ANL
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGLI--VRKLLGLLLTWVLPLVTSLFVILVFLSSL---EDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD
         K   F+ G    V   L   + ++L  +   ++ L  L+      DC  LI+ I   +      V+   GR QLVRSV RSLQVYWASVF+LP  V   
Subjt:  DKTLFFLWGLI--VRKLLGLLLTWVLPLVTSLFVILVFLSSL---EDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD

Query:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLL--------------LVKSGSL--------------AILRKRDIL
        VDKILR+YLWR                          +RDG S NI STLKILWLL              ++K  SL              AILRKRD L
Subjt:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLL--------------LVKSGSL--------------AILRKRDIL

Query:  KAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIAS
        K HV +EVG+G  CRVWLDPW+QG  I++Q GERV+YDA SRR+ARL +F+  DG W+WP VS+ L+D+WD +Q VR  LS+ DRWVWVPG    FSIAS
Subjt:  KAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIAS

Query:  TWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-----------------------------------
          +TIRP   RV W GLLWGGGN+PKHSFCAWL I+++LGTRDRL RWD S+P+S ILC G  ES                                   
Subjt:  TWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-----------------------------------

Query:  ---C---IGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVHGLI
           C   IG  VR+KLWR+L CAT YFIW++ NHRLHGG  R L++IFQ I +CI+AR  SW +  H LI
Subjt:  ---C---IGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVHGLI

A0A5A7TZS0 Reverse transcriptase domain-containing protein2.6e-14654.9Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSIMING LEGFF+GRKGLRQ DPLSPFLFVMVMEVLSRML   PQ+F+FH  CEKV+LTH TFADDLMIFC A+  S+SFI+E +++FGE SGLFAN 
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGL---IVRKLLGLL-LTWVLPLVTSLFVILVFLSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDV
         K+  F+ G+       L   +  +W  P   S      +     DC  LI+ I   +      V+   GRLQLVRSVLRSLQVYWASVF+LP  V  +V
Subjt:  DKTLFFLWGL---IVRKLLGLL-LTWVLPLVTSLFVILVFLSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDV

Query:  DKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSLAIL-RKRDILKAHVKMEVGNGRRCRVWLDPWI--QGGL
        DKILR+YLWRGKEEGRGG KVAW +V LPF+EGGL IRDG S NIA+TLKI   LL   GSL +   +  ILK     +V + R  R W    I  +   
Subjt:  DKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSLAIL-RKRDILKAHVKMEVGNGRRCRVWLDPWI--QGGL

Query:  IIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPK
        +    GERV+YDA SRR+A+L DF+D +G W WP VSL L+D+W+ +Q V   LS+ D WVWVPG    FSIAS WE I P   RV W GLLWGGGNIPK
Subjt:  IIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPK

Query:  HSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES--------------------------------------C---IGKGVRKKLWRLLWCATIY
        HSFCAWLAI+DRL TRDRL RWD SIPLS ILC G  ES                                      C   IGKGVR+KLWR+LWCATIY
Subjt:  HSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES--------------------------------------C---IGKGVRKKLWRLLWCATIY

Query:  FIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVH
        FIW +RNHRLHGG  R+ +++F +I + I+ARA SW +  H
Subjt:  FIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVH

A0A5A7UV01 F17F8.56.2e-12451.67Show/hide
Query:  GLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDKTLFFLWGLIVRKL--L
        G+RQ DPLSPFLFVMVMEVLSRML   PQ+F+FH  CEKV+LT+ TFADDLMIFC A+  S+ FI+E +++FGELSGLFAN  K+  F+ G+       L
Subjt:  GLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDKTLFFLWGLIVRKL--L

Query:  GLLLTWV---LPLVTSLFVILVFLSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYLWRGKEEGRG
           + +V   LP+      +L       D   LI+ I   +      V+   GRLQLVR VLRSLQVYWASVF+LP  V  +VDKIL +YLWRGKEEGRG
Subjt:  GLLLTWV---LPLVTSLFVILVFLSSLEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYLWRGKEEGRG

Query:  GAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDILKAHVKMEVGNGRRCRVW
        G KVAW +V LPF+E GL IRDG S NIASTLKIL L+L  SGSL                             AILRKR+ LK  V M+VGN   CRVW
Subjt:  GAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDILKAHVKMEVGNGRRCRVW

Query:  LDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGL
        LD W+ G  I++Q GERV+YDA S R+ARL DF+D DG W WP                                   FSIAS WE +RP   +V W GL
Subjt:  LDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGL

Query:  LWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYESCIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKAR
        LWGGGNIPK+SFCAWLAI+DRLGTRDRL R+              +ES     VR+KLWR+LWCATIYFIW +RNHRLHGG  R+ +VIF +I S I+AR
Subjt:  LWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYESCIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKAR

Query:  AASWSDGVH
          SW +  H
Subjt:  AASWSDGVH

A0A5A7V3Z0 Reverse transcriptase domain-containing protein4.8e-14054.17Show/hide
Query:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL
        MFSIMING LEGFFHGRKG+RQ DPLS FLFVMVMEVLSRML   PQ+F FH  CEKV+LTH TFADDLMIFC AN  S+ FI+E +++FGELSGLFAN 
Subjt:  MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANL

Query:  DKTLFFLWGLIVRKL--LGLLLTWVLPLVTSLFVILVFLSS---LEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD
         K+  F+ G+       L   + +V   ++  ++ L  L+      D   LI+ I   +      V+   GRLQLV SVLRS QVYWASVF+LP  V  +
Subjt:  DKTLFFLWGLIVRKL--LGLLLTWVLPLVTSLFVILVFLSS---LEDCGALIEIILFSV----LLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRD

Query:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDI
        VDKILR+YLWRGKEEGRGG KVAW +V LPF+EGGL IRDG S NIASTLKILWL+L  SGSL                             AILRKR+ 
Subjt:  VDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSL-----------------------------AILRKRDI

Query:  LKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIA
        LK  V+M+VGNG   RVWLDPW+  G I++Q GERV+YDA SRR ARL DF+D DG W WP VSL L+D+W+ +Q V   LS+ D WVWVPG    FSIA
Subjt:  LKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIA

Query:  STWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGIL-CGGNYESCIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGG
        S WE +RP   RV W GLLWGGGNI KH FCAWLAI+DRLGT DRL RWD S+P+  IL   G++   IG G+       L C                 
Subjt:  STWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGIL-CGGNYESCIGKGVRKKLWRLLWCATIYFIWQDRNHRLHGG

Query:  AVRELMVIFQIIRSCIKARAASWSDGVH
          R+ +V+F +I S I+ARA SW    H
Subjt:  AVRELMVIFQIIRSCIKARAASWSDGVH

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein5.6e-0532.08Show/hide
Query:  SIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDK
        +I++NG     F  + G RQ  PLSP LF +V+EVL+R +    +        E+V+L+   FADD++++      S   + + I  F ++SG   N+ K
Subjt:  SIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDK

Query:  TLFFLW
        +  FL+
Subjt:  TLFFLW

P08548 LINE-1 reverse transcriptase homolog1.0e-0633.02Show/hide
Query:  SIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDK
        +I++NG+    F  R G RQ  PLSP LF +VMEVL+  +         H   E+++L+   FADD++++      S + + E IK +  +SG   N  K
Subjt:  SIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDK

Query:  TLFFLW
        ++ F++
Subjt:  TLFFLW

P11369 LINE-1 retrotransposable element ORF2 protein2.8e-0430.19Show/hide
Query:  SIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDK
        +I +NG        + G RQ  PLSP+LF +V+EVL+R +    +        E+V+++    ADD++++ +    S   +   I  FGE+ G   N +K
Subjt:  SIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDK

Query:  TLFFLW
        ++ FL+
Subjt:  TLFFLW

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.7e-1524.44Show/hide
Query:  ILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGS
        + + R++ +  V  +VG+G   + W D W   G +I   G       G                   P+ ++ L+D              +D ++W    
Subjt:  ILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVPGS

Query:  H---DSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-------CIGKGVRKKLWR-----
        H   + FS A T   + P +  V W   +W   ++PKH+F  W+   +RL TRDRL  W  SIP   +LC  + ES       C   G    +WR     
Subjt:  H---DSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-------CIGKGVRKKLWR-----

Query:  --------LLWC----------------------ATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKAR
                L++C                      A +Y IW++RN  LH G  R    + + I+  I+AR
Subjt:  --------LLWC----------------------ATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKAR

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein8.0e-2330.32Show/hide
Query:  ILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRR---DARLVDFMDRDGVWRWPLV-SLALMDIWDSIQGVR--SSLSIEDRW
        +L  R + +  VK  +GNGR    W D W   G +I+  G+   Y + S R   +AR+V+ +  +G W+ PL  S     I D I  +   S  +IED +
Subjt:  ILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRR---DARLVDFMDRDGVWRWPLV-SLALMDIWDSIQGVR--SSLSIEDRW

Query:  VWVPGS--HDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES----CIGKGVRKKLWRL--
         WV G      FS A TW+ IRP +  + W+  +W  G +PKH+F  W++  DRL TR RL+ W         LC    ES            ++WRL  
Subjt:  VWVPGS--HDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES----CIGKGVRKKLWRL--

Query:  --------LWC--------------------------ATIYFIWQDRNHRLHGGAVRELMVIFQI----IRSCIKAR
                L+C                          A IY IW+ RN+ LH       ++IF+I    IR+ I +R
Subjt:  --------LWC--------------------------ATIYFIWQDRNHRLHGGAVRELMVIFQI----IRSCIKAR

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.7e-3432.59Show/hide
Query:  GRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGS-LAILRKR
        GRLQL+ SV+ SL  +W S F LP    +++D I  ++LW G E     AKVAW +V  P DEGGL IR     N  S   I     + S     IL+ R
Subjt:  GRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYLWRGKEEGRGGAKVAWDEVYLPFDEGGLDIRDGSSGNIASTLKILWLLLVKSGS-LAILRKR

Query:  DILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSS--LSIEDRWVWVPGSHD-
         +    VK ++ NG     W D W + G +I   G R   D G    A + + +      R       L+ I D I  VR     S ED   W  G+ D 
Subjt:  DILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALMDIWDSIQGVRSS--LSIEDRWVWVPGSHD-

Query:  ---SFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYE-------SCIGKGVRKKLWRLLWCATI
            F+   TW   R    +V W   +W     PK+S  AW+AI++RL T DR+  W+     S +LC    E       +C        L R  +  T+
Subjt:  ---SFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYE-------SCIGKGVRKKLWRLLWCATI

Query:  YFIWQDRNHRLHG
        + +W++RN R HG
Subjt:  YFIWQDRNHRLHG

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.0e-2227.88Show/hide
Query:  RDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVW------RWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVP
        R + +  +  EVG+G   + W D WI  G +I+  G       G   DA + D +     W      R P++ + L ++    QG+      +D ++W  
Subjt:  RDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVW------RWPLVSLALMDIWDSIQGVRSSLSIEDRWVWVP

Query:  GSH---DSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-------CIGKGV---------
          H   + FS   TW  + P S  V W   +W   ++PKH+F  W+   +RL TRDRL  W  SIP   +LC  + +S       C   GV         
Subjt:  GSH---DSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYES-------CIGKGV---------

Query:  -------------------RKK----LWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKAR
                           R+K    + RL + + +Y IW++RN RLH G  R    I + I+  I+AR
Subjt:  -------------------RKK----LWRLLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKAR

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.4e-1932.45Show/hide
Query:  VKSGS---LAILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVW-------RWPLVSLALMDIWDSIQG
        + SGS    +I + R + +  V  +VG+G  C  W + W   G +I   G+     +G  R+A + D + RDGVW       R P++ L L +       
Subjt:  VKSGS---LAILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVW-------RWPLVSLALMDIWDSIQG

Query:  VRSSLSIE-DRWVWVPGSHDS---FSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILC
        V   +  E D ++W  G  ++   FS A+TW  + P   +V W   +W  G IPKH+F +W+ IR RL TRD+L  W   +P   +LC
Subjt:  VRSSLSIE-DRWVWVPGSHDS---FSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCTCCATTATGATTAATGGATTGTTGGAAGGTTTTTTCCATGGAAGGAAAGGACTTAGACAAGCTGATCCTCTATCCCCGTTCTTATTTGTGATGGTCATGGAGGT
GCTATCTCGCATGTTGATTAGCCCGCCTCAGAATTTTAAATTCCACCAGTTTTGTGAGAAGGTCAGATTAACTCATCATACTTTTGCGGATGATCTGATGATATTTTGTA
CTGCTAATAATTATTCTATGAGTTTCATAAAAGAGACTATTAAGAGGTTTGGTGAGCTATCAGGCTTGTTTGCTAATCTTGATAAAACTCTATTTTTCTTGTGGGGGTTA
ATAGTGCGAAAGCTTCTCGGCTTGCTGTTAACATGGGTTTTACCATTGGTCACCTCCCTGTTCGTTATCTTGGTCTTCCTCTCCTCTTTAGAAGATTGTGGAGCTCTGAT
TGAGATCATCTTATTCAGTGTATTACTAGTCGTATTCGGTCTTGGTAGACTTCAGCTTGTTCGCTCAGTCCTTAGGAGCCTTCAGGTTTATTGGGCTAGTGTGTTCATGC
TTCCTATGAAAGTCCAAAGAGACGTTGATAAGATCTTGAGGGCTTATCTGTGGAGAGGTAAGGAGGAGGGAAGAGGTGGTGCTAAAGTTGCCTGGGATGAGGTTTATCTT
CCTTTTGATGAAGGAGGTCTTGATATTCGCGATGGATCGTCTGGGAATATAGCAAGCACGTTGAAGATCTTATGGTTGCTACTAGTTAAATCTGGTAGTTTGGCTATCTT
GCGTAAGCGGGACATCCTTAAAGCTCATGTGAAGATGGAGGTAGGCAATGGCAGGAGGTGTAGAGTGTGGTTGGATCCATGGATTCAGGGTGGTTTGATTATCCAGCAGT
TTGGGGAGAGGGTGATCTATGATGCGGGTAGTCGGCGTGATGCGAGGCTTGTGGATTTCATGGATCGAGATGGTGTTTGGAGGTGGCCGCTTGTTTCTTTGGCTTTGATG
GACATTTGGGATAGTATTCAGGGAGTGAGGTCGAGTCTGAGTATTGAGGATAGGTGGGTATGGGTGCCGGGTAGTCATGATAGTTTTTCAATCGCCAGTACGTGGGAGAC
TATTCGTCCTCATAGTAGTAGGGTTGGATGGTCGGGTTTACTATGGGGTGGGGGAAATATTCCTAAGCACTCCTTCTGTGCTTGGTTGGCCATCAGGGATAGGTTGGGTA
CTAGAGATAGGTTAAGTCGGTGGGATAGGTCGATTCCTTTATCGGGTATTCTTTGTGGAGGGAACTATGAGTCTTGTATTGGCAAGGGTGTGAGGAAAAAATTGTGGCGC
CTTCTCTGGTGTGCTACTATTTATTTCATTTGGCAGGACCGAAATCATCGTCTTCATGGAGGTGCAGTTCGAGAGCTTATGGTTATATTCCAGATCATTCGATCGTGTAT
TAAAGCGCGTGCTGCTTCTTGGTCCGATGGAGTTCATGGTCTTATTTACAATGCTTTTATTTTGCTTGTCCCCGGGCTGTGGGAGATGTGGGATTGGTTTGGGTACTTAT
GGGTTGTTTTGTCTAGTTGCTTGATTTGTGAGTGTTGTTCGTTCTTGTGCCTTGACCTCAGGCTGCCAGTCGCCCTTTCTTCGACGTTTTGGGATGAAACCAAAAAACTG
AGTCTGGGCATCACCGTTCGACTTTTATGGGCAAGAAACTTCGATAACGCTCAAAATCGAACCACCCAGTGGTTGAATGTGAACGTTCGAGGCCTCCGACCCCGTTTTAA
GTTCGGTCGTTCCTTTCTAGGCCAAAACTTTGATATCGCTCAAAATCGGACCACCCAGTGGTGGGATGTGAACTTTCGAGGCCTCCGACCCCGTTCTAAGCCCCTGGCTA
ATTCTTCGGCGCTTTGGGACGGAACCAAAAAACCGAGTCTAGGACATCACCGATGGGCTTTTGTGGGTAAGAAACTTCGATATCGCTCAAAATCGGACCACCCAGTGGTT
GGATATCAACTTTTGAGGCCTCCGACCCCGCCATTAGGCCCTTTCTTCTGCATTTTGGGACGGAACCAAAAAGCTGAATTCGGGGCATTACTGCCATTGGCCCTTTCTTC
AGCGTTTTGCGACAGAACCAAAAAACTGAGTCCGGGGCATCACCGTTCCGCTTTTACGTTCCTTTCTAGCCCATCGGCCCTTTCTTTGGCGTTTTGCGACGGAACCAAAA
AACTGAGTCCGGGGCATCACCGTTCCGCTTTTGTGGGCAAGAAAACGTTCCTTTCTAGCCCATCGGCCCTTTCTTTGGCGTTTTGCGACGGAACCAAAAAACTAAGTTCG
AGGCATCACCGTTCCACTTTTGTGGCCATGAAATTTCGATATCGGTCAAAATCAGACGACCCAATGGCCATTGGCCCTTTCTTCGGCGTTTTGGGTTGGAACCAAAAAAG
TGAGTTCCGGGCATCACCATTCAGCATTTTTGTCAAGAAACTTCAATATCACTCAAAATTGGACCACCCAGTGGTTGGATGTGAAATTTTGAGGCGTCCGACCCCGCGAT
TGGCCCTTTCTTCAGCGTTTTGGGACGGAACCAAAAAACTGAGTCCAGAGCATCACCGTTCGGTATTTGTGAACAAGAAACTTCGATATCGCTCAAAATCGGACAACCTT
GTGGTTGGATGTGAACTTTCGAGGCCTTCGACCCCGTCATTGGCCATTTTTTCGGCGTTTTGGGATGGAACCAAAAAACTGAGTCCGGGACATCACCGTTCGGCTTTTGT
GGGCAAGGAACTTCGATATCGCTCAAAATCGGATCACCCAGTGGTTGGATGTGAACTTTCGAGGCCTCCGACCCCTTTCTAA
mRNA sequenceShow/hide mRNA sequence
ATGTTCTCCATTATGATTAATGGATTGTTGGAAGGTTTTTTCCATGGAAGGAAAGGACTTAGACAAGCTGATCCTCTATCCCCGTTCTTATTTGTGATGGTCATGGAGGT
GCTATCTCGCATGTTGATTAGCCCGCCTCAGAATTTTAAATTCCACCAGTTTTGTGAGAAGGTCAGATTAACTCATCATACTTTTGCGGATGATCTGATGATATTTTGTA
CTGCTAATAATTATTCTATGAGTTTCATAAAAGAGACTATTAAGAGGTTTGGTGAGCTATCAGGCTTGTTTGCTAATCTTGATAAAACTCTATTTTTCTTGTGGGGGTTA
ATAGTGCGAAAGCTTCTCGGCTTGCTGTTAACATGGGTTTTACCATTGGTCACCTCCCTGTTCGTTATCTTGGTCTTCCTCTCCTCTTTAGAAGATTGTGGAGCTCTGAT
TGAGATCATCTTATTCAGTGTATTACTAGTCGTATTCGGTCTTGGTAGACTTCAGCTTGTTCGCTCAGTCCTTAGGAGCCTTCAGGTTTATTGGGCTAGTGTGTTCATGC
TTCCTATGAAAGTCCAAAGAGACGTTGATAAGATCTTGAGGGCTTATCTGTGGAGAGGTAAGGAGGAGGGAAGAGGTGGTGCTAAAGTTGCCTGGGATGAGGTTTATCTT
CCTTTTGATGAAGGAGGTCTTGATATTCGCGATGGATCGTCTGGGAATATAGCAAGCACGTTGAAGATCTTATGGTTGCTACTAGTTAAATCTGGTAGTTTGGCTATCTT
GCGTAAGCGGGACATCCTTAAAGCTCATGTGAAGATGGAGGTAGGCAATGGCAGGAGGTGTAGAGTGTGGTTGGATCCATGGATTCAGGGTGGTTTGATTATCCAGCAGT
TTGGGGAGAGGGTGATCTATGATGCGGGTAGTCGGCGTGATGCGAGGCTTGTGGATTTCATGGATCGAGATGGTGTTTGGAGGTGGCCGCTTGTTTCTTTGGCTTTGATG
GACATTTGGGATAGTATTCAGGGAGTGAGGTCGAGTCTGAGTATTGAGGATAGGTGGGTATGGGTGCCGGGTAGTCATGATAGTTTTTCAATCGCCAGTACGTGGGAGAC
TATTCGTCCTCATAGTAGTAGGGTTGGATGGTCGGGTTTACTATGGGGTGGGGGAAATATTCCTAAGCACTCCTTCTGTGCTTGGTTGGCCATCAGGGATAGGTTGGGTA
CTAGAGATAGGTTAAGTCGGTGGGATAGGTCGATTCCTTTATCGGGTATTCTTTGTGGAGGGAACTATGAGTCTTGTATTGGCAAGGGTGTGAGGAAAAAATTGTGGCGC
CTTCTCTGGTGTGCTACTATTTATTTCATTTGGCAGGACCGAAATCATCGTCTTCATGGAGGTGCAGTTCGAGAGCTTATGGTTATATTCCAGATCATTCGATCGTGTAT
TAAAGCGCGTGCTGCTTCTTGGTCCGATGGAGTTCATGGTCTTATTTACAATGCTTTTATTTTGCTTGTCCCCGGGCTGTGGGAGATGTGGGATTGGTTTGGGTACTTAT
GGGTTGTTTTGTCTAGTTGCTTGATTTGTGAGTGTTGTTCGTTCTTGTGCCTTGACCTCAGGCTGCCAGTCGCCCTTTCTTCGACGTTTTGGGATGAAACCAAAAAACTG
AGTCTGGGCATCACCGTTCGACTTTTATGGGCAAGAAACTTCGATAACGCTCAAAATCGAACCACCCAGTGGTTGAATGTGAACGTTCGAGGCCTCCGACCCCGTTTTAA
GTTCGGTCGTTCCTTTCTAGGCCAAAACTTTGATATCGCTCAAAATCGGACCACCCAGTGGTGGGATGTGAACTTTCGAGGCCTCCGACCCCGTTCTAAGCCCCTGGCTA
ATTCTTCGGCGCTTTGGGACGGAACCAAAAAACCGAGTCTAGGACATCACCGATGGGCTTTTGTGGGTAAGAAACTTCGATATCGCTCAAAATCGGACCACCCAGTGGTT
GGATATCAACTTTTGAGGCCTCCGACCCCGCCATTAGGCCCTTTCTTCTGCATTTTGGGACGGAACCAAAAAGCTGAATTCGGGGCATTACTGCCATTGGCCCTTTCTTC
AGCGTTTTGCGACAGAACCAAAAAACTGAGTCCGGGGCATCACCGTTCCGCTTTTACGTTCCTTTCTAGCCCATCGGCCCTTTCTTTGGCGTTTTGCGACGGAACCAAAA
AACTGAGTCCGGGGCATCACCGTTCCGCTTTTGTGGGCAAGAAAACGTTCCTTTCTAGCCCATCGGCCCTTTCTTTGGCGTTTTGCGACGGAACCAAAAAACTAAGTTCG
AGGCATCACCGTTCCACTTTTGTGGCCATGAAATTTCGATATCGGTCAAAATCAGACGACCCAATGGCCATTGGCCCTTTCTTCGGCGTTTTGGGTTGGAACCAAAAAAG
TGAGTTCCGGGCATCACCATTCAGCATTTTTGTCAAGAAACTTCAATATCACTCAAAATTGGACCACCCAGTGGTTGGATGTGAAATTTTGAGGCGTCCGACCCCGCGAT
TGGCCCTTTCTTCAGCGTTTTGGGACGGAACCAAAAAACTGAGTCCAGAGCATCACCGTTCGGTATTTGTGAACAAGAAACTTCGATATCGCTCAAAATCGGACAACCTT
GTGGTTGGATGTGAACTTTCGAGGCCTTCGACCCCGTCATTGGCCATTTTTTCGGCGTTTTGGGATGGAACCAAAAAACTGAGTCCGGGACATCACCGTTCGGCTTTTGT
GGGCAAGGAACTTCGATATCGCTCAAAATCGGATCACCCAGTGGTTGGATGTGAACTTTCGAGGCCTCCGACCCCTTTCTAA
Protein sequenceShow/hide protein sequence
MFSIMINGLLEGFFHGRKGLRQADPLSPFLFVMVMEVLSRMLISPPQNFKFHQFCEKVRLTHHTFADDLMIFCTANNYSMSFIKETIKRFGELSGLFANLDKTLFFLWGL
IVRKLLGLLLTWVLPLVTSLFVILVFLSSLEDCGALIEIILFSVLLVVFGLGRLQLVRSVLRSLQVYWASVFMLPMKVQRDVDKILRAYLWRGKEEGRGGAKVAWDEVYL
PFDEGGLDIRDGSSGNIASTLKILWLLLVKSGSLAILRKRDILKAHVKMEVGNGRRCRVWLDPWIQGGLIIQQFGERVIYDAGSRRDARLVDFMDRDGVWRWPLVSLALM
DIWDSIQGVRSSLSIEDRWVWVPGSHDSFSIASTWETIRPHSSRVGWSGLLWGGGNIPKHSFCAWLAIRDRLGTRDRLSRWDRSIPLSGILCGGNYESCIGKGVRKKLWR
LLWCATIYFIWQDRNHRLHGGAVRELMVIFQIIRSCIKARAASWSDGVHGLIYNAFILLVPGLWEMWDWFGYLWVVLSSCLICECCSFLCLDLRLPVALSSTFWDETKKL
SLGITVRLLWARNFDNAQNRTTQWLNVNVRGLRPRFKFGRSFLGQNFDIAQNRTTQWWDVNFRGLRPRSKPLANSSALWDGTKKPSLGHHRWAFVGKKLRYRSKSDHPVV
GYQLLRPPTPPLGPFFCILGRNQKAEFGALLPLALSSAFCDRTKKLSPGHHRSAFTFLSSPSALSLAFCDGTKKLSPGHHRSAFVGKKTFLSSPSALSLAFCDGTKKLSS
RHHRSTFVAMKFRYRSKSDDPMAIGPFFGVLGWNQKSEFRASPFSIFVKKLQYHSKLDHPVVGCEILRRPTPRLALSSAFWDGTKKLSPEHHRSVFVNKKLRYRSKSDNL
VVGCELSRPSTPSLAIFSAFWDGTKKLSPGHHRSAFVGKELRYRSKSDHPVVGCELSRPPTPF