; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025874 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025874
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:22650091..22655000
RNA-Seq ExpressionLag0025874
SyntenyLag0025874
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]2.9e-17039.7Show/hide
Query:  WSGEDLGNV----ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGGVLIS---
        +S  DLG +    +   F F +     Q V  T+ DL   V VEV CVYASNS  ERR+LW +++ + S WSSPG+VMGDF AI    E F    I    
Subjt:  WSGEDLGNV----ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGGVLIS---

Query:  ------------------------LRWRHLM--ICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRAL
                                  W +L+  +  W G   H P++F P     RR  +FRFF+HW ++  F  VV   W  ++ VSPLV F RNL+  
Subjt:  ------------------------LRWRHLM--ICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRAL

Query:  KSVLRRKFGCSIDVISRQVVEARRRMEDAQVA-------VGLNPLSSSLWIRLLGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFV
        K +L R FG + +     V     R+E A +        + L   + + + R L  ++ F + LG + + + R L    S W+  I+ F           
Subjt:  KSVLRRKFGCSIDVISRQVVEARRRMEDAQVA-------VGLNPLSSSLWIRLLGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFV

Query:  MVLILWWMKEVIFLLANLTLQGLLLIIIEVFWGLSCLGTGPYSQIAEIVGFVWPEECGVD-----------------LGRSIVDNILLCQEL---VGDYH
        + +    ++ V+F + +    G     + +F G++          A +V  + P+ CG +                 + + + D + +       V  YH
Subjt:  MVLILWWMKEVIFLLANLTLQGLLLIIIEVFWGLSCLGTGPYSQIAEIVGFVWPEECGVD-----------------LGRSIVDNILLCQEL---VGDYH

Query:  LDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFG
        L+ G P C LKVDLQKAYDS+ WDFLFG+L+A+ TP++FVSW                              G+PLSP+ FVMVM+V SRMLN PP GF 
Subjt:  LDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFG

Query:  FHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQD----------------------------LALFIGVSLASLPVRYLGLPLISGRLMYRDCKP
        FH  CEKV LT L FADDLMIF   D  S+SFV++ LQ                             LA  +G  L +LPVRYL LPL++GRL   DC P
Subjt:  FHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQD----------------------------LALFIGVSLASLPVRYLGLPLISGRLMYRDCKP

Query:  LLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWSEV-----CLPREEGGL------GVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGR
        L++RIT+RI SW+ARVLS+ GR QLV+SV +S QV W+ V     C+  +   +       VR   SWN  + +K+LWLL   S SLWVAWVEAYIL  +
Subjt:  LLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWSEV-----CLPREEGGL------GVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGR

Query:  SLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQG
        SLW V      SWC RAIL  RD  +  +   +GDG  C  W+DPWL    I+ +  ERV+YDAAS   A +++F+ PDG W+WP VS+EL+ L   VQ 
Subjt:  SLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQG

Query:  VRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPF
        VRPCL+  D  VW+P   GGFS++SA + IRPR   V W  LLW GGN+PKHSF AWL ++++L TRDRLHRWD+S P   +LC G  ESRDHLFF CPF
Subjt:  VRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPF

Query:  NQVVWGGILSWGDCSHRVASWDTELSWIC-QVSGASV
         + VW  +L     SHR+  W  ELSWIC Q  G SV
Subjt:  NQVVWGGILSWGDCSHRVASWDTELSWIC-QVSGASV

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]4.3e-16938.28Show/hide
Query:  ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGG--------------------
        + + F F ++    + V GT+ DL   V VEV+CVYASNS  ERR LW  +  + SAWSS G+VMGDFNAIR   E FGG                    
Subjt:  ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGG--------------------

Query:  --------------------------VLISLRW------RHLMICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPL
                                  VL++  W        + +  W G   H P++F P   V  R  +FRFF+HW +E  F++VVA  W  ++ VS L
Subjt:  --------------------------VLISLRW------RHLMICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPL

Query:  VGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------------------------WIRL---------------
        V   RNL  LK +LRR+FG  I  +S +V  A+  M+ AQ  V  NPLS  L                             W+ L               
Subjt:  VGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------------------------WIRL---------------

Query:  ---LGLLSLFGS---------GLGGRSLR-FARSLGSS--GSRWVIIIL-----------------------------------------LFSTGVLR--
             LLSL  S         G+   ++  F+ SLGS   G R +  I+                                          FS G  +  
Subjt:  ---LGLLSLFGS---------GLGGRSLR-FARSLGSS--GSRWVIIIL-----------------------------------------LFSTGVLR--

Query:  ---LGRFVMVLILWWMKEVIFLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELV
           +G      +L +  E  +L   +    + LI        +E F  +SC     Y  I++I+          F+   +     GRSI++NILLCQELV
Subjt:  ---LGRFVMVLILWWMKEVIFLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELV

Query:  GDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPP
        G YHL+ G PRC LKVDLQKAYDSV WDFLFG+L+A+GTP++FVSW                              GDPLSP+LFVMVMEVLSRMLN  P
Subjt:  GDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPP

Query:  DGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIGV----------------SLASLPVRYLGLPLIS-----------GRLMYRD
          F FH RCEKV LTHL FADDLMIF A D  SISF+++CLQ    F G+                  AS     +G    S             L   D
Subjt:  DGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIGV----------------SLASLPVRYLGLPLIS-----------GRLMYRD

Query:  CKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEA
        C PL++RIT+RI SW+ARVLS+ GRLQLV+SVL+S QV W+                                     +VCLP EEGGLG+R   SWN A
Subjt:  CKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEA

Query:  AVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAP
          +K   +L+   GSLWVAW+EAYIL G+SLW V      SWC RAIL  R++ +  +                            ERV+YDAAS   A 
Subjt:  AVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAP

Query:  VADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLH
        ++DF+ P+G W WP VSLEL+ L   VQ V PCL+  DS VW+P R GGFS++SAWEAI PR   V W  LLW GGNIPKHSF AWL ++DRL TRDRLH
Subjt:  VADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLH

Query:  RWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC
        RWD+S P  C+LC G  ESRDHLFF CPF   VW  +    + SHR+  W  ELSWIC
Subjt:  RWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC

KAA0062318.1 uncharacterized protein E6C27_scaffold154G00690 [Cucumis melo var. makuwa]3.3e-16142.24Show/hide
Query:  HCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL
        H P++F P   +  +  +F FF+HW ++  F++VVA  W  ++ VSP+V   RNL  LK ++RR+FG  I  +S +V  A+  ++ AQ  V  NP+S  L
Subjt:  HCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL

Query:  WIRLLGLLSLFGSG--LGGRSLR-----------------FARSLGSSGSRWVIIILLFSTGVLRLGRFVM------VLILWWMKEVI------------
          +       F +   L   SLR                 F RS+ S  SR  +  L+ S G   +G   +      ++   W +E              
Subjt:  WIRLLGLLSLFGSG--LGGRSLR-----------------FARSLGSSGSRWVIIILLFSTGVLRLGRFVM------VLILWWMKEVI------------

Query:  -----------------FLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDY
                         F    +    + LI        +E F  +SC     Y  I++I+          F+   +     GRSI++NILLCQELVG Y
Subjt:  -----------------FLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDY

Query:  HLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGF
        HL+ G PRC LKVDLQKAYDSV WDFLFG+L+A+GTP++FVSW                              GDPLS +LFVMVMEVLSRMLN  P  F
Subjt:  HLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGF

Query:  GFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCK
         FH RCEKV LTHL FADDLMIF A +  SI F+++CLQ                             LA  +G    +L VRYLGLPL++GRL   D  
Subjt:  GFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCK

Query:  PLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEAAV
        PL++RIT+RI SW+ARVLS+ GRLQLV SVL+SFQV W+                                     +VCLP EEGGLG+R   SWN A+ 
Subjt:  PLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEAAV

Query:  MKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVA
        +K+LWL++  SGSLWVAWVEAYIL GRSLW V      SWC RAIL  R++ + L+R  +G+G     W+DPWLPEG+I+ +  ERV+YDAAS   A ++
Subjt:  MKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVA

Query:  DFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRW
        DF+ PDG W WP VSLEL+ L   VQ V PCL+  DS VW+P R GGFS++SAWEA+RPR   V W  LLW GGNI KH F AWL ++DRL T DRLHRW
Subjt:  DFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRW

Query:  DASSPTDCLLCF
        D+S P  C+L F
Subjt:  DASSPTDCLLCF

TYK12108.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]1.2e-15042.75Show/hide
Query:  AFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------W--IRL
        +F+FF+HW ++  F++VVA  W  ++ VSPLV   RNL  LK +L R+FG  I  +S ++  A+  M+ AQ  V  NP+S  L           W  +RL
Subjt:  AFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------W--IRL

Query:  ----------------------LGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFVMVLILWWMKEVIFLLANLTLQGLLLI-----
                              +  L L  S    R + F+   G +       +  F      +G      +L +  E  +L   +    + LI     
Subjt:  ----------------------LGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFVMVLILWWMKEVIFLLANLTLQGLLLI-----

Query:  --IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGT
           +E F  +SC     Y  I++I+          F+   +    LGRSI++NILLCQELVG YHL+ G PRC LKVDLQKAYDSV W+FLFG+L+A+GT
Subjt:  --IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGT

Query:  PVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQD
        P++FVSW                              G+PLSP+LFVMVMEVLSRMLN  P  F FH RCEKV LTHL FADDLMIF A D  SI F+++
Subjt:  PVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQD

Query:  CLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQV
        CLQ                             LA  +G    +LPVRYLGLPL++GRL   DC P+++RIT+RI SW ARVLS+ GRLQLV SVL+S QV
Subjt:  CLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQV

Query:  VWSEV-CLPREEGGLGVRHVR--------SWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGR
         W+ V  LP        + +R        SWN A+ +K+LWL++   GSLWVAWVEAYIL GRSLW V      SWC RAIL                  
Subjt:  VWSEV-CLPREEGGLGVRHVR--------SWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGR

Query:  RCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIV
                    G+I+ +  ERV+YDAAS   A ++DF+  +G W W  V LEL+ L   VQ V PCL+  DS VW+P R GGFS++SAW+AIRPR   V
Subjt:  RCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIV

Query:  PWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGIL
         W  LLW G NIPKHSF AWL ++DRL TRDRLHRWD+S P  C+LC G  ESRDHLFF CPF   VW  +L
Subjt:  PWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGIL

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]4.0e-15939.08Show/hide
Query:  VCSAWSSPGLVMGDFNAIRCSLEGFGG----------------------------------------------VLIS----LRWRHLM--ICGWLGDFYH
        + + W   G+VMGDFN IR   E FGG                                              +L++     +W ++   +  W G   H
Subjt:  VCSAWSSPGLVMGDFNAIRCSLEGFGG----------------------------------------------VLIS----LRWRHLM--ICGWLGDFYH

Query:  CPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSLW
         P++  P    ++   +FRFF+HW KE  F+DVV+ +W     VSP+V   RNLR LKS+LR  FG  I  IS  VV  R    + +            W
Subjt:  CPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSLW

Query:  IRLLGLLSLFGSGLGGRSLRFARSLGSSGS------------RWVIIILLFSTG--VLRLGRFVM--------------------VLILWWMKEVIFLLA
         R     S +   +  + ++  +S+ + GS              V   L    G   +R G F M                     +  ++ + V  +  
Subjt:  IRLLGLLSLFGSGLGGRSLRFARSLGSSGS------------RWVIIILLFSTG--VLRLGRFVM--------------------VLILWWMKEVIFLLA

Query:  NLTLQGLLLIIIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLF
         L  +      +E F  +SC     Y  I+ I+          FV   +     GRSI+DNILLCQELVG YHL RG PRC +KVDLQKAYDSV WDFLF
Subjt:  NLTLQGLLLIIIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLF

Query:  GVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDR
        G+L+A+G  ++FVSW                              GDPLS +LFVMVMEVLSRMLN PP  F FH  CEKV LTHL FADDLMIF A D 
Subjt:  GVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDR

Query:  SSISFVQDCLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQ
         S+SF+++ ++                             LA  +G S+  LPVRYLGLPL+ GRL   DC PL++RIT+RI SWSARVLS+ GRLQLV+
Subjt:  SSISFVQDCLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQ

Query:  SVLQSFQ-------------------------------------VVWSEVCLPREEGGLGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRS
        SVL+S Q                                     V W EVCLP +EGGL +R   SWN A+ +K+LWLL++KSGSLWVAWVEAYIL GRS
Subjt:  SVLQSFQ-------------------------------------VVWSEVCLPREEGGLGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRS

Query:  LWTVRPSPRWSWCWRAILSLRDQFRSLI------RFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLL
        +          W    +L L       I      R+++     C      W+  G+II +F ERVIYDA S   A + DF++ DG WRWP VSL+L+ + 
Subjt:  LWTVRPSPRWSWCWRAILSLRDQFRSLI------RFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLL

Query:  PSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLF
          +QGVRP  + ED  VW+P     FS++SAWE IRP SS V W  LLW  GNIPKHSF AWL +RDRL TRDRL +WD S P  C+LC G  ESRDHLF
Subjt:  PSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLF

Query:  FGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC
        F CPF   +W  IL +   SHR+  W  ELSWIC
Subjt:  FGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC

TrEMBL top hitse value%identityAlignment
A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein1.4e-17039.7Show/hide
Query:  WSGEDLGNV----ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGGVLIS---
        +S  DLG +    +   F F +     Q V  T+ DL   V VEV CVYASNS  ERR+LW +++ + S WSSPG+VMGDF AI    E F    I    
Subjt:  WSGEDLGNV----ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGGVLIS---

Query:  ------------------------LRWRHLM--ICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRAL
                                  W +L+  +  W G   H P++F P     RR  +FRFF+HW ++  F  VV   W  ++ VSPLV F RNL+  
Subjt:  ------------------------LRWRHLM--ICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRAL

Query:  KSVLRRKFGCSIDVISRQVVEARRRMEDAQVA-------VGLNPLSSSLWIRLLGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFV
        K +L R FG + +     V     R+E A +        + L   + + + R L  ++ F + LG + + + R L    S W+  I+ F           
Subjt:  KSVLRRKFGCSIDVISRQVVEARRRMEDAQVA-------VGLNPLSSSLWIRLLGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFV

Query:  MVLILWWMKEVIFLLANLTLQGLLLIIIEVFWGLSCLGTGPYSQIAEIVGFVWPEECGVD-----------------LGRSIVDNILLCQEL---VGDYH
        + +    ++ V+F + +    G     + +F G++          A +V  + P+ CG +                 + + + D + +       V  YH
Subjt:  MVLILWWMKEVIFLLANLTLQGLLLIIIEVFWGLSCLGTGPYSQIAEIVGFVWPEECGVD-----------------LGRSIVDNILLCQEL---VGDYH

Query:  LDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFG
        L+ G P C LKVDLQKAYDS+ WDFLFG+L+A+ TP++FVSW                              G+PLSP+ FVMVM+V SRMLN PP GF 
Subjt:  LDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFG

Query:  FHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQD----------------------------LALFIGVSLASLPVRYLGLPLISGRLMYRDCKP
        FH  CEKV LT L FADDLMIF   D  S+SFV++ LQ                             LA  +G  L +LPVRYL LPL++GRL   DC P
Subjt:  FHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQD----------------------------LALFIGVSLASLPVRYLGLPLISGRLMYRDCKP

Query:  LLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWSEV-----CLPREEGGL------GVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGR
        L++RIT+RI SW+ARVLS+ GR QLV+SV +S QV W+ V     C+  +   +       VR   SWN  + +K+LWLL   S SLWVAWVEAYIL  +
Subjt:  LLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWSEV-----CLPREEGGL------GVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGR

Query:  SLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQG
        SLW V      SWC RAIL  RD  +  +   +GDG  C  W+DPWL    I+ +  ERV+YDAAS   A +++F+ PDG W+WP VS+EL+ L   VQ 
Subjt:  SLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQG

Query:  VRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPF
        VRPCL+  D  VW+P   GGFS++SA + IRPR   V W  LLW GGN+PKHSF AWL ++++L TRDRLHRWD+S P   +LC G  ESRDHLFF CPF
Subjt:  VRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPF

Query:  NQVVWGGILSWGDCSHRVASWDTELSWIC-QVSGASV
         + VW  +L     SHR+  W  ELSWIC Q  G SV
Subjt:  NQVVWGGILSWGDCSHRVASWDTELSWIC-QVSGASV

A0A5A7TZS0 Reverse transcriptase domain-containing protein2.1e-16938.28Show/hide
Query:  ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGG--------------------
        + + F F ++    + V GT+ DL   V VEV+CVYASNS  ERR LW  +  + SAWSS G+VMGDFNAIR   E FGG                    
Subjt:  ESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCVYASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGG--------------------

Query:  --------------------------VLISLRW------RHLMICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPL
                                  VL++  W        + +  W G   H P++F P   V  R  +FRFF+HW +E  F++VVA  W  ++ VS L
Subjt:  --------------------------VLISLRW------RHLMICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPL

Query:  VGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------------------------WIRL---------------
        V   RNL  LK +LRR+FG  I  +S +V  A+  M+ AQ  V  NPLS  L                             W+ L               
Subjt:  VGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------------------------WIRL---------------

Query:  ---LGLLSLFGS---------GLGGRSLR-FARSLGSS--GSRWVIIIL-----------------------------------------LFSTGVLR--
             LLSL  S         G+   ++  F+ SLGS   G R +  I+                                          FS G  +  
Subjt:  ---LGLLSLFGS---------GLGGRSLR-FARSLGSS--GSRWVIIIL-----------------------------------------LFSTGVLR--

Query:  ---LGRFVMVLILWWMKEVIFLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELV
           +G      +L +  E  +L   +    + LI        +E F  +SC     Y  I++I+          F+   +     GRSI++NILLCQELV
Subjt:  ---LGRFVMVLILWWMKEVIFLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELV

Query:  GDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPP
        G YHL+ G PRC LKVDLQKAYDSV WDFLFG+L+A+GTP++FVSW                              GDPLSP+LFVMVMEVLSRMLN  P
Subjt:  GDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPP

Query:  DGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIGV----------------SLASLPVRYLGLPLIS-----------GRLMYRD
          F FH RCEKV LTHL FADDLMIF A D  SISF+++CLQ    F G+                  AS     +G    S             L   D
Subjt:  DGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIGV----------------SLASLPVRYLGLPLIS-----------GRLMYRD

Query:  CKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEA
        C PL++RIT+RI SW+ARVLS+ GRLQLV+SVL+S QV W+                                     +VCLP EEGGLG+R   SWN A
Subjt:  CKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEA

Query:  AVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAP
          +K   +L+   GSLWVAW+EAYIL G+SLW V      SWC RAIL  R++ +  +                            ERV+YDAAS   A 
Subjt:  AVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAP

Query:  VADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLH
        ++DF+ P+G W WP VSLEL+ L   VQ V PCL+  DS VW+P R GGFS++SAWEAI PR   V W  LLW GGNIPKHSF AWL ++DRL TRDRLH
Subjt:  VADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLH

Query:  RWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC
        RWD+S P  C+LC G  ESRDHLFF CPF   VW  +    + SHR+  W  ELSWIC
Subjt:  RWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC

A0A5A7U8G1 Non-LTR retroelement reverse transcriptase-like protein2.8e-14241.12Show/hide
Query:  AFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------W--IRL
        +F+FF+HW ++  F++VVA  W  ++ VSPLV   RNL  LK +L R+FG  I  +S ++  A+  M+ AQ  V  NP+S  L           W  +RL
Subjt:  AFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------W--IRL

Query:  ----------------------LGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFVMVLILWWMKEVIFLLANLTLQGLLLI-----
                              +  L L  S    R + F+   G +       +  F      +G      +L +  E  +L   +    + LI     
Subjt:  ----------------------LGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFVMVLILWWMKEVIFLLANLTLQGLLLI-----

Query:  --IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGT
           +E F  +SC     Y  I++I+          F+   +    LGRSI++NILLCQELVG YHL+ G PRC LKVDLQKAYDSV W+FLFG+L+A+GT
Subjt:  --IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGT

Query:  PVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQD
        P++FVSW                              G+PLSP+LFVMVMEVLSRMLN  P  F FH RCEKV LTHL FADDLMIF A D  SI F+++
Subjt:  PVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQD

Query:  CLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQV
        CLQ                             LA  +G    +LPVRYLGLPL++GRL   DC P+++RIT+RI SW ARVLS+ GRLQLV SVL+S QV
Subjt:  CLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQV

Query:  VWSEV-CLP----REEGGLGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLT
         W+ V  LP     EEGGL +R   SWN A+ +K+LWL++   GSLWVAWVEAYIL GRSLW V      SWC RAIL                      
Subjt:  VWSEV-CLP----REEGGLGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLT

Query:  WVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFR
                G+I+ +  ERV+YDAAS   A ++DF+  +G W W  V LEL+ L   VQ V PCL+  DS VW+P R GGFS++SAW+AIRPR   V W  
Subjt:  WVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFR

Query:  LLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC
        LLW G NIPKHSF AWL ++DRL TRDRLHR                                   +L     SHR+  W  ELSWIC
Subjt:  LLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWGDCSHRVASWDTELSWIC

A0A5A7V3Z0 Reverse transcriptase domain-containing protein1.6e-16142.24Show/hide
Query:  HCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL
        H P++F P   +  +  +F FF+HW ++  F++VVA  W  ++ VSP+V   RNL  LK ++RR+FG  I  +S +V  A+  ++ AQ  V  NP+S  L
Subjt:  HCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL

Query:  WIRLLGLLSLFGSG--LGGRSLR-----------------FARSLGSSGSRWVIIILLFSTGVLRLGRFVM------VLILWWMKEVI------------
          +       F +   L   SLR                 F RS+ S  SR  +  L+ S G   +G   +      ++   W +E              
Subjt:  WIRLLGLLSLFGSG--LGGRSLR-----------------FARSLGSSGSRWVIIILLFSTGVLRLGRFVM------VLILWWMKEVI------------

Query:  -----------------FLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDY
                         F    +    + LI        +E F  +SC     Y  I++I+          F+   +     GRSI++NILLCQELVG Y
Subjt:  -----------------FLLANLTLQGLLLI-------IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDY

Query:  HLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGF
        HL+ G PRC LKVDLQKAYDSV WDFLFG+L+A+GTP++FVSW                              GDPLS +LFVMVMEVLSRMLN  P  F
Subjt:  HLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGF

Query:  GFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCK
         FH RCEKV LTHL FADDLMIF A +  SI F+++CLQ                             LA  +G    +L VRYLGLPL++GRL   D  
Subjt:  GFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCK

Query:  PLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEAAV
        PL++RIT+RI SW+ARVLS+ GRLQLV SVL+SFQV W+                                     +VCLP EEGGLG+R   SWN A+ 
Subjt:  PLLERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWS-------------------------------------EVCLPREEGGLGVRHVRSWNEAAV

Query:  MKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVA
        +K+LWL++  SGSLWVAWVEAYIL GRSLW V      SWC RAIL  R++ + L+R  +G+G     W+DPWLPEG+I+ +  ERV+YDAAS   A ++
Subjt:  MKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVA

Query:  DFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRW
        DF+ PDG W WP VSLEL+ L   VQ V PCL+  DS VW+P R GGFS++SAWEA+RPR   V W  LLW GGNI KH F AWL ++DRL T DRLHRW
Subjt:  DFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRW

Query:  DASSPTDCLLCF
        D+S P  C+L F
Subjt:  DASSPTDCLLCF

A0A5D3CLU1 Non-LTR retroelement reverse transcriptase-like protein5.6e-15142.75Show/hide
Query:  AFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------W--IRL
        +F+FF+HW ++  F++VVA  W  ++ VSPLV   RNL  LK +L R+FG  I  +S ++  A+  M+ AQ  V  NP+S  L           W  +RL
Subjt:  AFRFFSHWAKEDGFLDVVAESWVPYQDVSPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSL-----------W--IRL

Query:  ----------------------LGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFVMVLILWWMKEVIFLLANLTLQGLLLI-----
                              +  L L  S    R + F+   G +       +  F      +G      +L +  E  +L   +    + LI     
Subjt:  ----------------------LGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFVMVLILWWMKEVIFLLANLTLQGLLLI-----

Query:  --IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGT
           +E F  +SC     Y  I++I+          F+   +    LGRSI++NILLCQELVG YHL+ G PRC LKVDLQKAYDSV W+FLFG+L+A+GT
Subjt:  --IIEVFWGLSCLGTGPYSQIAEIV---------GFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVGT

Query:  PVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQD
        P++FVSW                              G+PLSP+LFVMVMEVLSRMLN  P  F FH RCEKV LTHL FADDLMIF A D  SI F+++
Subjt:  PVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQD

Query:  CLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQV
        CLQ                             LA  +G    +LPVRYLGLPL++GRL   DC P+++RIT+RI SW ARVLS+ GRLQLV SVL+S QV
Subjt:  CLQ----------------------------DLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQV

Query:  VWSEV-CLPREEGGLGVRHVR--------SWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGR
         W+ V  LP        + +R        SWN A+ +K+LWL++   GSLWVAWVEAYIL GRSLW V      SWC RAIL                  
Subjt:  VWSEV-CLPREEGGLGVRHVR--------SWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGR

Query:  RCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIV
                    G+I+ +  ERV+YDAAS   A ++DF+  +G W W  V LEL+ L   VQ V PCL+  DS VW+P R GGFS++SAW+AIRPR   V
Subjt:  RCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPRSSIV

Query:  PWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGIL
         W  LLW G NIPKHSF AWL ++DRL TRDRLHRWD+S P  C+LC G  ESRDHLFF CPF   VW  +L
Subjt:  PWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGIL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.8e-0522.69Show/hide
Query:  HLDRGPPR--CVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVS------------------------------WGDPLSPYLFVMVMEVLSRMLNAPPD
        H++R   +   ++ +D +KA+D +Q  F+   L  +G    ++                                G PLSP LF +V+EVL+R +    +
Subjt:  HLDRGPPR--CVLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVS------------------------------WGDPLSPYLFVMVMEVLSRMLNAPPD

Query:  GFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIGVS-------------------------------LASLPVRYLGLPLISG--R
          G     E+V L+   FADD++++     + I   Q+ L+ ++ F  VS                               +AS  ++YLG+ L      
Subjt:  GFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIGVS-------------------------------LASLPVRYLGLPLISG--R

Query:  LMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVL
        L   + KPLL+ I    + W     S+VGR+ +V+  +
Subjt:  LMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVL

P0C2F6 Putative ribonuclease H protein At1g657501.4e-2927Show/hide
Query:  LPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQ-------------------------------------VVWSEVCLPREEGG
        +P++  R+       +LER+++R+S W  + LS+ GRL L ++VL S                                       V WS+VC P++EGG
Subjt:  LPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSFQ-------------------------------------VVWSEVCLPREEGG

Query:  LGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGR---SLWTVRPSPRWSWCWRAI-LSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPR
        LGVR  +S N A + K+ W L+ +  SLW   ++     G    S W + P   WS  WR+I + LRD     + +  GDG++   W D W+    ++  
Subjt:  LGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGR---SLWTVRPSPRWSWCWRAI-LSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPR

Query:  FSERVIYDAASSLYAPVADFLLPDGSWRWPSVS--------LEL-LLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIR----PRSSIVPWFRL
         +     D  + +     D  +P   W +  +         LEL  ++L  V G R      D + W  S+ G FSV SA+E +     PR ++  +F  
Subjt:  FSERVIYDAASSLYAPVADFLLPDGSWRWPSVS--------LEL-LLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIR----PRSSIVPWFRL

Query:  LWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVW
        LW      +     WL     + T +  HR   S+   C +C G  ES  H+   CP    +W
Subjt:  LWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVW

P11369 LINE-1 retrotransposable element ORF2 protein5.9e-0420.09Show/hide
Query:  VLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKV
        ++ +D +KA+D +Q  F+  VL   G    +++                               G PLSPYLF +V+EVL+R +    +  G     E+V
Subjt:  VLKVDLQKAYDSVQWDFLFGVLLAVGTPVQFVSW------------------------------GDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKV

Query:  GLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIG----------------------------VSLASLPVRYLGLPLIS--GRLMYRDCKPLLERIT
         ++ L  ADD++++ +  ++S   + + +      +G                             S+ +  ++YLG+ L      L  ++ K L + I 
Subjt:  GLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIG----------------------------VSLASLPVRYLGLPLIS--GRLMYRDCKPLLERIT

Query:  ARISSWSARVLSYVGRLQLVQSVL
          +  W     S++GR+ +V+  +
Subjt:  ARISSWSARVLSYVGRLQLVQSVL

Q03278 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)5.9e-0440Show/hide
Query:  PVQFVSWGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFS---AGDRSSISFVQDCLQDLAL
        P + V  GDPLSP LF  VM+ + R L   P+  GF    EK+G   L FADDL++ +    G ++S+S ++  LQ+  L
Subjt:  PVQFVSWGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFS---AGDRSSISFVQDCLQDLAL

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.9e-3033.97Show/hide
Query:  RSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQ
        R+ WT+  +   SW WR +  LR+  R  +   +G G     W D W   G                    P+ D + P G     +V L +        
Subjt:  RSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQ

Query:  GVRPCLAREDSVVW---LPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFF
        G+  C   +DS +W   L + S  FS +    A+ P++ IVPW++ +WF  ++PKH+FI W+   +RL+TRDRL  W  S P  CLLC    ESR HLFF
Subjt:  GVRPCLAREDSVVW---LPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFF

Query:  GCPFNQVVW
         CPF   VW
Subjt:  GCPFNQVVW

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.6e-2533.48Show/hide
Query:  ILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWP-SVSLELLLLLPSVQGV-RPCLAR-EDSVVWL
        +L LR      ++ ++G+GR    W D W   G +I    +         L A V + L  +G W+ P S S     +   +  +  P  A  EDS  W+
Subjt:  ILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWP-SVSLELLLLLPSVQGV-RPCLAR-EDSVVWL

Query:  PSR--SGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWG
               GFS +  W+AIRPR+  + W + +WF G +PKH+F  W+   DRL TR RL  W      DC LC    ESRDHL F C F   VW    S  
Subjt:  PSR--SGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWG

Query:  DCSHRV-ASWDTELSWICQVSGAS
            R+  SW   LSW+   S ++
Subjt:  DCSHRV-ASWDTELSWICQVSGAS

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.5e-4229.83Show/hide
Query:  SLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSF-------------------------------------QVVWSEV
        +LPVRYLGLPL++ ++   D  PL+E+I  RI  W+AR LS+ GRLQL+ SV+ S                                      +V WS+V
Subjt:  SLPVRYLGLPLISGRLMYRDCKPLLERITARISSWSARVLSYVGRLQLVQSVLQSF-------------------------------------QVVWSEV

Query:  CLPREEGGLGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGS
        C P++EGGLG+R ++  N+              GS W   +      G            SW W+ IL  R      ++  I +G     W D W   G 
Subjt:  CLPREEGGLGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGS

Query:  IIPRFSERVIYDAASSLYAPVADFLL---PDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGG------FSVSSAWEAIRPRSSIVPWFRL
        +I     R   D   +L+A VA+ ++   P        + +E ++     QG+    + ED+V W   +  G      F+    W A R     V W++ 
Subjt:  IIPRFSERVIYDAASSLYAPVADFLL---PDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGG------FSVSSAWEAIRPRSSIVPWFRL

Query:  LWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVV
        +WF    PK+S +AW+ +++RL T DR+  W+A + + C+LC    E+RDHLFF CP++  V
Subjt:  LWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVV

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-3036.79Show/hide
Query:  LRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPS-----VSLELLLLLPSVQGVRPCLAREDSVVW--
        LR   R  I   +G G     W D W+  G +I              + A V D L     W   S     + ++L  LLP  QG+  C   +DS +W  
Subjt:  LRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPS-----VSLELLLLLPSVQGVRPCLAREDSVVW--

Query:  -LPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVW
         L + S  FS    W A+ P+S  VPW + +WF  ++PKH+FI W+   +RL+TRDRL  W  S P +CLLC    +SR HLFF C F+ VVW
Subjt:  -LPSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVW

AT5G16486.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.3e-2734.48Show/hide
Query:  SWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCL------
        SW W++I  LR   R  +   +G G  C  W + W   G +I    +     +     A VAD  L DG W W + S       P +Q ++ CL      
Subjt:  SWCWRAILSLRDQFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCL------

Query:  -----AREDSVVWL---PSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFG
             A +D  +W       S GFS ++ W  + P    V W + +WF G IPKH+FI+W+ +R RL TRD+L  W    P+ CLLC    E+R HLFF 
Subjt:  -----AREDSVVWL---PSRSGGFSVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFG

Query:  CPF
        C F
Subjt:  CPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGCTGCGCCGACAGTGGTTGAGACGAAGCTTGCGGCAGGTCATCGTCTTCTTTCGTTCAGGCGGTGGCTAGGGCTCACGATCCTGCAGTCCGGGTTCAATTTGGG
CCGTTGGTCAGCGGGAATTGAATGGCAAACGGTGGTGGTGCCGGATGAGGATATTCTTGAGGAAGGAATTAGTTTATGGAGGAGTTCACTTGTGGGACAATTTATTGATG
CTCGGGTTCCTTTTTCGAATGGGCTGTGGCATATTGGGGGTAAGCCCCTCCTTCTTCGGAAATGGTCCCCAGAGGTTGAGTGTGGTTCTAGTGCCATAGGCAGACCGGTG
TGCCTTGATAAGGCTACTGAGGAGCGCAGGCGGTTGTCCTTTGCCGTGTTTGTGTGGAGATTGGGCCGAGGGATGAATTGCCAAGTACGGTGGAGGTGTGTATTCGTGGC
CAGAATTTTGTTGTGGGGTGGATTATTCTTGGAACCTAAGAGGTGTGTGCCTTGTGGTGTGTTTGGTCACTCGAGTGGTAGTTGTCCGAAGTCAGTGGGGGTTCGTGATG
AGGTTAGGGTTAAGCCTGTTGACCTCCCTTCTACTGTTCCTTCGACCAGTGCTCCTCCTGGATGTGGTCGTCGTGGGCGACAAGCCCTAGTCCCGTGTTTCGCCAGTTGG
CTTGAAGAGGGTGGGGTCGGTTTCGCCGGAAGGGTTGGGCTTCCTCCCCTTCGGGGTCCTCTTCCGTTAGCTATGGTGACGCCCCAGTCGAATTTTATTACTTTTGAGCG
TGCACCTCAGTTCAGTGTGGACCCTGAGTGGGGGAAGGGTGCTTTGGACCTTCCTTCTGATGACTATGTGCTTGTAGTTATGGGCGAGTGGTCTGGGGAGGATCTGGGTA
ATGTGGAGAGCGAATCGTTTCTCTTTCGTTCCTATTTCTGTGGGCCACAAATGGTTCATGGTACGGTTTGTGATCTTTCTTCTCAAGTTTCTGTTGAGGTGGTGTGTGTG
TATGCCTCCAACTCTGAGGGGGAGAGACGTGTGTTGTGGGAAGAGATTCTTGCTGTTTGCTCGGCTTGGAGTTCGCCAGGACTTGTGATGGGGGATTTCAATGCTATTCG
TTGTTCGCTTGAAGGGTTTGGGGGAGTCCTGATCTCTCTGAGATGGAGGCATTTAATGATATGCGGTTGGCTGGGGGATTTTTACCATTGCCCCTTAATTTTTGATCCTG
GGCTGAGTGTGGCTCGTAGGCCTCCTGCTTTTCGTTTTTTCAGTCATTGGGCCAAGGAGGATGGGTTTTTGGATGTTGTGGCTGAGAGTTGGGTTCCATATCAGGATGTG
TCCCCGTTGGTTGGGTTTGGGCGTAATCTTAGGGCGCTTAAATCTGTTCTTCGGAGGAAGTTTGGTTGTTCTATTGATGTTATTAGTAGGCAGGTCGTGGAGGCCAGGAG
GCGTATGGAGGATGCTCAAGTGGCTGTTGGGTTGAACCCTCTCTCTTCTAGTTTGTGGATCAGGCTGCTAGGGCTTCTCAGTCTTTTTGGGAGTGGGTTAGGAGGGAGGA
GTCTTCGCTTCGCTAGAAGTCTCGGGTCCAGTGGCTCTCGTTGGGTGATCATAATTCTGCTTTTTTCCACAGGTGTGTTAAGGCTAGGGCGATTCGTAATGGTCTTAATT
CTTTGGTGGATGAAGGAGGTAATATTCTTACTGGCCAATCTGACATTGCAAGGGCTGTTGTTGATTATTATAGAGGTATTCTGGGGACTCAGTTGCTTGGGTACAGGACC
TTACAGCCAGATTGCTGAGATTGTGGGGTTTGTCTGGCCTGAGGAGTGTGGGGTTGATCTGGGCAGAAGCATTGTGGATAATATTTTGCTATGTCAGGAGCTTGTGGGGG
ATTATCATCTGGATAGGGGGCCTCCCAGGTGTGTTCTGAAGGTTGACCTTCAGAAGGCTTATGACTCTGTCCAGTGGGACTTTCTGTTTGGGGTACTTCTAGCAGTGGGC
ACACCTGTCCAGTTTGTTAGTTGGGGGGATCCACTTTCTCCTTATCTGTTTGTGATGGTGATGGAGGTGTTATCACGTATGTTGAATGCTCCTCCGGATGGCTTTGGTTT
TCATTTTCGATGTGAGAAGGTGGGGTTGACCCACTTGTGCTTTGCAGATGACTTGATGATTTTCAGTGCAGGGGATCGGTCCTCTATTTCGTTTGTTCAGGATTGTCTCC
AGGATCTTGCCTTGTTCATTGGGGTCTCCTTAGCTTCTCTGCCTGTACGTTATCTTGGTCTCCCTCTTATTTCAGGTCGGTTAATGTATAGGGATTGTAAGCCTCTACTG
GAGCGGATTACTGCTCGGATTAGCTCTTGGTCTGCCCGGGTTTTGTCGTATGTAGGTCGCCTTCAGCTTGTTCAGTCGGTGCTCCAGAGCTTTCAGGTTGTCTGGTCGGA
GGTGTGCCTGCCTCGTGAGGAGGGGGGTTTGGGTGTCAGGCATGTTCGCTCGTGGAATGAGGCAGCGGTTATGAAGTTGTTGTGGTTGCTTATGTTGAAGTCTGGCTCGT
TGTGGGTTGCCTGGGTTGAGGCGTATATCTTGTGTGGGCGGTCCTTGTGGACTGTGCGCCCCTCTCCGCGATGGTCTTGGTGTTGGAGAGCGATTCTTAGTCTCCGGGAT
CAGTTTCGATCACTCATTCGCTTCAGCATTGGTGATGGTCGGAGATGTCTTACATGGGTGGATCCGTGGTTGCCTGAGGGTTCTATTATTCCTCGGTTTTCTGAGAGGGT
TATTTATGATGCTGCTAGCTCGTTGTATGCTCCCGTGGCTGATTTCCTTTTGCCTGACGGGTCTTGGCGTTGGCCTAGTGTATCGCTTGAGCTGTTGCTTCTTTTGCCTT
CAGTACAAGGGGTTCGACCGTGTCTTGCTAGGGAGGATAGTGTGGTTTGGCTTCCGAGTAGGTCTGGGGGTTTCTCTGTCTCCAGTGCTTGGGAGGCTATTAGGCCTCGT
TCTTCGATTGTTCCTTGGTTTCGTTTGTTATGGTTTGGGGGCAATATCCCTAAACACTCCTTTATTGCTTGGCTTGGGGTGCGGGATAGGCTCTATACGAGGGACCGGTT
GCATCGTTGGGATGCATCGAGCCCTACTGATTGTCTGTTATGCTTTGGTGCCCCGGAGTCCCGAGATCATCTGTTTTTTGGGTGCCCTTTCAACCAGGTTGTATGGGGTG
GGATCTTGAGTTGGGGTGACTGTTCTCATAGGGTTGCCTCTTGGGATACTGAGTTGTCTTGGATCTGTCAGGTTAGTGGGGCAAGCGTCGTAGCTGTCGAGTGTGGAGGG
TTGCCTGGTGTGCCTCGATTTATCATATTTGGAGGGAGAGGAATCTGCGTCTCCATGGTTCGGTCCTCGCTCTCCTCAGACTGTCTTACATGTGGTTCAGGCAGATATGC
CGGTCGTTGTGCCTCCTGGCTAGGAGGTGTGTCTTCTCCCCTACCTGGGGCATCTCCATGGAGCATGTTTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAAGGCTGCGCCGACAGTGGTTGAGACGAAGCTTGCGGCAGGTCATCGTCTTCTTTCGTTCAGGCGGTGGCTAGGGCTCACGATCCTGCAGTCCGGGTTCAATTTGGG
CCGTTGGTCAGCGGGAATTGAATGGCAAACGGTGGTGGTGCCGGATGAGGATATTCTTGAGGAAGGAATTAGTTTATGGAGGAGTTCACTTGTGGGACAATTTATTGATG
CTCGGGTTCCTTTTTCGAATGGGCTGTGGCATATTGGGGGTAAGCCCCTCCTTCTTCGGAAATGGTCCCCAGAGGTTGAGTGTGGTTCTAGTGCCATAGGCAGACCGGTG
TGCCTTGATAAGGCTACTGAGGAGCGCAGGCGGTTGTCCTTTGCCGTGTTTGTGTGGAGATTGGGCCGAGGGATGAATTGCCAAGTACGGTGGAGGTGTGTATTCGTGGC
CAGAATTTTGTTGTGGGGTGGATTATTCTTGGAACCTAAGAGGTGTGTGCCTTGTGGTGTGTTTGGTCACTCGAGTGGTAGTTGTCCGAAGTCAGTGGGGGTTCGTGATG
AGGTTAGGGTTAAGCCTGTTGACCTCCCTTCTACTGTTCCTTCGACCAGTGCTCCTCCTGGATGTGGTCGTCGTGGGCGACAAGCCCTAGTCCCGTGTTTCGCCAGTTGG
CTTGAAGAGGGTGGGGTCGGTTTCGCCGGAAGGGTTGGGCTTCCTCCCCTTCGGGGTCCTCTTCCGTTAGCTATGGTGACGCCCCAGTCGAATTTTATTACTTTTGAGCG
TGCACCTCAGTTCAGTGTGGACCCTGAGTGGGGGAAGGGTGCTTTGGACCTTCCTTCTGATGACTATGTGCTTGTAGTTATGGGCGAGTGGTCTGGGGAGGATCTGGGTA
ATGTGGAGAGCGAATCGTTTCTCTTTCGTTCCTATTTCTGTGGGCCACAAATGGTTCATGGTACGGTTTGTGATCTTTCTTCTCAAGTTTCTGTTGAGGTGGTGTGTGTG
TATGCCTCCAACTCTGAGGGGGAGAGACGTGTGTTGTGGGAAGAGATTCTTGCTGTTTGCTCGGCTTGGAGTTCGCCAGGACTTGTGATGGGGGATTTCAATGCTATTCG
TTGTTCGCTTGAAGGGTTTGGGGGAGTCCTGATCTCTCTGAGATGGAGGCATTTAATGATATGCGGTTGGCTGGGGGATTTTTACCATTGCCCCTTAATTTTTGATCCTG
GGCTGAGTGTGGCTCGTAGGCCTCCTGCTTTTCGTTTTTTCAGTCATTGGGCCAAGGAGGATGGGTTTTTGGATGTTGTGGCTGAGAGTTGGGTTCCATATCAGGATGTG
TCCCCGTTGGTTGGGTTTGGGCGTAATCTTAGGGCGCTTAAATCTGTTCTTCGGAGGAAGTTTGGTTGTTCTATTGATGTTATTAGTAGGCAGGTCGTGGAGGCCAGGAG
GCGTATGGAGGATGCTCAAGTGGCTGTTGGGTTGAACCCTCTCTCTTCTAGTTTGTGGATCAGGCTGCTAGGGCTTCTCAGTCTTTTTGGGAGTGGGTTAGGAGGGAGGA
GTCTTCGCTTCGCTAGAAGTCTCGGGTCCAGTGGCTCTCGTTGGGTGATCATAATTCTGCTTTTTTCCACAGGTGTGTTAAGGCTAGGGCGATTCGTAATGGTCTTAATT
CTTTGGTGGATGAAGGAGGTAATATTCTTACTGGCCAATCTGACATTGCAAGGGCTGTTGTTGATTATTATAGAGGTATTCTGGGGACTCAGTTGCTTGGGTACAGGACC
TTACAGCCAGATTGCTGAGATTGTGGGGTTTGTCTGGCCTGAGGAGTGTGGGGTTGATCTGGGCAGAAGCATTGTGGATAATATTTTGCTATGTCAGGAGCTTGTGGGGG
ATTATCATCTGGATAGGGGGCCTCCCAGGTGTGTTCTGAAGGTTGACCTTCAGAAGGCTTATGACTCTGTCCAGTGGGACTTTCTGTTTGGGGTACTTCTAGCAGTGGGC
ACACCTGTCCAGTTTGTTAGTTGGGGGGATCCACTTTCTCCTTATCTGTTTGTGATGGTGATGGAGGTGTTATCACGTATGTTGAATGCTCCTCCGGATGGCTTTGGTTT
TCATTTTCGATGTGAGAAGGTGGGGTTGACCCACTTGTGCTTTGCAGATGACTTGATGATTTTCAGTGCAGGGGATCGGTCCTCTATTTCGTTTGTTCAGGATTGTCTCC
AGGATCTTGCCTTGTTCATTGGGGTCTCCTTAGCTTCTCTGCCTGTACGTTATCTTGGTCTCCCTCTTATTTCAGGTCGGTTAATGTATAGGGATTGTAAGCCTCTACTG
GAGCGGATTACTGCTCGGATTAGCTCTTGGTCTGCCCGGGTTTTGTCGTATGTAGGTCGCCTTCAGCTTGTTCAGTCGGTGCTCCAGAGCTTTCAGGTTGTCTGGTCGGA
GGTGTGCCTGCCTCGTGAGGAGGGGGGTTTGGGTGTCAGGCATGTTCGCTCGTGGAATGAGGCAGCGGTTATGAAGTTGTTGTGGTTGCTTATGTTGAAGTCTGGCTCGT
TGTGGGTTGCCTGGGTTGAGGCGTATATCTTGTGTGGGCGGTCCTTGTGGACTGTGCGCCCCTCTCCGCGATGGTCTTGGTGTTGGAGAGCGATTCTTAGTCTCCGGGAT
CAGTTTCGATCACTCATTCGCTTCAGCATTGGTGATGGTCGGAGATGTCTTACATGGGTGGATCCGTGGTTGCCTGAGGGTTCTATTATTCCTCGGTTTTCTGAGAGGGT
TATTTATGATGCTGCTAGCTCGTTGTATGCTCCCGTGGCTGATTTCCTTTTGCCTGACGGGTCTTGGCGTTGGCCTAGTGTATCGCTTGAGCTGTTGCTTCTTTTGCCTT
CAGTACAAGGGGTTCGACCGTGTCTTGCTAGGGAGGATAGTGTGGTTTGGCTTCCGAGTAGGTCTGGGGGTTTCTCTGTCTCCAGTGCTTGGGAGGCTATTAGGCCTCGT
TCTTCGATTGTTCCTTGGTTTCGTTTGTTATGGTTTGGGGGCAATATCCCTAAACACTCCTTTATTGCTTGGCTTGGGGTGCGGGATAGGCTCTATACGAGGGACCGGTT
GCATCGTTGGGATGCATCGAGCCCTACTGATTGTCTGTTATGCTTTGGTGCCCCGGAGTCCCGAGATCATCTGTTTTTTGGGTGCCCTTTCAACCAGGTTGTATGGGGTG
GGATCTTGAGTTGGGGTGACTGTTCTCATAGGGTTGCCTCTTGGGATACTGAGTTGTCTTGGATCTGTCAGGTTAGTGGGGCAAGCGTCGTAGCTGTCGAGTGTGGAGGG
TTGCCTGGTGTGCCTCGATTTATCATATTTGGAGGGAGAGGAATCTGCGTCTCCATGGTTCGGTCCTCGCTCTCCTCAGACTGTCTTACATGTGGTTCAGGCAGATATGC
CGGTCGTTGTGCCTCCTGGCTAGGAGGTGTGTCTTCTCCCCTACCTGGGGCATCTCCATGGAGCATGTTTGCCTGA
Protein sequenceShow/hide protein sequence
MKAAPTVVETKLAAGHRLLSFRRWLGLTILQSGFNLGRWSAGIEWQTVVVPDEDILEEGISLWRSSLVGQFIDARVPFSNGLWHIGGKPLLLRKWSPEVECGSSAIGRPV
CLDKATEERRRLSFAVFVWRLGRGMNCQVRWRCVFVARILLWGGLFLEPKRCVPCGVFGHSSGSCPKSVGVRDEVRVKPVDLPSTVPSTSAPPGCGRRGRQALVPCFASW
LEEGGVGFAGRVGLPPLRGPLPLAMVTPQSNFITFERAPQFSVDPEWGKGALDLPSDDYVLVVMGEWSGEDLGNVESESFLFRSYFCGPQMVHGTVCDLSSQVSVEVVCV
YASNSEGERRVLWEEILAVCSAWSSPGLVMGDFNAIRCSLEGFGGVLISLRWRHLMICGWLGDFYHCPLIFDPGLSVARRPPAFRFFSHWAKEDGFLDVVAESWVPYQDV
SPLVGFGRNLRALKSVLRRKFGCSIDVISRQVVEARRRMEDAQVAVGLNPLSSSLWIRLLGLLSLFGSGLGGRSLRFARSLGSSGSRWVIIILLFSTGVLRLGRFVMVLI
LWWMKEVIFLLANLTLQGLLLIIIEVFWGLSCLGTGPYSQIAEIVGFVWPEECGVDLGRSIVDNILLCQELVGDYHLDRGPPRCVLKVDLQKAYDSVQWDFLFGVLLAVG
TPVQFVSWGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCEKVGLTHLCFADDLMIFSAGDRSSISFVQDCLQDLALFIGVSLASLPVRYLGLPLISGRLMYRDCKPLL
ERITARISSWSARVLSYVGRLQLVQSVLQSFQVVWSEVCLPREEGGLGVRHVRSWNEAAVMKLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRD
QFRSLIRFSIGDGRRCLTWVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLELLLLLPSVQGVRPCLAREDSVVWLPSRSGGFSVSSAWEAIRPR
SSIVPWFRLLWFGGNIPKHSFIAWLGVRDRLYTRDRLHRWDASSPTDCLLCFGAPESRDHLFFGCPFNQVVWGGILSWGDCSHRVASWDTELSWICQVSGASVVAVECGG
LPGVPRFIIFGGRGICVSMVRSSLSSDCLTCGSGRYAGRCASWLGGVSSPLPGASPWSMFA