; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy12g005690 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy12g005690
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationChr12:24618902..24623035
RNA-Seq ExpressionLcy12g005690
SyntenyLcy12g005690
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042317.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]5.6e-19651.11Show/hide
Query:  VDYYKGINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQ
        V  +KG+NAT +TL+PKR   E + ++RPISCCNV+YKCI+KILA+R                  G P C LKVDLQKAYDS+ WDFLFGLL+A+ TP++
Subjt:  VDYYKGINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQ

Query:  FVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLC
        FVSW++AC +SPMFS+ +NGSLEGFF GR+G++QG+PLSP+ FVMVM+V SRMLN PP GF FH  C+KV LT L FADDLMIF   D   +SFV++ L 
Subjt:  FVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLC

Query:  RFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWA
        +F  L GL AN GK S+F AG +  VA  LA+ +G  L +LPVRYL LPL++GRL   DC PL++RIT+RI  W+ARVLS+AGR QLV+SV +S QV+WA
Subjt:  RFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWA

Query:  SVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWN-EAVVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWS
        SVF+LP+ V ++V ++LRSYLW+                          +R   SWN  + + +LWLL   S SLWVAWVEAYIL  +SLW V      S
Subjt:  SVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWN-EAVVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWS

Query:  WCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVV
        WC RAIL  RD  +  +   +GDG  C   +DPWL    I+ +  ERV+YDAAS   A +++F+ PDG W+WP VS+E++ L   VQ VRPCL+  D  V
Subjt:  WCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVV

Query:  WLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG
        W+P    GF ++SA + IRPR   V W  LLW GGN+PKHSF AWL ++++L TRDRLHRWDSS P   +LC G  ESRDHL F CPF + VW  +L   
Subjt:  WLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG

Query:  DCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW
          SHRI  W  ELSWI     G     ++WRV  CA+ Y IW+E N RLHG   R    +   +   +  R  SW
Subjt:  DCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW

KAA0046851.1 uncharacterized protein E6C27_scaffold19358G00020 [Cucumis melo var. makuwa]3.2e-20744.86Show/hide
Query:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL
        LK +LRR+FG  I  +S +V  A+ +M+ AQ  V  +PLS  L  QA+ A+++FW                                V++R+ RN L SL
Subjt:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL

Query:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------------------------------------------
        +D  G+ ++    +A++ V+Y+                                                                              
Subjt:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------------------------------------------

Query:  --------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPR
                      G+NAT+ITL+PK    E L D+RPISCCNVLYKCI+KILA+R                                        G PR
Subjt:  --------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPR

Query:  CVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQK
        C LKVDLQKAYDSV WDFLFGLL+A+GTP++FVSW+RAC +S MFS+ +NGSLEGFF GR+GL+QGDPLSP+LFVMVMEVLSRMLN  P  F FH RC+K
Subjt:  CVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQK

Query:  VGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITA
        V LTHL FADDLMIF A D   ISF+++CL +F   SGL ANP KSS+F  GV+   A  LA+ IG S  S P            L   DC PL++RIT+
Subjt:  VGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITA

Query:  RISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVSLLWLLML
        RI  W+ARVLS+AGRLQLV+SVL+S QV+WASVF+LPA V +EV ++LRSYLW+G     G  KVAW +VCLP EEGGLGIR   SWN  + + L +L+ 
Subjt:  RISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVSLLWLLML

Query:  KSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSW
          GSLWVAW+EAYIL G+SLW V      SWC RAIL  R++ +  +                            ERV+YDAAS   A ++DF+ P+G W
Subjt:  KSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSW

Query:  RWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCL
         WP VSLE++ L   VQ V PCL+  DS VW+P R  GF ++SAWEAI PR   V W  LLW GGNIPKHSF AWL ++DRL TRDRLHRWDSS P  C+
Subjt:  RWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCL

Query:  LCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFG
        LC G  ESRDHL F CPF   VW  +    + SHRI  W  ELSWI     GK    ++WRV WCA+IY IW ERN RLHG   R P  + H++   +  
Subjt:  LCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFG

Query:  RCASW
        R  SW
Subjt:  RCASW

KAA0062318.1 uncharacterized protein E6C27_scaffold154G00690 [Cucumis melo var. makuwa]1.7e-20049.6Show/hide
Query:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL
        LK ++RR+FG  I  +S +V  A+ +++ AQ  V  +P+S  L  QA  ++++FW                                V++R+ RN L SL
Subjt:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL

Query:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCIT
        +D  G    G  +++ V+ D  +                                         G+NAT+ITL+PK    E L D+RPISCCN LYKCI+
Subjt:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCIT

Query:  KILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVN
        KILA+R                                        G PRC LKVDLQKAYDSV WDFLFGLL+A+GTP++FVSW+RAC +SPMFS+ +N
Subjt:  KILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVN

Query:  GSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFC
        GSLEGFF GR+G++QGDPLS +LFVMVMEVLSRMLN  P  F FH RC+KV LTHL FADDLMIF A +   I F+++CL +F  LSGL ANP KSS+F 
Subjt:  GSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFC

Query:  AGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRS
        AGV+   A  LA+ +G    +L VRYLGLPL++GRL   D  PL++RIT+RI  W+ARVLS+AGRLQLV SVL+SFQV+WASVF+LPA V +EV ++LRS
Subjt:  AGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRS

Query:  YLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRF
        YLW+G     G  KVAW +VCLP EEGGLGIR   SWN A  + +LWL++  SGSLWVAWVEAYIL GRSLW V      SWC RAIL  R++ + L+R 
Subjt:  YLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRF

Query:  RIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIR
        ++G+G      +DPWLPEG+I+ +  ERV+YDAAS   A ++DF+ PDG W WP VSLE++ L   VQ V PCL+  DS VW+P R  GF ++SAWEA+R
Subjt:  RIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIR

Query:  PRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCF
        PR   V W  LLW GGNI KH F AWL ++DRL T DRLHRWDSS P  C+L F
Subjt:  PRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCF

TYK12108.1 non-LTR retroelement reverse transcriptase-like protein [Cucumis melo var. makuwa]1.2e-17746.51Show/hide
Query:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFWECV---------KARV-----------------------IRNGLN
        LK +L R+FG  I  +S ++  A+ +M+ AQ  V  +P+S  L  QA+ A+++FW  V         K+R+                       +R  L 
Subjt:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFWECV---------KARV-----------------------IRNGLN

Query:  SLMDEG--------------GNVLTGQSDIARVVVDYYK------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR-------------
        S MD G              G       D    V+ +++      G+NAT+ITL+PK    + L D+RPISCCNVLYKCI+KIL +R             
Subjt:  SLMDEG--------------GNVLTGQSDIARVVVDYYK------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR-------------

Query:  ---------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPL
                                   G PRC LKVDLQKAYDSV W+FLFGLL+A+GTP++FVSW+R C +SPMFS+ +NGSLEG F GR+ ++QG+PL
Subjt:  ---------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPL

Query:  SPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSL
        SP+LFVMVMEVLSRMLN  P  F FH RC+KV LTHL FADDLMIF A D   I F+++CL +F  LSGL ANP KS +F AGV+   A  LA+ +G   
Subjt:  SPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSL

Query:  ASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSE
         +LPVRYLGLPL++GRL   DC P+++RIT+RI  W ARVLS+AGRLQLV SVL+S QV+WASVF+LPA V +EV ++LRSYLW+  P            
Subjt:  ASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSE

Query:  VCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEG
                        SWN A  + +LWL++   GSLWVAWVEAYIL GRSLW V      SWC RAIL                              G
Subjt:  VCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEG

Query:  SIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIP
        +I+ +  ERV+YDAAS   A ++DF+  +G W W  V LE++ L   VQ V PCL+  DS VW+P R  GF ++SAW+AIRPR   V W  LLW G NIP
Subjt:  SIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIP

Query:  KHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGIL
        KHSF AWL ++DRL TRDRLHRWDSS P  C+LC G  ESRDHL F CPF   VW  +L
Subjt:  KHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGIL

XP_031737043.1 uncharacterized protein LOC116402131 [Cucumis sativus]6.6e-20548.03Show/hide
Query:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFWECVKARVIRNGLNSLMDEGGNVLTGQSDIARVV------------
        LKS+LR  FG  I  IS  VV  R    + +     +    +   ++    +   + VKAR   N L S++D  GN LT    +++ +            
Subjt:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFWECVKARVIRNGLNSLMDEGGNVLTGQSDIARVV------------

Query:  ----------------VDYYKG-----------INATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILA----------------------------
                        + ++KG           +N  +ITL+PKR   + L D+RPISCCNV+YKCI++ILA                            
Subjt:  ----------------VDYYKG-----------INATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILA----------------------------

Query:  ------------NRGPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLS
                    +RG PRC +KVDLQKAYDSV WDFLFGLL+A+G  I+FVSWVRAC +S MFS+ +NGSLEGFF GR+GL+QGDPLS +LFVMVMEVLS
Subjt:  ------------NRGPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLS

Query:  RMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLI
        RMLN PP  F FH  C+KV LTHL FADDLMIF A D   +SF+++ + RF  LSGL AN  KSS+F  GV+   A  LA+ +G S+  LPVRYLGLPL+
Subjt:  RMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLI

Query:  SGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIR
         GRL   DC PL++RIT+RI  WSARVLS+AGRLQLV+SVL+S QV+WASVF+LP +V  +V ++LRSYLW+G     G AKVAW EVCLP +EGGL IR
Subjt:  SGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIR

Query:  HVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLR----DQFRSLIRFR--IGDGRRCLTIVDPWLPEGSIIPRFS
           SWN A  + +LWLL++KSGSLWVAWVEAYIL GRS+          W    +L L        + ++R+R  +     C      W+  G+II +F 
Subjt:  HVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLR----DQFRSLIRFR--IGDGRRCLTIVDPWLPEGSIIPRFS

Query:  ERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAW
        ERVIYDA S   A + DF++ DG WRWP VSL+++ +   +QGVRP  + ED  VW+P     F ++SAWE IRP SS V W  LLW  GNIPKHSF AW
Subjt:  ERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAW

Query:  LGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRER
        L ++DRL TRDRL +WD S P  C+LC G  ESRDHL F CPF   +W  IL +   SHRI  W  ELSWI     GK    ++W + WCA+IY IW+ER
Subjt:  LGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRER

Query:  NLRLHGSGPRSP
        N  LHG   R P
Subjt:  NLRLHGSGPRSP

TrEMBL top hitse value%identityAlignment
A0A5A7TKU4 Non-LTR retroelement reverse transcriptase-like protein2.7e-19651.11Show/hide
Query:  VDYYKGINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQ
        V  +KG+NAT +TL+PKR   E + ++RPISCCNV+YKCI+KILA+R                  G P C LKVDLQKAYDS+ WDFLFGLL+A+ TP++
Subjt:  VDYYKGINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQ

Query:  FVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLC
        FVSW++AC +SPMFS+ +NGSLEGFF GR+G++QG+PLSP+ FVMVM+V SRMLN PP GF FH  C+KV LT L FADDLMIF   D   +SFV++ L 
Subjt:  FVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLC

Query:  RFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWA
        +F  L GL AN GK S+F AG +  VA  LA+ +G  L +LPVRYL LPL++GRL   DC PL++RIT+RI  W+ARVLS+AGR QLV+SV +S QV+WA
Subjt:  RFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWA

Query:  SVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWN-EAVVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWS
        SVF+LP+ V ++V ++LRSYLW+                          +R   SWN  + + +LWLL   S SLWVAWVEAYIL  +SLW V      S
Subjt:  SVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWN-EAVVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWS

Query:  WCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVV
        WC RAIL  RD  +  +   +GDG  C   +DPWL    I+ +  ERV+YDAAS   A +++F+ PDG W+WP VS+E++ L   VQ VRPCL+  D  V
Subjt:  WCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVV

Query:  WLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG
        W+P    GF ++SA + IRPR   V W  LLW GGN+PKHSF AWL ++++L TRDRLHRWDSS P   +LC G  ESRDHL F CPF + VW  +L   
Subjt:  WLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG

Query:  DCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW
          SHRI  W  ELSWI     G     ++WRV  CA+ Y IW+E N RLHG   R    +   +   +  R  SW
Subjt:  DCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW

A0A5A7TZS0 Reverse transcriptase domain-containing protein1.5e-20744.86Show/hide
Query:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL
        LK +LRR+FG  I  +S +V  A+ +M+ AQ  V  +PLS  L  QA+ A+++FW                                V++R+ RN L SL
Subjt:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL

Query:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------------------------------------------
        +D  G+ ++    +A++ V+Y+                                                                              
Subjt:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------------------------------------------

Query:  --------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPR
                      G+NAT+ITL+PK    E L D+RPISCCNVLYKCI+KILA+R                                        G PR
Subjt:  --------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPR

Query:  CVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQK
        C LKVDLQKAYDSV WDFLFGLL+A+GTP++FVSW+RAC +S MFS+ +NGSLEGFF GR+GL+QGDPLSP+LFVMVMEVLSRMLN  P  F FH RC+K
Subjt:  CVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQK

Query:  VGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITA
        V LTHL FADDLMIF A D   ISF+++CL +F   SGL ANP KSS+F  GV+   A  LA+ IG S  S P            L   DC PL++RIT+
Subjt:  VGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITA

Query:  RISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVSLLWLLML
        RI  W+ARVLS+AGRLQLV+SVL+S QV+WASVF+LPA V +EV ++LRSYLW+G     G  KVAW +VCLP EEGGLGIR   SWN  + + L +L+ 
Subjt:  RISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVSLLWLLML

Query:  KSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSW
          GSLWVAW+EAYIL G+SLW V      SWC RAIL  R++ +  +                            ERV+YDAAS   A ++DF+ P+G W
Subjt:  KSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSW

Query:  RWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCL
         WP VSLE++ L   VQ V PCL+  DS VW+P R  GF ++SAWEAI PR   V W  LLW GGNIPKHSF AWL ++DRL TRDRLHRWDSS P  C+
Subjt:  RWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCL

Query:  LCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFG
        LC G  ESRDHL F CPF   VW  +    + SHRI  W  ELSWI     GK    ++WRV WCA+IY IW ERN RLHG   R P  + H++   +  
Subjt:  LCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFG

Query:  RCASW
        R  SW
Subjt:  RCASW

A0A5A7V3Z0 Reverse transcriptase domain-containing protein8.1e-20149.6Show/hide
Query:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL
        LK ++RR+FG  I  +S +V  A+ +++ AQ  V  +P+S  L  QA  ++++FW                                V++R+ RN L SL
Subjt:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFW------------------------------ECVKARVIRNGLNSL

Query:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCIT
        +D  G    G  +++ V+ D  +                                         G+NAT+ITL+PK    E L D+RPISCCN LYKCI+
Subjt:  MDEGGNVLTGQSDIARVVVDYYK-----------------------------------------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCIT

Query:  KILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVN
        KILA+R                                        G PRC LKVDLQKAYDSV WDFLFGLL+A+GTP++FVSW+RAC +SPMFS+ +N
Subjt:  KILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVN

Query:  GSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFC
        GSLEGFF GR+G++QGDPLS +LFVMVMEVLSRMLN  P  F FH RC+KV LTHL FADDLMIF A +   I F+++CL +F  LSGL ANP KSS+F 
Subjt:  GSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFC

Query:  AGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRS
        AGV+   A  LA+ +G    +L VRYLGLPL++GRL   D  PL++RIT+RI  W+ARVLS+AGRLQLV SVL+SFQV+WASVF+LPA V +EV ++LRS
Subjt:  AGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRS

Query:  YLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRF
        YLW+G     G  KVAW +VCLP EEGGLGIR   SWN A  + +LWL++  SGSLWVAWVEAYIL GRSLW V      SWC RAIL  R++ + L+R 
Subjt:  YLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRF

Query:  RIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIR
        ++G+G      +DPWLPEG+I+ +  ERV+YDAAS   A ++DF+ PDG W WP VSLE++ L   VQ V PCL+  DS VW+P R  GF ++SAWEA+R
Subjt:  RIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIR

Query:  PRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCF
        PR   V W  LLW GGNI KH F AWL ++DRL T DRLHRWDSS P  C+L F
Subjt:  PRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCF

A0A5D3CLU1 Non-LTR retroelement reverse transcriptase-like protein5.7e-17846.51Show/hide
Query:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFWECV---------KARV-----------------------IRNGLN
        LK +L R+FG  I  +S ++  A+ +M+ AQ  V  +P+S  L  QA+ A+++FW  V         K+R+                       +R  L 
Subjt:  LKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAARASQSFWECV---------KARV-----------------------IRNGLN

Query:  SLMDEG--------------GNVLTGQSDIARVVVDYYK------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR-------------
        S MD G              G       D    V+ +++      G+NAT+ITL+PK    + L D+RPISCCNVLYKCI+KIL +R             
Subjt:  SLMDEG--------------GNVLTGQSDIARVVVDYYK------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR-------------

Query:  ---------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPL
                                   G PRC LKVDLQKAYDSV W+FLFGLL+A+GTP++FVSW+R C +SPMFS+ +NGSLEG F GR+ ++QG+PL
Subjt:  ---------------------------GPPRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPL

Query:  SPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSL
        SP+LFVMVMEVLSRMLN  P  F FH RC+KV LTHL FADDLMIF A D   I F+++CL +F  LSGL ANP KS +F AGV+   A  LA+ +G   
Subjt:  SPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSL

Query:  ASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSE
         +LPVRYLGLPL++GRL   DC P+++RIT+RI  W ARVLS+AGRLQLV SVL+S QV+WASVF+LPA V +EV ++LRSYLW+  P            
Subjt:  ASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSE

Query:  VCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEG
                        SWN A  + +LWL++   GSLWVAWVEAYIL GRSLW V      SWC RAIL                              G
Subjt:  VCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEG

Query:  SIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIP
        +I+ +  ERV+YDAAS   A ++DF+  +G W W  V LE++ L   VQ V PCL+  DS VW+P R  GF ++SAW+AIRPR   V W  LLW G NIP
Subjt:  SIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIP

Query:  KHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGIL
        KHSF AWL ++DRL TRDRLHRWDSS P  C+LC G  ESRDHL F CPF   VW  +L
Subjt:  KHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGIL

A0A5D3D7P6 Reverse transcriptase3.7e-17747.98Show/hide
Query:  GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSV
        G+NAT+ITL+PK    E L D+RPISCCNVLYKCI+KILA+R                                        G PRC LKVDLQKAYDSV
Subjt:  GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSV

Query:  QWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMI
         WDFLFGL +++ TP++FVSW+ AC +SPMFS+ +NGSLEGFF GR+G++QGDPLS +LFVMVMEVLSRMLN  P  F FH RC+K              
Subjt:  QWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMI

Query:  FSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAG
                         RF  LSGL ANP KSS+F AGV+   A  LA+ +G    +LPVRYLGLPL++GRL   DC PL++RIT+RI   SARVLS+AG
Subjt:  FSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAG

Query:  RLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAY
        RLQLV SVL S QV+WA VF+LPA V +                                EEGGLGIR   +W  A  + +LWL++  SGSLWVAWVEAY
Subjt:  RLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEA-VVSLLWLLMLKSGSLWVAWVEAY

Query:  ILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLL
        +L GRSLW V      SWC RAIL  +++ +  +R ++G+G RC   +DPWL  G+I+ R  ERV+YDAAS   A +++F+ PDG W WP          
Subjt:  ILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLL

Query:  PSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLL
                                GF ++SAWEAIRPR   V W  LLW GGNIPKHSF AWL ++DRL TRDR HRWDSS P  C+LC G  ESRDHL 
Subjt:  PSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLL

Query:  FECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW
        F CPF   VW  +L     SHRI  W  ELSWI      K    ++WRV WCA+IY IW ERN RLHG     P  + H++   +  R  SW
Subjt:  FECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-2224.93Show/hide
Query:  SITLVPKRPSPEGL--GDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWD
        SI L+PK P  +     ++RPIS  N+  K + KILANR                                             ++ +D +KA+D +Q  
Subjt:  SITLVPKRPSPEGL--GDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWD

Query:  FLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNG-SLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFS
        F+   L  +G    ++  +RA +  P  ++ +NG  LE F L + G +QG PLSP LF +V+EVL+R +    +  G     ++V L+   FADD++++ 
Subjt:  FLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNG-SLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFS

Query:  AGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISG--RLTYRDCKPLLERITARISYWSARVLSYAG
                 +   +  F  +SG   N  KS  F    +R     +   +  ++AS  ++YLG+ L      L   + KPLL+ I    + W     S+ G
Subjt:  AGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISG--RLTYRDCKPLLERITARISYWSARVLSYAG

Query:  RLQLVQSVL--QSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVS
        R+ +V+  +  +    F A    LP     E+ +    ++W        RA++A S +    + GG+ +   + + +A V+
Subjt:  RLQLVQSVL--QSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVS

P08548 LINE-1 reverse transcriptase homolog5.5e-2123.02Show/hide
Query:  SITLVPK-RPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDF
        +ITL+PK    P    +YRPIS  N+  K + KIL NR                                             +L +D +KA+D++Q  F
Subjt:  SITLVPK-RPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDF

Query:  LFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAG
        +   L  +G    F+  + A +S P  ++ +NG     F  R G +QG PLSP LF +VMEVL+  +       G H   +++ L+   FADD++++   
Subjt:  LFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAG

Query:  DRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISG--RLTYRDCKPLLERITARISYWSARVLSYAGRL
         R   + + + +  +  +SG   N  KS  F    +    + +   I  ++    ++YLG+ L      L   + + L + I   ++ W     S+ GR+
Subjt:  DRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISG--RLTYRDCKPLLERITARISYWSARVLSYAGRL

Query:  QLVQ-SVL-QSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVV
         +V+ S+L ++   F A     P     ++ +++  ++W        + ++A + +    + GG+ +  +R + +++V
Subjt:  QLVQ-SVL-QSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVV

P0C2F6 Putative ribonuclease H protein At1g657501.8e-3528.37Show/hide
Query:  LPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGG
        +P++  R+       +LER+++R+S W  + LS+AGRL L ++VL S  V   S  +LP  + + + +L R++LW        +  V WS+VC P++EGG
Subjt:  LPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGG

Query:  LGIRHVRSWNEAVVSLL-WLLMLKSGSLWVAWVEAYILCGR---SLWTVRPSPRWSWCWRAI-LSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPR
        LG+R  +S N A++S + W L+ +  SLW   ++     G    S W + P   WS  WR+I + LRD     + +  GDG++     D W+    ++  
Subjt:  LGIRHVRSWNEAVVSLL-WLLMLKSGSLWVAWVEAYILCGR---SLWTVRPSPRWSWCWRAI-LSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPR

Query:  FSERVIYDAASSLYAPVADFLLPDGSWRWPSVS--------LEV-LLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIR----PRSSIVPWFRL
         +     D  + +     D  +P   W +  +         LE+  ++L  V G R      D + W  S+   F V SA+E +     PR ++  +F  
Subjt:  FSERVIYDAASSLYAPVADFLLPDGSWRWPSVS--------LEV-LLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIR----PRSSIVPWFRL

Query:  LWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVW
        LW      +     WL     + T +  HR   S+   C +C G  ES  H+L +CP    +W
Subjt:  LWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVW

P11369 LINE-1 retrotransposable element ORF2 protein1.3e-2223.22Show/hide
Query:  SITLVPK-RPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDF
        +ITL+PK +  P  + ++RPIS  N+  K + KILANR                                             ++ +D +KA+D +Q  F
Subjt:  SITLVPK-RPSPEGLGDYRPISCCNVLYKCITKILANR----------------------------------------GPPRCVLKVDLQKAYDSVQWDF

Query:  LFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNG-SLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSA
        +  +L   G    +++ ++A +S P+ ++ VNG  LE   L + G +QG PLSPYLF +V+EVL+R +    +  G     ++V ++ L  ADD++++ +
Subjt:  LFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNG-SLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSA

Query:  GDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLIS--GRLTYRDCKPLLERITARISYWSARVLSYAGR
          ++    + + +  F  + G   N  KS  F    ++   +++      S+ +  ++YLG+ L      L  ++ K L + I   +  W     S+ GR
Subjt:  GDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLIS--GRLTYRDCKPLLERITARISYWSARVLSYAGR

Query:  LQLVQSVL--QSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVV
        + +V+  +  ++   F A    +P +  +E+   +  ++W      + + ++A S +   R  GG+ +  ++ +  A+V
Subjt:  LQLVQSVL--QSFQVFWASVFILPARVTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVV

P14381 Transposon TX1 uncharacterized 149 kDa protein4.1e-1624.49Show/hide
Query:  DIARVVVDYYK------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR---------------------------------------GP
        D  RV+ + +K            ++L+PK+     + ++RP+S  +  YK + K ++ R                                       G 
Subjt:  DIARVVVDYYK------GINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANR---------------------------------------GP

Query:  PRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRC
            L +D +KA+D V   +L G L A     QFV +++  ++S    V +N SL       RG++QG PLS  L+ + +E    +L     G       
Subjt:  PRCVLKVDLQKAYDSVQWDFLFGLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRC

Query:  QKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRL-TYRDCKPLLER
         +V L+   +ADD +I  A D   +   Q+C   +   S    N  KSS    G  +V     A F  +S  S  ++YLG+ L +      ++   L E 
Subjt:  QKVGLTHLCFADDLMIFSAGDRSFISFVQDCLCRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRL-TYRDCKPLLER

Query:  ITARISYWS--ARVLSYAGRLQLVQSVLQSFQVFWASVFILPAR-VTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRS
        +  R+  W   A+VLS  GR  ++  ++ S Q+++  + + P +    ++ R L  +LW       G+  V+     LP +EGG G+  +RS
Subjt:  ITARISYWS--ARVLSYAGRLQLVQSVLQSFQVFWASVFILPAR-VTHEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRS

Arabidopsis top hitse value%identityAlignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-3432.06Show/hide
Query:  RSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQ
        R+ WT+  +   SW WR +  LR+  R  +   +G G       D W   G                    P+ D + P G     +V L +        
Subjt:  RSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQ

Query:  GVRPCLAREDSVVW---LPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLF
        G+  C   +DS +W   L + S  F  +    A+ P++ IVPW++ +WF  ++PKH+FI W+   +RL+TRDRL  W  S P  CLLC    ESR HL F
Subjt:  GVRPCLAREDSVVW---LPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLF

Query:  ECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGR
        ECPF   VW       +     A     L W+   S  K  +  + R+A+ A +Y IWRERNL LH    R   +VL  +Q  +  R
Subjt:  ECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGR

AT1G60720.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein7.9e-3132.71Show/hide
Query:  ILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWP-SVSLEVLLLLPSVQGV-RPCLAR-EDSVVWL
        +L LR      ++  +G+GR      D W   G +I    +         L A V + L  +G W+ P S S     +   +  +  P  A  EDS  W+
Subjt:  ILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWP-SVSLEVLLLLPSVQGV-RPCLAR-EDSVVWL

Query:  PSR--SRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG
              +GF  +  W+AIRPR+  + W + +WF G +PKH+F  W+   DRL TR RL  W      DC LC    ESRDHLLF C F+  VW    S  
Subjt:  PSR--SRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG

Query:  DCSHRI-ASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADM
            R+  SW   LSW+   S       R  +V+  A IY+IWR+RN  LH +   +P  +  +V  ++
Subjt:  DCSHRI-ASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADM

AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.6e-5530.34Show/hide
Query:  AGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRS
        AGV      D+      +  +LPVRYLGLPL++ ++T  D  PL+E+I  RI  W+AR LS+AGRLQL+ SV+ S   FW S F LP+    E+  +  S
Subjt:  AGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARVTHEVYRLLRS

Query:  YLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFR
        +LW G      +AKVAWS+VC P++EGGLGIR ++  N+             GS W   +      G            SW W+ IL  R      ++  
Subjt:  YLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFR

Query:  IGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLL---PDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRG------FFV
        I +G       D W   G +I     R   D   +L+A VA+ ++   P        + +E ++     QG+    + ED+V W   +  G      F  
Subjt:  IGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLL---PDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRG------FFV

Query:  SSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDT
           W A R     V W++ +WF    PK+S +AW+ +++RL T DR+  W++ + + C+LC    E+RDHL F CP+S                      
Subjt:  SSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDT

Query:  ELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRS
        E+ ++ +++             +  +++ +W+ERN R HG  P++
Subjt:  ELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRS

AT4G04650.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein5.1e-3834.34Show/hide
Query:  LRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLL-----LLPSVQGVRPCLAREDSVVW--
        LR   R  I   +G G       D W+  G +I              + A V D L     W   S S   ++     LLP  QG+  C   +DS +W  
Subjt:  LRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLL-----LLPSVQGVRPCLAREDSVVW--

Query:  -LPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG
         L + S  F     W A+ P+S  VPW + +WF  ++PKH+FI W+   +RL+TRDRL  W  S P +CLLC    +SR HL FEC FS VVW    +  
Subjt:  -LPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWG

Query:  DCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQ
        + +      D  L+W+   S  ++  C + R+A+ + +Y IWRERN RLH    RS +++L  +Q
Subjt:  DCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQ

AT5G18880.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.4e-2628.21Show/hide
Query:  ILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSII-------PRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLP---SVQGVRPCLAR
        ++ L+      +R  +G+G       D W   G ++       PR   R+  DA     +   D+ LP        + L  L + P     +G    L R
Subjt:  ILSLRDQFRSLIRFRIGDGRRCLTIVDPWLPEGSII-------PRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLP---SVQGVRPCLAR

Query:  EDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGG
          +  +LPS    F     WE IR  S  VPW +++WF   IP+ S I W+   +RL TRDRL  W  + P+  +LC    E+  HL FEC FS  +W  
Subjt:  EDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRLLWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGG

Query:  ILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW
          S    S          SWI Q    +  S  + ++   +++YH+W+ERN R+  S   S  ++   +   M  R  S+
Subjt:  ILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYHIWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCCCCAGAGGTTTGAGTGTGGTTGCTAGTGCCATAGGCAAACCGGTGTGCCTTGATAAGGCTACTAAAGAGCGCAGGCGGTTGTCTTTTGCACGTATTTGTGTGGA
GATTGGGCCGAGGGATGAATTGCCAAGTACAGTGGAGGTGTGTATTCGTGGCCAAAACTTTGTTGTGGTTGTGGATTATTCTTGGAAGCCTAAGAGGTGTGCGCCTTGTG
GTGTGTTTGGTCACTCGAGTGGTAGTTGTCCGGAGTCAGTGGGGGTTCGTGACGAGGTTACTGTTAAGCCTGTTGACCTCCCTTCTACGGTTCCTTCTTCTAGTGCCCCT
CCTGCAGGTGGTCGTCGTGGGTCGACAAGCCCTAGTCCCTGTGTTTCGCCGGTTGGCTCGAAGAGGGTGGGGCCAGCTTCACCGATTAAAAGGGTTGGAGCTTCCTCCCC
TTCGGGTCATTCGAGCTCAGGGAACTCGTTTGCTATGTTGAAGGGTGAGGATGAGTTTGCACTTGCTGTGGCGCTTAAATCTGTTCTTCGGAGGAAGTTTGGGTGTTCTA
TTGCTGTTATTAGTAGGCAGGTGGTGGAGGCCAGGAGGAGTATGGAGGATGCTCAAGTGGCTGTTGGGCTGGACCCTCTTTCTTCTAGTCTTGTGGATCAGGCTGCTAGG
GCTTCTCAGTCTTTTTGGGAGTGTGTTAAGGCCAGGGTGATTCGTAATGGTCTTAATTCTTTGATGGATGAGGGAGGTAATGTTCTTACTGGCCAGTCTGACATTGCGAG
GGTTGTTGTTGATTATTATAAAGGTATTAATGCTACTTCTATCACTCTAGTTCCTAAGCGACCTAGTCCAGAAGGGTTGGGTGATTATCGGCCTATTTCTTGCTGTAATG
TGTTGTATAAATGCATTACTAAGATCTTAGCTAACAGGGGGCCTCCCAGATGTGTTCTGAAGGTTGACCTTCAGAAGGCTTATGATTCTGTACAGTGGGACTTTCTGTTT
GGGCTGCTTTTAGCGGTGGGTACACCTATCCAGTTTGTTAGTTGGGTGCGAGCTTGTTTTTCGTCTCCTATGTTCTCTGTGTCAGTAAATGGTTCCCTTGAGGGGTTCTT
TCTGGGGAGGAGGGGGCTGAAACAGGGGGATCCACTTTCTCCTTATCTATTTGTGATGGTGATGGAGGTGTTATCACGCATGTTGAATGCTCCTCCAGATGGCTTTGGTT
TTCATTTTCGATGTCAGAAGGTGGGGTTGACCCACTTGTGCTTTGCAGATGACTTGATGATTTTCAGTGCAGGGGATCGGTCCTTCATTTCGTTTGTTCAGGATTGTCTC
TGTAGGTTTCAGGTTCTTTCGGGGCTTGTGGCTAACCCTGGGAAGAGTTCCTTGTTCTGTGCGGGGGTGTCTCGAGTGGTTGCAGAGGATCTTGCCTCGTTCATTGGGGT
CTCCTTAGCTTCTCTGCCTGTACGTTATCTTGGTCTCCCTCTTATTTCAGGTCGGTTAACGTATAGGGATTGTAAGCCTCTGTTGGAGCGGATCACTGCTCGGATTAGCT
ATTGGTCTGCCCGGGTTTTGTCGTATGCTGGTCGCCTTCAGCTTGTTCAGTCGGTACTTCAGAGCTTTCAGGTCTTTTGGGCTAGTGTGTTTATTCTCCCTGCTAGGGTC
ACCCATGAGGTTTATCGGTTGTTACGTTCTTATTTATGGAAGGGTGATCCGGCCTTGCATGGTAGGGCCAAGGTTGCCTGGTCGGAGGTGTGCCTTCCTCGTGAGGAGGG
GGGCTTGGGTATCAGGCATGTTCGCTCGTGGAATGAGGCAGTAGTTTCGTTGTTGTGGCTGCTTATGTTGAAGTCTGGTTCGTTGTGGGTTGCTTGGGTTGAGGCTTATA
TCTTGTGTGGGCGGTCTCTGTGGACTGTGCGCCCCTCTCCGCGGTGGTCTTGGTGTTGGAGAGCGATTCTTAGTCTTCGGGATCAGTTTCGATCACTCATTCGTTTCCGC
ATTGGTGATGGTCGGAGATGTCTTACAATTGTGGATCCGTGGTTGCCTGAGGGTTCTATTATTCCTCGTTTTTCTGAGAGGGTAATTTATGATGCTGCTAGCTCGTTGTA
TGCTCCCGTGGCTGATTTCCTCTTGCCTGATGGGTCTTGGCGTTGGCCTAGTGTTTCGCTTGAGGTATTGCTTCTTTTGCCTTCAGTACAAGGGGTTCGACCGTGCCTTG
CTAGGGAGGATAGTGTGGTTTGGCTCCCGAGTAGGTCTAGGGGTTTCTTTGTCTCCAGTGCTTGGGAGGCTATTAGGCCTCGTTCTTCGATTGTTCCTTGGTTTCGCTTG
TTATGGTTTGGGGGTAATATCCCTAAGCACTCCTTTATTGCTTGGCTTGGGGTGCAGGATAGGCTCTATACGAGGGACCGGTTGCATCGATGGGATTCTTCGAGCCCTAC
TGATTGTCTGTTATGCTTTGGTGCCCCGGAGTCCCGAGATCATCTGCTTTTTGAGTGCCCTTTCAGCCAGGTCGTATGGGGTGGGATCTTGAGTTGGGGTGACTGTTCTC
ATAGGATTGCCTCATGGGATACTGAGCTGTCTTGGATCTTCCAGTTTAGTGGGGGCAAGCGTCGTAGCTGCCGAGTGTGGAGGGTTGCCTGGTGTGCCTCGATCTACCAC
ATTTGGAGGGAGAGGAATCTGCGTCTCCATGGTTCGGGTCCTCGCTCTCCTCAGACTGTCTTACATGTGGTTCAGGCTGATATGTTCGGTCGTTGTGCCTCCTGGCTAGG
GGGTGTGTCTTCTCCCCTACCAGGGGCGTCTCCCTGGAGCCTGTTTGCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCCCCAGAGGTTTGAGTGTGGTTGCTAGTGCCATAGGCAAACCGGTGTGCCTTGATAAGGCTACTAAAGAGCGCAGGCGGTTGTCTTTTGCACGTATTTGTGTGGA
GATTGGGCCGAGGGATGAATTGCCAAGTACAGTGGAGGTGTGTATTCGTGGCCAAAACTTTGTTGTGGTTGTGGATTATTCTTGGAAGCCTAAGAGGTGTGCGCCTTGTG
GTGTGTTTGGTCACTCGAGTGGTAGTTGTCCGGAGTCAGTGGGGGTTCGTGACGAGGTTACTGTTAAGCCTGTTGACCTCCCTTCTACGGTTCCTTCTTCTAGTGCCCCT
CCTGCAGGTGGTCGTCGTGGGTCGACAAGCCCTAGTCCCTGTGTTTCGCCGGTTGGCTCGAAGAGGGTGGGGCCAGCTTCACCGATTAAAAGGGTTGGAGCTTCCTCCCC
TTCGGGTCATTCGAGCTCAGGGAACTCGTTTGCTATGTTGAAGGGTGAGGATGAGTTTGCACTTGCTGTGGCGCTTAAATCTGTTCTTCGGAGGAAGTTTGGGTGTTCTA
TTGCTGTTATTAGTAGGCAGGTGGTGGAGGCCAGGAGGAGTATGGAGGATGCTCAAGTGGCTGTTGGGCTGGACCCTCTTTCTTCTAGTCTTGTGGATCAGGCTGCTAGG
GCTTCTCAGTCTTTTTGGGAGTGTGTTAAGGCCAGGGTGATTCGTAATGGTCTTAATTCTTTGATGGATGAGGGAGGTAATGTTCTTACTGGCCAGTCTGACATTGCGAG
GGTTGTTGTTGATTATTATAAAGGTATTAATGCTACTTCTATCACTCTAGTTCCTAAGCGACCTAGTCCAGAAGGGTTGGGTGATTATCGGCCTATTTCTTGCTGTAATG
TGTTGTATAAATGCATTACTAAGATCTTAGCTAACAGGGGGCCTCCCAGATGTGTTCTGAAGGTTGACCTTCAGAAGGCTTATGATTCTGTACAGTGGGACTTTCTGTTT
GGGCTGCTTTTAGCGGTGGGTACACCTATCCAGTTTGTTAGTTGGGTGCGAGCTTGTTTTTCGTCTCCTATGTTCTCTGTGTCAGTAAATGGTTCCCTTGAGGGGTTCTT
TCTGGGGAGGAGGGGGCTGAAACAGGGGGATCCACTTTCTCCTTATCTATTTGTGATGGTGATGGAGGTGTTATCACGCATGTTGAATGCTCCTCCAGATGGCTTTGGTT
TTCATTTTCGATGTCAGAAGGTGGGGTTGACCCACTTGTGCTTTGCAGATGACTTGATGATTTTCAGTGCAGGGGATCGGTCCTTCATTTCGTTTGTTCAGGATTGTCTC
TGTAGGTTTCAGGTTCTTTCGGGGCTTGTGGCTAACCCTGGGAAGAGTTCCTTGTTCTGTGCGGGGGTGTCTCGAGTGGTTGCAGAGGATCTTGCCTCGTTCATTGGGGT
CTCCTTAGCTTCTCTGCCTGTACGTTATCTTGGTCTCCCTCTTATTTCAGGTCGGTTAACGTATAGGGATTGTAAGCCTCTGTTGGAGCGGATCACTGCTCGGATTAGCT
ATTGGTCTGCCCGGGTTTTGTCGTATGCTGGTCGCCTTCAGCTTGTTCAGTCGGTACTTCAGAGCTTTCAGGTCTTTTGGGCTAGTGTGTTTATTCTCCCTGCTAGGGTC
ACCCATGAGGTTTATCGGTTGTTACGTTCTTATTTATGGAAGGGTGATCCGGCCTTGCATGGTAGGGCCAAGGTTGCCTGGTCGGAGGTGTGCCTTCCTCGTGAGGAGGG
GGGCTTGGGTATCAGGCATGTTCGCTCGTGGAATGAGGCAGTAGTTTCGTTGTTGTGGCTGCTTATGTTGAAGTCTGGTTCGTTGTGGGTTGCTTGGGTTGAGGCTTATA
TCTTGTGTGGGCGGTCTCTGTGGACTGTGCGCCCCTCTCCGCGGTGGTCTTGGTGTTGGAGAGCGATTCTTAGTCTTCGGGATCAGTTTCGATCACTCATTCGTTTCCGC
ATTGGTGATGGTCGGAGATGTCTTACAATTGTGGATCCGTGGTTGCCTGAGGGTTCTATTATTCCTCGTTTTTCTGAGAGGGTAATTTATGATGCTGCTAGCTCGTTGTA
TGCTCCCGTGGCTGATTTCCTCTTGCCTGATGGGTCTTGGCGTTGGCCTAGTGTTTCGCTTGAGGTATTGCTTCTTTTGCCTTCAGTACAAGGGGTTCGACCGTGCCTTG
CTAGGGAGGATAGTGTGGTTTGGCTCCCGAGTAGGTCTAGGGGTTTCTTTGTCTCCAGTGCTTGGGAGGCTATTAGGCCTCGTTCTTCGATTGTTCCTTGGTTTCGCTTG
TTATGGTTTGGGGGTAATATCCCTAAGCACTCCTTTATTGCTTGGCTTGGGGTGCAGGATAGGCTCTATACGAGGGACCGGTTGCATCGATGGGATTCTTCGAGCCCTAC
TGATTGTCTGTTATGCTTTGGTGCCCCGGAGTCCCGAGATCATCTGCTTTTTGAGTGCCCTTTCAGCCAGGTCGTATGGGGTGGGATCTTGAGTTGGGGTGACTGTTCTC
ATAGGATTGCCTCATGGGATACTGAGCTGTCTTGGATCTTCCAGTTTAGTGGGGGCAAGCGTCGTAGCTGCCGAGTGTGGAGGGTTGCCTGGTGTGCCTCGATCTACCAC
ATTTGGAGGGAGAGGAATCTGCGTCTCCATGGTTCGGGTCCTCGCTCTCCTCAGACTGTCTTACATGTGGTTCAGGCTGATATGTTCGGTCGTTGTGCCTCCTGGCTAGG
GGGTGTGTCTTCTCCCCTACCAGGGGCGTCTCCCTGGAGCCTGTTTGCCTGA
Protein sequenceShow/hide protein sequence
MVPRGLSVVASAIGKPVCLDKATKERRRLSFARICVEIGPRDELPSTVEVCIRGQNFVVVVDYSWKPKRCAPCGVFGHSSGSCPESVGVRDEVTVKPVDLPSTVPSSSAP
PAGGRRGSTSPSPCVSPVGSKRVGPASPIKRVGASSPSGHSSSGNSFAMLKGEDEFALAVALKSVLRRKFGCSIAVISRQVVEARRSMEDAQVAVGLDPLSSSLVDQAAR
ASQSFWECVKARVIRNGLNSLMDEGGNVLTGQSDIARVVVDYYKGINATSITLVPKRPSPEGLGDYRPISCCNVLYKCITKILANRGPPRCVLKVDLQKAYDSVQWDFLF
GLLLAVGTPIQFVSWVRACFSSPMFSVSVNGSLEGFFLGRRGLKQGDPLSPYLFVMVMEVLSRMLNAPPDGFGFHFRCQKVGLTHLCFADDLMIFSAGDRSFISFVQDCL
CRFQVLSGLVANPGKSSLFCAGVSRVVAEDLASFIGVSLASLPVRYLGLPLISGRLTYRDCKPLLERITARISYWSARVLSYAGRLQLVQSVLQSFQVFWASVFILPARV
THEVYRLLRSYLWKGDPALHGRAKVAWSEVCLPREEGGLGIRHVRSWNEAVVSLLWLLMLKSGSLWVAWVEAYILCGRSLWTVRPSPRWSWCWRAILSLRDQFRSLIRFR
IGDGRRCLTIVDPWLPEGSIIPRFSERVIYDAASSLYAPVADFLLPDGSWRWPSVSLEVLLLLPSVQGVRPCLAREDSVVWLPSRSRGFFVSSAWEAIRPRSSIVPWFRL
LWFGGNIPKHSFIAWLGVQDRLYTRDRLHRWDSSSPTDCLLCFGAPESRDHLLFECPFSQVVWGGILSWGDCSHRIASWDTELSWIFQFSGGKRRSCRVWRVAWCASIYH
IWRERNLRLHGSGPRSPQTVLHVVQADMFGRCASWLGGVSSPLPGASPWSLFA