CuGenDBv2

Lcy06g012760 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy06g012760
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: Chr06:17110703..17114067
RNA-Seq Expression: Lcy06g012760
Synteny: Lcy06g012760
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain; IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits | e value | %identity | Alignment
XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber] | 4.9e-160 | 37.42
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        RLS  +SLPW+  GDFNE+ +  EKEGGR+R  +QM  F E + SC LRDLG+ G  FTW +      W+RE+LDR L ++       +  + H     S
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRR----RPLKLEESWLKFPVSRNIIKDCW-KAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKE
        +H ++L +     D+ S SRR    +P + E  WLK     +++   W K    +  +S +  ++ C + L++WNK+ + G + K +   + ++E +E +
Subjt:  NHRIILAQLQFQGDARSNSRR----RPLKLEESWLKFPVSRNIIKDCW-KAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKE

Query:  TKGFP--TEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP-----
         KG P   E++      L  LLE EE+ W  RSR  WL+ GD+NT  FH KAS R ++N I  I+D +G W E+ E I K+ VE+F  LF +S P     
Subjt:  TKGFP--TEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP-----

Query:  -----------------------------------------------------------------------------NE---------------------
                                                                                     NE                     
Subjt:  -----------------------------------------------------------------------------NE---------------------

Query:  ----------RSIEDVLK------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMA
                  ++I D LK            AFV +RLI+DNV++  E +H ++ +R GK G +A+KLDMSKAYDRVEW  + +IM KLGF E W+  IM 
Subjt:  ----------RSIEDVLK------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMA

Query:  CIESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEE
        C+ SV Y V ING PQG I P R LRQGDPLSPYLFLFC E L A+ +Q  +    RG+  ++  P LSHLFF DDSLI  +AT ++C  I+ ILK YE+
Subjt:  CIESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEE

Query:  ASGQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCF
        +SGQ +N  K+    S+N   +    ++ + G Q  K    YLG+PS  G++K+  F +LK+++ K L GWKEKL S  GKEVL K +A+A+PT TMSCF
Subjt:  ASGQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCF

Query:  KLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTW
        K+PN +CD++  + ++FWWG    + K  WLSW KLC  KD+GG+GFR LK FN+ LLAK  WR+  HPNSL  +V K +YF   +F  A LG  PS  W
Subjt:  KLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTW

Query:  RSILWGRELFKSGYRWKIGNGQHIKINQDPWI---AIQGVSTPLRVSELLKENFVASLLD-EGGRWKEDLILAEFCGVDSID-ILNTPTGGRNYKDEIIW
        RSI+  +E+ K G +W++GNG  I +  D W+   A Q + +P     L  +  V++L+D E   W E L+  +  G +  D +L  P       D  IW
Subjt:  RSILWGRELFKSGYRWKIGNGQHIKINQDPWI---AIQGVSTPLRVSELLKENFVASLLD-EGGRWKEDLILAEFCGVDSID-ILNTPTGGRNYKDEIIW

Query:  KCDPKG
          +PKG
Subjt:  KCDPKG

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber] | 6.6e-157 | 33.24
Query:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN
        L+ +  LPWL  GDFNEI +++EK GG  R Q QMD F +++  CG  DLG+ G  +TW    +G + I  +LDR L   D       + V+HL     +
Subjt:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN

Query:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLG---NDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIE-KETK
        H    A L      R   R +    E  W K    + II+  W   +     +  S +L+I  C V+L+ W+ + + G I K ++ K+  +  +  +E  
Subjt:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLG---NDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIE-KETK

Query:  GFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP---------
           + ++ +  + + +LL++EE YW  R++  WL+ GDRNTK FHA+AS+R+K+N IVGI D+ G W +NEE I++ A+ +F ++++SS+P         
Subjt:  GFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP---------

Query:  ---------NERSIEDVLK---------------------------------------------------------------------------------
                 NE  I +  K                                                                                 
Subjt:  ---------NERSIEDVLK---------------------------------------------------------------------------------

Query:  ---------------------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIE
                                   AF   RLI+DNV++ FE +H L+++  GKEG +A+KLDMSKA+DRVEW  I+K+ME++GF   W   +M CI 
Subjt:  ---------------------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIE

Query:  SVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASG
        SV Y ++ING   G+I P R LRQGDPLSP LFL C E L AL+NQ  R     G+ IN+ CP ++HLFF DDS++ C+A  ++C  ++ IL  YEEASG
Subjt:  SVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASG

Query:  QTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLP
        Q IN DKS    S N  ++    +  ILG   +     YLG+PS  G++KS VF  LK+++   L GWK KL S GGKE+L K +AQAIPT TMSCF LP
Subjt:  QTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLP

Query:  NFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSI
          +CDD+ R+   FWWG  + + K  W+SW ++C SK  GGLGFR LK FN  +LAK +WRIL +PNSL+ +VLK RYF     LNA LG+ PS +WRSI
Subjt:  NFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSI

Query:  LWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENF----VASLLDEGGR-WKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIW---
            E+ + G RW++GNG+ I I +D W+      +  +V      NF    V+SL+D   + WK + + + F   +   IL  P      +D++IW   
Subjt:  LWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENF----VASLLDEGGR-WKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIW---

Query:  ------------------------KC---DP-------------------------------------KGMTCEET----------TSHAMWSCKLAKKV
                                +C   DP                                     +G+ C  T           +HA+  C+ A  V
Subjt:  ------------------------KC---DP-------------------------------------KGMTCEET----------TSHAMWSCKLAKKV

Query:  WIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVHNAINSDLN-SIIRAIESRRSEGLTSQSSNLEEPLPRLESQLSLV
        W ++     +    N    S  D    L  +   + LEL  ++ W IW +RNKIVHN  +S L+ S +  + +   E     +S     L  +  + S +
Subjt:  WIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVHNAINSDLN-SIIRAIESRRSEGLTSQSSNLEEPLPRLESQLSLV

Query:  SWIPPPLGSWKINVDASWS
         W  PPLG +K+NVD + S
Subjt:  SWIPPPLGSWKINVDASWS

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis] | 2.3e-157 | 31.18
Query:  MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH
        + ++ + +  PWLIGGDFNEI    EKEGG  R  RQM+ F   +E C L DL F G  FTWR G +G   I+ +LDRF+       +     V HL   
Subjt:  MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH

Query:  KSNHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDS-ASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENI-EKET
        KS+H  IL +++     R   R+R  + EE WL      N++KD W++  GND   +  ++I++    L  W+  +  G +K  +E  + ++    +K  
Subjt:  KSNHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDS-ASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENI-EKET

Query:  KGFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERSI---
          +P E+ L+ E  L  LL  E  YW+ RSR  WL  GD NT+ FH +AS RKK+N I G+ + DG W   + D+  + +++FG LF++S P    +   
Subjt:  KGFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERSI---

Query:  -------------------------------------------------------------------EDVLK----------------------------
                                                                           ED L+                            
Subjt:  -------------------------------------------------------------------EDVLK----------------------------

Query:  ----------------------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACI
                                    AFVP R ISDN +L FE  H L  R GG  G+ A+KLDMSKAYDRVEW  I  +M  +GF + WI  IM C+
Subjt:  ----------------------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACI

Query:  ESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEAS
         +V Y  ++NG P+G +IP R LRQGD +SPYLFL C E L  +L+ EE  +   G+ I    PS++HLFF DDS +  +A  ++C  +K ILK YE+AS
Subjt:  ESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEAS

Query:  GQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKL
        GQ +N  KS    SKN+       L  + G++       YLG+P++   +K+  F+ + ++    +K WK+K  S  GKEV+ K++ Q++PT  MSCF+L
Subjt:  GQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKL

Query:  PNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRS
        P  +C +++R  A+FWWG S    K HWL+W K+C  K++GGLGFR ++ FNQ LLAK  WRIL+HP+SLL K LK +YF    F++A +    S TWRS
Subjt:  PNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRS

Query:  ILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPL-RVSELLKENFVASLLDEGGR-----WKEDLILAEFCGVDSIDIL-NTPTGGRNYKDEIIW
        ++ G+ L + G R+++G+G  I +  DPWI       P   V E L++  VA L+D   +     W E+L  A     D +D++   P   RN +D +IW
Subjt:  ILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPL-RVSELLKENFVASLLDEGGR-----WKEDLILAEFCGVDSIDIL-NTPTGGRNYKDEIIW

Query:  KCDPKGM----------------------------------------------------------------------------TCE-ETTSHAMWSCKLA
          D +G+                                                                             CE ETT H    C + 
Subjt:  KCDPKGM----------------------------------------------------------------------------TCE-ETTSHAMWSCKLA

Query:  KKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEI---EELELAILILWQIWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLES
          +W++      S   L  +  + +   +W+   +++    ++++  ++LW IWS RNK+V N    +    +       SE             PR   
Subjt:  KKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEI---EELELAILILWQIWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPLPRLES

Query:  QLSLVSWIPPPLGSWKINVDASWSVALSAGGI
          +   W+ PP G  KINVD ++      GGI
Subjt:  QLSLVSWIPPPLGSWKINVDASWSVALSAGGI

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata] | 4.0e-154 | 36.99
Query:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN
        L   +S+PWL  GDFNEI+ L+EKEGGR R +RQM+ F + +  CG R++ F G  +TW         IRE+LDR L N + + +     + HL    S+
Subjt:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN

Query:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDS-ASFHLK--IQRCLVKLASWNKNRLQGSIKKALE-YKKLEIENIEKETK
        H  +   L      +    R+  + E  WLK      I+K  W+  +G  S A   LK  ++ C   L  WNK       +K  E  +KLE   ++  + 
Subjt:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDS-ASFHLK--IQRCLVKLASWNKNRLQGSIKKALE-YKKLEIENIEKETK

Query:  GFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERS-----
        G   E L      L   LE+E+  W+ RSR +W + GDRNT  FHAKAS R +KN I GI D+ G W E+E  I +VAV +F  LF SS P E S     
Subjt:  GFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERS-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------IEDVLKAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIE
                             I D   AFV  RLI+DNV++ FE +H ++ ++GGK G +A+KLDMSKAYDRVEW+ + KIMEKLGF  +    IM CI 
Subjt:  ---------------------IEDVLKAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIE

Query:  SVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASG
        +V Y + ING P+G IIP R +RQGDPLSPYLFL C E L AL+          G+ I +  P LSHLFF DDSLI C+ATI +C  ++ +L  YE+ASG
Subjt:  SVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASG

Query:  QTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLP
        Q +N  K+    S N  ++    ++   G Q  K    YLG+PS  GKNK   F  +K+++ K L GWKEKL S  GKE+L K +A A+PT TMSCFKLP
Subjt:  QTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLP

Query:  NFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSI
        + +CD++  +  KFWWG   ++N+  WLSW K+C+SK  GG+GF+ LKLFN  LLAK  WR+    +SL+ +VLK +YF    F++A LG  PS +WRSI
Subjt:  NFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSI

Query:  LWGRELFKSGYRWKIGNGQHIKINQDPWIAI---QGVSTPLRVSELLKENFVASLLD-EGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDP
        +  + L K G +W++GNG  I++ +D W+       V TP     L  +  VA LLD E G W+ ++I   F   ++  I + P   R   D++IW   P
Subjt:  LWGRELFKSGYRWKIGNGQHIKINQDPWIAI---QGVSTPLRVSELLKENFVASLLD-EGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDP

Query:  KGM
         G+
Subjt:  KGM

XP_030939647.1 uncharacterized protein LOC115964488 [Quercus lobata] | 1.2e-155 | 39.7
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTW--RKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH
        +LS    LPW+  GDFNEI + SEKEGG AR + QM +F E +  C LRD+G+SG  FTW  R GS+G  W+RE+LDR   + +      R  + H+   
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTW--RKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYH

Query:  KSNHRIILAQLQFQGDARSNSRRRPL-KLEESWLKFPVSRNIIKDCW-KAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENI-EKE
         S+H +++  L+     R N RR  L + E  WL+      I+   W +   G     F   ++ C   L SWNKN   G++ + +   + +I+ + EK 
Subjt:  KSNHRIILAQLQFQGDARSNSRRRPL-KLEESWLKFPVSRNIIKDCW-KAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENI-EKE

Query:  TKGFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP-------
              E + + +  L  ++  EE  W  RSR  WL+ GD NT  FH KAS R ++N I  I D +G W + +  I  V VE+F  LF SS P       
Subjt:  TKGFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP-------

Query:  -----------NERSIED--------VLKAFVP-------KRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESW
                   N   I++         LK   P       +RLI+DNV++  E ++ +N +R GK G +A+KLDMSKAYDRVEW  +  IM KL F E W
Subjt:  -----------NERSIED--------VLKAFVP-------KRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESW

Query:  IIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGI
        I  IM+C+ SV Y V +NG P G I P R LRQGDPLSPYLF+   E LYALL++  R    +G+  + R P +SHLFF DDSLI  RAT  +C  I+ +
Subjt:  IIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGI

Query:  LKTYEEASGQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPT
        LK YE +S Q +N  K+    S N   +    L+ +   Q  ++   YLG+PS  GK+K  +F +LK R+   + GWKEKL S+ GKEVL K +AQA+P+
Subjt:  LKTYEEASGQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPT

Query:  NTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGA
         TMSCFKLPN +C+++  +  +FWWG    + K  W+SW K+C  K+R G+GFR LK FN  LLAK  WR+  + +SL  +V K +YF    F++A LG 
Subjt:  NTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGA

Query:  IPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWI---AIQGVSTPLRVSELLKENFVASLLD-EGGRWKEDLILAEFCGVDSIDILNTPTGGRNYK
         PS  WRSI   + L + G RW++GNG+ I+I  D W+       V TP R +   +   V+ L+D E   WK D++   F   D   IL+ P      +
Subjt:  IPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWI---AIQGVSTPLRVSELLKENFVASLLD-EGGRWKEDLILAEFCGVDSIDILNTPTGGRNYK

Query:  DEIIWKCDPKG
        D+I+W  D  G
Subjt:  DEIIWKCDPKG

TrEMBL top hits | e value | %identity | Alignment
A0A2N9GRT8 Reverse transcriptase domain-containing protein | 1.9e-157 | 35.37
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        +L    +LPWL  GDFNEI ++ E+ G      R M  F   +   GL DLGF G  FTW    +G++ I+++LDR + N   +   N   V+H+    S
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHL--KIQRCLVKLASWNKNRLQGSIKKALEYKKLEIEN-IEKETK
        +H  +L  L       S  +RRP K EE W   P    II+  W   +   S  F L  KI++C   L  W K+ + G     ++     + + I     
Subjt:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHL--KIQRCLVKLASWNKNRLQGSIKKALEYKKLEIEN-IEKETK

Query:  GFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERS-----
        G    ++   +  +  LL  EE++W+ RSR  WL  GD NTK FH +A QR++ N IVG+ + +  W   EE +  + V +F ++F +S P + S     
Subjt:  GFPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERS-----

Query:  ------------------------------------IEDVLKAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKL
                                             +D   AFVP RLI+DNV + FE IH L  +R GK+G +A+KLDMSKAYDRVEW  +  IM ++
Subjt:  ------------------------------------IEDVLKAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKL

Query:  GFVESWIIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDC
        GF E WI  +M CI +V Y V+I+G P+G I P R +RQGDPLSPY+FL C E L A+L +     H +GL++ +  P +SHLFF DDSL+  +ATI++C
Subjt:  GFVESWIIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDC

Query:  ITIKGILKTYEEASGQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTI
          +  IL  YE +SGQ IN DK+    S N  +D    ++   G Q   +   YLG+P+  G++K  +F  LK+RI + L+GWKE+  S  G+E+L K +
Subjt:  ITIKGILKTYEEASGQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTI

Query:  AQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFL
        AQAIPT  M+CF+LP   CD++N + A++WWG    + K HW+ W KLC +K  GGLGFR L  FN  LLAK  WRIL +P SL  +V K RYF   SF+
Subjt:  AQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFL

Query:  NALLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAI-QGVSTPLRVSELL-------KENFVASLLDEGGRWKEDLILA------EFCGV
        +A LG+ PS  WRS LWGR+    G  W+   GQ ++     W A   G+ T     ++L       K    +   +    W+++  LA       F   
Subjt:  NALLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAI-QGVSTPLRVSELL-------KENFVASLLDEGGRWKEDLILA------EFCGV

Query:  DSIDILNTPTGGRNYKDEIIWKCDPKGMTC---EETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSH
           + L  PT  + ++ +I  +  P    C   EE+T HA+W C +A+  W    ++   + ++  +    S +  W+ +    EE+E   +  W IW+ 
Subjt:  DSIDILNTPTGGRNYKDEIIWKCDPKGMTC---EETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSH

Query:  RNKIVHNAINSDLNSIIRAIESRRSEGLTSQSS
        RN  V         +I     S R + + ++SS
Subjt:  RNKIVHNAINSDLNSIIRAIESRRSEGLTSQSS

A0A2N9HE04 Reverse transcriptase domain-containing protein | 9.2e-165 | 35.69
Query:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS
        +L+   SLPWL  GDFNEI   +EK G R R  R+M  F E++  C   DLG+ G  FTW       ++++E+LDR +       + N + V HL   KS
Subjt:  RLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKS

Query:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHL--KIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKG
        +H  IL +   Q  +++ ++RR  + EE W   P    +I+  W+  +G  S  F L  KI+RC + LA W+K    GS +  +  +   +E +  +  G
Subjt:  NHRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHL--KIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKG

Query:  FPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPN--ERSIEDV
             +   ++ + SLL  +E++WK RSR  WL+ GD NTK FH  A+QR++ N+I G+ ++ G W+     +  ++ ++F D+F SS P   E ++E V
Subjt:  FPTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPN--ERSIEDV

Query:  LK-------------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVV
         +                   AFVP RLI+DN+++ +E I++L ++R G+ G +A+KLDMSKAYDRVEW  + +IM+K+GF   WI  +M C++S  Y +
Subjt:  LK-------------------AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVV

Query:  VINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHD
        ++NG P G + P R +RQGDPLSPYLFL C E   ALL + ER     G+ + +  P +SHL F DDSL+ C+A +++C  +  IL  YEE+SGQ IN D
Subjt:  VINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHD

Query:  KSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFMCDD
        K+    S+N  E+K   ++   G Q       YLG+P+  G++K   FK LKDRI + ++GW E+  S  G+EVL K +AQAIPT TMSCF LP   C D
Subjt:  KSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFMCDD

Query:  INRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSILWGREL
        ++ + A FWWG S   NK HW +W K+C  K++GG+GFR L  FNQ +LAK  WR L+  +SL+ +V K +YF   S + A LG  PS  WRS+L GR+ 
Subjt:  INRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSILWGREL

Query:  FKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENFVASLLDEGGRWKE------------------DLILAEFCGVDSIDILNT---------
           G +WK+G+G+ I + +D W+ +    TP    +      V  L+DE    K+                  +++L E     S +  N          
Subjt:  FKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENFVASLLDEGGRWKE------------------DLILAEFCGVDSIDILNT---------

Query:  ----------------------PTGGRNYKDEII--WKCDPKGMTCE---ETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEI
                              PT    YK +II    C+     CE   ETTSH +W+C  A  VW     +   L +  +   +  +    L   ++ 
Subjt:  ----------------------PTGGRNYKDEII--WKCDPKGMTCE---ETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEI

Query:  EELELAILILWQIWSHRNKIVHNAINSDLNSII
        EE+E   ++ W +W+ RN+ +H  + S L  I+
Subjt:  EELELAILILWQIWSHRNKIVHNAINSDLNSII

A0A7N2L6Z9 Reverse transcriptase domain-containing protein | 1.5e-159 | 37.18
Query:  LPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSNHRIILA
        LPW   GDFNEI ++ EK GG  R Q QMD F  ++ SCG +DLG+SG  +TW    +G+  I  +LDR L   D ++    + V+HL    S+H  +  
Subjt:  LPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSNHRIILA

Query:  QLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLG-NDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKGFPTEKLLQ
                    R R    E  W K    R II+  W +    N        ++ C  +LASWN + L+   K   E +K+  +  E++  GF   ++  
Subjt:  QLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLG-NDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKGFPTEKLLQ

Query:  AEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPN-----------------
          + L  LL++EE++W  RS+  WL+ GDRNTK FHA+AS+R+K+N I G+ DK G W E+ + I+  AV +F D++++S P+                 
Subjt:  AEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPN-----------------

Query:  -------------------------------------------------------------------------------------------------ERS
                                                                                                          ++
Subjt:  -------------------------------------------------------------------------------------------------ERS

Query:  IEDVLKAFVP------------KRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVVVIN
        + + LKAF+P             RLI+DNV++ +E +H L +++ GK+  +A KLDMSKA+DRVEW  I ++M K+GF E WI  IM CI SV Y V+IN
Subjt:  IEDVLKAFVP------------KRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVVVIN

Query:  GSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHDKSM
        G   G+I+P R LRQGDPLSPYLFL C E L ALL+   R     G+ + + CP ++HLFF DDSL+ C+A  ++C  +K IL+ YE ASGQ +N DKS 
Subjt:  GSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHDKSM

Query:  FMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFMCDDINR
           S N   +    +  ILG         YLG+PS  G++K +VF  +K+R+   L GWK KL S+GGKE+L K +AQAIPT TMSCF LP  +CD++ +
Subjt:  FMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFMCDDINR

Query:  VCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSILWGRELFKS
        +   FWWG  + ++K  W+SW K+CK K  GGLGFR L  FN  LLAK +WRIL +P SL  ++LK +YF     LNA LG+ PS TWRSI    E+ K 
Subjt:  VCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSILWGRELFKS

Query:  GYRWKIGNGQHIKINQDPWI---AIQGVSTPLRVSELLKENFVASLLDEGGR-WKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKG
        G RW++GNG+ I I  D W+   +   V TP R++E      V+SL+D   R WK D I A F  VD+  IL  P       D IIW  + KG
Subjt:  GYRWKIGNGQHIKINQDPWI---AIQGVSTPLRVSELLKENFVASLLDEGGR-WKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKG

A0A7N2LIH6 Uncharacterized protein | 1.1e-157 | 34.39
Query:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN
        L+ +  +PWL+ GDFNEI +  EK G + R   QMD F E++  CGL DLGF G  FTW  G  G      +LDR + N     M     V+H+    S+
Subjt:  LSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSN

Query:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKGFPT
        H ++   L    + R   +R     EE W +    + I++  W  +  + +     +++RC   L  WN+N   G++ K ++ KK  ++ +E       T
Subjt:  HRIILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKGFPT

Query:  -EKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP----------NE
         E++   +K +  L   EE+ WK RSR  WL++GD+N+K FHA ASQR++KN I G+ D  G W E++E   K+ +++F D+++S+ P          +E
Subjt:  -EKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYP----------NE

Query:  R---------------------------------------------------------------------------------------------------
        R                                                                                                   
Subjt:  R---------------------------------------------------------------------------------------------------

Query:  -----------------SIEDVLKAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVH
                          I++   AFVP R+I+DNVI+ FE +H++N RR GKEG +A+KLDMSKAYDRVEW  +  +M+K+GF + WI  IM C+ SV 
Subjt:  -----------------SIEDVLKAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVH

Query:  YVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTI
        + V+ING P+GS  P R LRQGDP+SPYLFL C E L A++ ++ER    RG+   ++ P +SHLFF DDS+I CRAT+ +C  +  +L+ YEE SGQ +
Subjt:  YVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTI

Query:  NHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFM
        N DK+    S+N K++     + I G Q  +    YLG+P   G+ K   F R+KD++ + + GWK KL S  G+EVL K +AQA PT TM+ FKLP+ +
Subjt:  NHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFM

Query:  CDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSILWG
        C ++N +   FWWG    + K  W+SW  LCK K  GG+GF+ LK FN  LLAK  WR+ ++PNSL  +VLK +YF   SF+ A LG  PS  WRSI+  
Subjt:  CDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSILWG

Query:  RELFKSGYRWKIGNGQHIKINQDPWI--AIQGVSTPLRVSELLKENFVASLLDEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMTC
        + + K G RW +G+G+ I+I    W+     G     R   +  E   + +  E G WK  L+   F   ++ +IL+ P    N  D ++W   P G   
Subjt:  RELFKSGYRWKIGNGQHIKINQDPWI--AIQGVSTPLRVSELLKENFVASLLDEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGMTC

Query:  EETTSHAMWSCKL
         ++     + C L
Subjt:  EETTSHAMWSCKL

A0A803QC75 Uncharacterized protein | 3.3e-162 | 32.25
Query:  LPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSNHRIILA
        LPWL  GDFNEI + ++K GG  R +  MD F   ++ C L+++ ++GD FTW K       ++E+LD    NN      +   V+HL Y  S+HR +  
Subjt:  LPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSNHRIILA

Query:  QLQF----QGDARSNSRRRPLKLEESWLKFPVSRNIIKDCW-KAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKK---ALEYKKLEIENIEKETKGF
           F    +  ARS SR    + E+ WL  P S  II   W  +F+ +   +    +  C   L SW+  +  G +KK   +L+ K  ++ N    +   
Subjt:  QLQF----QGDARSNSRRRPLKLEESWLKFPVSRNIIKDCW-KAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKK---ALEYKKLEIENIEKETKGF

Query:  PTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERSIEDVL--
          ++L  AE  LE LLE+EE+YW+ RSR DWL  GDRNTK FHAKAS RK  N+I  + +  G  V ++ DI+ V   F+ DLF+S+  +E ++   L  
Subjt:  PTEKLLQAEKGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERSIEDVL--

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------KAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIE
                                   AF+P RLI+DNV++ FE +HA+ N+  G+ G  + KLDMSKA+DRVEW  I ++M K+GF E WI  IM+C+ 
Subjt:  --------------------------KAFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIE

Query:  SVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASG
        + ++  +ING   G++ P R LRQG PLSPYLFL C E    LL  E+  N+  G K+ +  P ++HLFF DDSL+ C+A  + C+ IK +L TY +ASG
Subjt:  SVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASG

Query:  QTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLP
        Q +N DKS+   S N          + L +   +    YLG+PS +G++K  +F  +K+RIWK +  W EK+FSAGGKE+L K + Q+IPT  MSCF+LP
Subjt:  QTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLP

Query:  NFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSI
         + C  +  + A FWWGL+ + ++ HW SW  LCKSK  GG+GFR    FNQ LLAK +WRI + P+SLL ++LK RYF   +FL A LG  PSLTW+ I
Subjt:  NFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIPSLTWRSI

Query:  LWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENFVASLLDEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGM-
         W REL   G RWK+G+G+HI+   DPWI       P   S       V++L+ +  +W   L+   F  +D   IL+ P    + +D +IW     G+ 
Subjt:  LWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENFVASLLDEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPKGM-

Query:  -------------------------------------------------------------------TCE------ETTSHAMWSCKLAKKVWIYFIILM
                                                                           TC       E+  HA++SCK AK VW +  ++ 
Subjt:  -------------------------------------------------------------------TCE------ETTSHAMWSCKLAKKVWIYFIILM

Query:  SSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVH-------------------NAINSD----LNSIIRAIESRRSEGLTSQSSNL
           +RL+  A    DY   LS      E+E     LW IW+ RN+IVH                   N  N+     LN +         E L S    +
Subjt:  SSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVH-------------------NAINSD----LNSIIRAIESRRSEGLTSQSSNL

Query:  EEPLPRLESQLSLVSWIPPPLGSWKINVDASWSVALSAGGI
            P+ +   +   W PP   S+K+NVDA+  V     GI
Subjt:  EEPLPRLESQLSLVSWIPPPLGSWKINVDASWSVALSAGGI

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein; e value 6.4e-14; identity 22.78%
Query:  FVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPL
        F+P      N+      I  +N  R   + H+ + +D  KA+D+++   + K + KLG    ++  I A  +     +++NG    +   +   RQG PL
Subjt:  FVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPL

Query:  SPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHDKSMFMASKNIKEDKVAMLQRIL
        SP LF   +E L   + QE+ +   +G+++ K    LS   F DD ++     I     +  ++  + + SG  IN  KS      N ++ +  ++  + 
Subjt:  SPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHDKSMFMASKNIKEDKVAMLQRIL

Query:  GIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALK----GWKEKLFSAGGKEVLFK--TIAQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSK
           ASK +  YLG+  Q  ++   +FK     + K +K     WK    S  G+  + K   + + I        KLP     ++ +   KF W    ++
Subjt:  GIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALK----GWKEKLFSAGGKEVLFK--TIAQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSK

Query:  NKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISW
             ++ + L +    GG+     KL+ +  + K +W
Subjt:  NKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISW

P0C2F6 Putative ribonuclease H protein At1g65750; e value 6.0e-28; identity 34.02%
Query:  MPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGG
        MP    +     F  + +R+   + GW+EK  S  G+  L K +  ++P ++MS   LP  + + ++++   F WG ++ K K H + W+K+C  K  GG
Subjt:  MPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFKTIAQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGG

Query:  LGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIP----SLTWRSILWG-RELFKSGYRWKIGNGQHIKINQDPWIA
        LG R  K  N+ L++K+ WR+L+  NSL T VL+ +Y  GE  +      IP    S TWRSI  G R++   G  W  G+GQ I+   D W++
Subjt:  LGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNALLGAIP----SLTWRSILWG-RELFKSGYRWKIGNGQHIKINQDPWIA

P11369 LINE-1 retrotransposable element ORF2 protein; e value 1.3e-17; identity 25.22%
Query:  FVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPL
        F+P      N+      IH +N  +   + H+ + LD  KA+D+++   + K++E+ G    ++  I A        + +NG    +I  +   RQG PL
Subjt:  FVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPL

Query:  SPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHDKSM-FMASKNIKEDKVAMLQRI
        SPYLF   +E L   + Q++ +   +G++I K    +S L   DD ++           +  ++ ++ E  G  IN +KSM F+ +KN + +K       
Subjt:  SPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHDKSM-FMASKNIKEDKVAMLQRI

Query:  LGIQAS--KSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFK--TIAQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKN
          I  +  K LG  L    ++  +K+  FK LK  I + L+ WK+   S  G+  + K   + +AI        K+P    +++     KF W      N
Subjt:  LGIQAS--KSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGGKEVLFK--TIAQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKN

Query:  KAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISW
        K   ++ + L   +  GG+    LKL+ + ++ K +W
Subjt:  KAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISW

P92555 Uncharacterized mitochondrial protein AtMg01250; e value 7.1e-13; identity 43.06%
Query:  HYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDS
        + + +ING+PQG + P R LRQGDPLSPYLF+ C E L  L  + +      G++++   P ++HL F DD+
Subjt:  HYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDS

P93295 Uncharacterized mitochondrial protein AtMg00310; e value 3.6e-33; identity 45.39%
Query:  AIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSK-DRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLN
        A+P   MSCF+L   +C  +     +FWW    +K K  W++W KLCKSK D GGLGFR L  FNQ LLAK S+RI+  P++LL+++L+ RYF   S + 
Subjt:  AIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSK-DRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLN

Query:  ALLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPL
          +G  PS  WRSI+ GREL   G    IG+G H K+  D WI  +    PL
Subjt:  ALLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPL

Arabidopsis top hits (e value, % identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein; e value 1.9e-05; identity 25.17%
Query:  LIGGDFNEISNLSEKEG--GRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNND-MVQMCNRLGVNHLGYHKSNHR---I
        ++ GDF++I+  S+       +   R ++ F   +    L D+   G  +TW      +  IR KLDR + N D      + + V  L    S+H    I
Subjt:  LIGGDFNEISNLSEKEG--GRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNND-MVQMCNRLGVNHLGYHKSNHR---I

Query:  ILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASF----HLK-IQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKGF
        IL  L      RS    R      +   F VS  +    W+  +   S  F    HLK  ++C   L       +Q   K+AL+     +E+I+ +    
Subjt:  ILAQLQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASF----HLK-IQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKGF

Query:  PTEKLLQAE----KGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFAS
        P++ L + E    K         E +++ +SR  WL+ GD NT+ FH      + KN I  ++  D   VEN   + ++ V ++  L  S
Subjt:  PTEKLLQAE----KGLESLLEEEEMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFAS

AT4G20520.1 RNA binding; RNA-directed DNA polymerases; e value 5.4e-08; identity 35.29%
Query:  AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKI
        +F+P R+ +DN++   E +H++  ++G K G + +KLD+ KAYDR+ W  +   +   GF E W+ +I
Subjt:  AFVPKRLISDNVILGFECIHALNNRRGGKEGHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKI

AT4G29090.1 Ribonuclease H-like superfamily protein; e value 2.5e-37; identity 37.27%
Query:  AIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNA
        A+PT TM+CF LP  +C  I  V A FWW         HW +W  L   K  GG+GF+ ++ FN  LL K  WR+L  P SL+ KV K RYF     LNA
Subjt:  AIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNA

Query:  LLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENF--------VASLLDEGGR-WKEDLILAEFCGVDSIDILN
         LG+ PS  W+SI   +E+ + G R  +GNG+ I I +  W+  +  S  LR+  +  + +        V+ L+DE GR W++D+I   F  V+   I  
Subjt:  LLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENF--------VASLLDEGGR-WKEDLILAEFCGVDSIDILN

Query:  TPTGGRNYKDEIIWKCDPKG
           GGR   D   W     G
Subjt:  TPTGGRNYKDEIIWKCDPKG

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 2.6e-34; identity 45.39%
Query:  AIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSK-DRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLN
        A+P   MSCF+L   +C  +     +FWW    +K K  W++W KLCKSK D GGLGFR L  FNQ LLAK S+RI+  P++LL+++L+ RYF   S + 
Subjt:  AIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSK-DRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLN

Query:  ALLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPL
          +G  PS  WRSI+ GREL   G    IG+G H K+  D WI  +    PL
Subjt:  ALLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase); e value 5.1e-14; identity 43.06%
Query:  HYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDS
        + + +ING+PQG + P R LRQGDPLSPYLF+ C E L  L  + +      G++++   P ++HL F DD+
Subjt:  HYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSHLFFVDDS


Sequences
CDS sequence
ATGGTTAGGCTGAGTCACAGATCATCTCTCCCTTGGCTTATAGGGGGAGACTTCAATGAAATATCGAACCTCTCTGAGAAGGAGGGAGGAAGAGCTCGAGTGCAACGCCA
AATGGATTTGTTCAATGAGATGATGGAAAGCTGCGGTTTAAGGGATTTGGGTTTCTCGGGTGACATATTTACTTGGAGGAAAGGTAGCAAGGGTAGTAGTTGGATCAGGG
AGAAGCTAGACAGGTTCCTAGGCAATAATGATATGGTGCAAATGTGTAACAGATTGGGGGTCAATCACTTAGGTTATCACAAATCTAATCATAGGATCATTTTAGCCCAA
CTCCAGTTTCAAGGCGATGCCAGATCAAACTCCCGTAGAAGGCCTTTAAAGCTTGAAGAATCTTGGTTAAAATTCCCTGTAAGCAGGAATATCATTAAAGACTGCTGGAA
GGCTTTCCTTGGAAATGATTCAGCTTCATTCCATCTTAAGATTCAAAGGTGCTTAGTTAAGCTTGCAAGTTGGAATAAAAACAGATTGCAAGGATCCATAAAAAAAGCCT
TGGAGTATAAAAAGCTGGAGATTGAGAACATTGAGAAAGAAACCAAAGGCTTCCCTACCGAGAAACTCTTACAAGCAGAAAAAGGGCTGGAGAGTTTGCTGGAAGAGGAA
GAAATGTACTGGAAAATTAGATCCAGGGAAGATTGGCTCCGCTGGGGGGATAGGAACACAAAACGGTTCCATGCTAAAGCCTCCCAAAGGAAAAAGAAAAATGAGATTGT
GGGGATTAAAGATAAGGATGGGAGTTGGGTGGAAAATGAAGAGGATATCAGCAAGGTAGCCGTTGAGTTCTTTGGTGACCTATTTGCTTCTTCTTACCCGAATGAGAGAA
GTATAGAGGATGTGTTAAAAGCGTTTGTTCCCAAAAGACTCATTTCTGACAATGTGATTCTTGGGTTCGAATGCATTCATGCCCTAAACAACAGAAGAGGAGGAAAGGAA
GGCCACCTAGCAATGAAATTGGATATGAGCAAAGCCTACGACAGGGTGGAGTGGATACTCATTAGTAAAATCATGGAGAAGTTAGGGTTCGTGGAAAGCTGGATCATCAA
AATTATGGCCTGCATTGAGTCGGTGCATTATGTGGTGGTCATTAATGGATCCCCCCAAGGATCTATCATTCCTCAACGTAGGCTTAGACAGGGAGACCCTCTTTCCCCTT
ATCTTTTCTTGTTTTGTGTGGAAAACCTCTATGCTTTACTAAATCAGGAAGAGAGGCTTAACCATTTTAGAGGCCTAAAGATTAACAAAAGATGTCCCTCACTTTCTCAT
TTGTTTTTTGTAGATGACAGCCTTATTATGTGCAGGGCGACAATCCAAGATTGCATCACTATTAAAGGCATTCTGAAGACATACGAAGAAGCCTCGGGACAAACGATTAA
CCACGACAAATCCATGTTTATGGCCAGTAAAAACATCAAGGAGGACAAAGTGGCTATGTTGCAAAGGATTTTGGGCATTCAAGCTTCAAAATCTCTTGGGCATTATTTAG
GGATGCCTTCTCAGAATGGGAAAAACAAGAGCATTGTGTTCAAGAGGTTGAAGGACAGAATTTGGAAAGCTCTCAAAGGCTGGAAGGAAAAATTGTTTTCAGCCGGAGGA
AAAGAAGTTCTTTTCAAAACCATAGCTCAAGCCATTCCCACCAATACTATGAGTTGCTTCAAGTTGCCAAATTTCATGTGTGATGACATTAACAGAGTGTGCGCCAAGTT
TTGGTGGGGATTGTCGAGTTCAAAAAACAAAGCCCACTGGCTGAGTTGGACAAAGCTATGCAAGAGCAAGGATAGAGGAGGGCTTGGCTTTCGTGGCCTAAAGCTATTCA
ATCAACCGTTGTTAGCCAAAATTAGTTGGCGGATCCTCAAACACCCCAATTCTCTCCTAACCAAAGTCTTAAAGGGGAGATATTTTAAGGGAGAATCCTTCCTAAATGCC
CTTTTGGGTGCTATTCCCTCTTTGACTTGGAGGAGCATTTTGTGGGGAAGGGAGTTGTTTAAATCGGGTTACAGATGGAAGATTGGTAACGGCCAACATATCAAGATTAA
TCAAGACCCTTGGATTGCCATACAAGGAGTTAGTACCCCTCTGAGGGTCAGCGAGCTCCTTAAGGAAAATTTTGTTGCATCTTTGCTAGATGAGGGGGGTAGATGGAAGG
AGGATTTGATCCTAGCCGAATTCTGTGGGGTTGACTCAATTGATATCTTAAACACTCCAACAGGGGGAAGAAATTACAAGGATGAGATCATATGGAAGTGTGACCCAAAA
GGAATGACTTGTGAAGAAACTACATCGCACGCTATGTGGAGCTGTAAGTTAGCCAAGAAAGTGTGGATTTATTTCATTATTCTTATGTCCTCCTTGTTTCGTTTGAATAT
GGAAGCTTGGAGCCCCTCGGACTATTGGGATTGGTTGTCTAAGAACGTGGAGATTGAGGAGTTGGAGTTAGCTATCCTAATTCTTTGGCAAATTTGGTCTCACCGAAACA
AGATTGTTCACAACGCAATCAATTCAGATCTCAATTCCATCATCAGAGCTATCGAGTCTAGGAGATCTGAAGGTCTTACCTCACAATCCTCTAATCTAGAGGAGCCGTTG
CCGAGATTGGAGAGCCAGCTGAGTCTGGTGTCGTGGATTCCCCCGCCGTTGGGTTCGTGGAAGATCAATGTGGACGCGTCTTGGAGCGTAGCCCTCTCCGCTGGAGGAAT
TTGA
mRNA sequence
ATGGTTAGGCTGAGTCACAGATCATCTCTCCCTTGGCTTATAGGGGGAGACTTCAATGAAATATCGAACCTCTCTGAGAAGGAGGGAGGAAGAGCTCGAGTGCAACGCCA
AATGGATTTGTTCAATGAGATGATGGAAAGCTGCGGTTTAAGGGATTTGGGTTTCTCGGGTGACATATTTACTTGGAGGAAAGGTAGCAAGGGTAGTAGTTGGATCAGGG
AGAAGCTAGACAGGTTCCTAGGCAATAATGATATGGTGCAAATGTGTAACAGATTGGGGGTCAATCACTTAGGTTATCACAAATCTAATCATAGGATCATTTTAGCCCAA
CTCCAGTTTCAAGGCGATGCCAGATCAAACTCCCGTAGAAGGCCTTTAAAGCTTGAAGAATCTTGGTTAAAATTCCCTGTAAGCAGGAATATCATTAAAGACTGCTGGAA
GGCTTTCCTTGGAAATGATTCAGCTTCATTCCATCTTAAGATTCAAAGGTGCTTAGTTAAGCTTGCAAGTTGGAATAAAAACAGATTGCAAGGATCCATAAAAAAAGCCT
TGGAGTATAAAAAGCTGGAGATTGAGAACATTGAGAAAGAAACCAAAGGCTTCCCTACCGAGAAACTCTTACAAGCAGAAAAAGGGCTGGAGAGTTTGCTGGAAGAGGAA
GAAATGTACTGGAAAATTAGATCCAGGGAAGATTGGCTCCGCTGGGGGGATAGGAACACAAAACGGTTCCATGCTAAAGCCTCCCAAAGGAAAAAGAAAAATGAGATTGT
GGGGATTAAAGATAAGGATGGGAGTTGGGTGGAAAATGAAGAGGATATCAGCAAGGTAGCCGTTGAGTTCTTTGGTGACCTATTTGCTTCTTCTTACCCGAATGAGAGAA
GTATAGAGGATGTGTTAAAAGCGTTTGTTCCCAAAAGACTCATTTCTGACAATGTGATTCTTGGGTTCGAATGCATTCATGCCCTAAACAACAGAAGAGGAGGAAAGGAA
GGCCACCTAGCAATGAAATTGGATATGAGCAAAGCCTACGACAGGGTGGAGTGGATACTCATTAGTAAAATCATGGAGAAGTTAGGGTTCGTGGAAAGCTGGATCATCAA
AATTATGGCCTGCATTGAGTCGGTGCATTATGTGGTGGTCATTAATGGATCCCCCCAAGGATCTATCATTCCTCAACGTAGGCTTAGACAGGGAGACCCTCTTTCCCCTT
ATCTTTTCTTGTTTTGTGTGGAAAACCTCTATGCTTTACTAAATCAGGAAGAGAGGCTTAACCATTTTAGAGGCCTAAAGATTAACAAAAGATGTCCCTCACTTTCTCAT
TTGTTTTTTGTAGATGACAGCCTTATTATGTGCAGGGCGACAATCCAAGATTGCATCACTATTAAAGGCATTCTGAAGACATACGAAGAAGCCTCGGGACAAACGATTAA
CCACGACAAATCCATGTTTATGGCCAGTAAAAACATCAAGGAGGACAAAGTGGCTATGTTGCAAAGGATTTTGGGCATTCAAGCTTCAAAATCTCTTGGGCATTATTTAG
GGATGCCTTCTCAGAATGGGAAAAACAAGAGCATTGTGTTCAAGAGGTTGAAGGACAGAATTTGGAAAGCTCTCAAAGGCTGGAAGGAAAAATTGTTTTCAGCCGGAGGA
AAAGAAGTTCTTTTCAAAACCATAGCTCAAGCCATTCCCACCAATACTATGAGTTGCTTCAAGTTGCCAAATTTCATGTGTGATGACATTAACAGAGTGTGCGCCAAGTT
TTGGTGGGGATTGTCGAGTTCAAAAAACAAAGCCCACTGGCTGAGTTGGACAAAGCTATGCAAGAGCAAGGATAGAGGAGGGCTTGGCTTTCGTGGCCTAAAGCTATTCA
ATCAACCGTTGTTAGCCAAAATTAGTTGGCGGATCCTCAAACACCCCAATTCTCTCCTAACCAAAGTCTTAAAGGGGAGATATTTTAAGGGAGAATCCTTCCTAAATGCC
CTTTTGGGTGCTATTCCCTCTTTGACTTGGAGGAGCATTTTGTGGGGAAGGGAGTTGTTTAAATCGGGTTACAGATGGAAGATTGGTAACGGCCAACATATCAAGATTAA
TCAAGACCCTTGGATTGCCATACAAGGAGTTAGTACCCCTCTGAGGGTCAGCGAGCTCCTTAAGGAAAATTTTGTTGCATCTTTGCTAGATGAGGGGGGTAGATGGAAGG
AGGATTTGATCCTAGCCGAATTCTGTGGGGTTGACTCAATTGATATCTTAAACACTCCAACAGGGGGAAGAAATTACAAGGATGAGATCATATGGAAGTGTGACCCAAAA
GGAATGACTTGTGAAGAAACTACATCGCACGCTATGTGGAGCTGTAAGTTAGCCAAGAAAGTGTGGATTTATTTCATTATTCTTATGTCCTCCTTGTTTCGTTTGAATAT
GGAAGCTTGGAGCCCCTCGGACTATTGGGATTGGTTGTCTAAGAACGTGGAGATTGAGGAGTTGGAGTTAGCTATCCTAATTCTTTGGCAAATTTGGTCTCACCGAAACA
AGATTGTTCACAACGCAATCAATTCAGATCTCAATTCCATCATCAGAGCTATCGAGTCTAGGAGATCTGAAGGTCTTACCTCACAATCCTCTAATCTAGAGGAGCCGTTG
CCGAGATTGGAGAGCCAGCTGAGTCTGGTGTCGTGGATTCCCCCGCCGTTGGGTTCGTGGAAGATCAATGTGGACGCGTCTTGGAGCGTAGCCCTCTCCGCTGGAGGAAT
TTGA
Protein sequence
MVRLSHRSSLPWLIGGDFNEISNLSEKEGGRARVQRQMDLFNEMMESCGLRDLGFSGDIFTWRKGSKGSSWIREKLDRFLGNNDMVQMCNRLGVNHLGYHKSNHRIILAQ
LQFQGDARSNSRRRPLKLEESWLKFPVSRNIIKDCWKAFLGNDSASFHLKIQRCLVKLASWNKNRLQGSIKKALEYKKLEIENIEKETKGFPTEKLLQAEKGLESLLEEE
EMYWKIRSREDWLRWGDRNTKRFHAKASQRKKKNEIVGIKDKDGSWVENEEDISKVAVEFFGDLFASSYPNERSIEDVLKAFVPKRLISDNVILGFECIHALNNRRGGKE
GHLAMKLDMSKAYDRVEWILISKIMEKLGFVESWIIKIMACIESVHYVVVINGSPQGSIIPQRRLRQGDPLSPYLFLFCVENLYALLNQEERLNHFRGLKINKRCPSLSH
LFFVDDSLIMCRATIQDCITIKGILKTYEEASGQTINHDKSMFMASKNIKEDKVAMLQRILGIQASKSLGHYLGMPSQNGKNKSIVFKRLKDRIWKALKGWKEKLFSAGG
KEVLFKTIAQAIPTNTMSCFKLPNFMCDDINRVCAKFWWGLSSSKNKAHWLSWTKLCKSKDRGGLGFRGLKLFNQPLLAKISWRILKHPNSLLTKVLKGRYFKGESFLNA
LLGAIPSLTWRSILWGRELFKSGYRWKIGNGQHIKINQDPWIAIQGVSTPLRVSELLKENFVASLLDEGGRWKEDLILAEFCGVDSIDILNTPTGGRNYKDEIIWKCDPK
GMTCEETTSHAMWSCKLAKKVWIYFIILMSSLFRLNMEAWSPSDYWDWLSKNVEIEELELAILILWQIWSHRNKIVHNAINSDLNSIIRAIESRRSEGLTSQSSNLEEPL
PRLESQLSLVSWIPPPLGSWKINVDASWSVALSAGGI