; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015845 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015845
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:27078932..27083335
RNA-Seq ExpressionLag0015845
SyntenyLag0015845
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]1.2e-25539.52Show/hide
Query:  KEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRG-TFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHL---LSSLPRRKREFS
        K GG  R+ ++M  FK++I  C L D+ F G KFTW + R G  +I+ERLDR +C+  + S F       L    SDH PI   +      L  +K  F 
Subjt:  KEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRG-TFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHL---LSSLPRRKREFS

Query:  KPFRFEESWIRHNECSHLI--ENSGFWGDSSSSSLSK---DLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHGL-TINFEAIHNLEFELDNLLEE
        + + +E+ W  +  CS+++  E   F G+S  S + K         + LK W KE +  RK K     + LK   +  L  I+ E I  LE ++ N+L +
Subjt:  KPFRFEESWIRHNECSHLI--ENSGFWGDSSSSSLSK---DLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHGL-TINFEAIHNLEFELDNLLEE

Query:  EEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFT
        EEVYWKQRSR +WLK GDKNT+ FH KAS R + N+I G+ D+ GNW+ DP  IE  F  +F+ +F+SSNP  +QI  AL+ + P+VS  MN  L  PFT
Subjt:  EEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFT

Query:  REEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVL
         E++ R++ +M PTK+PGPDG PA F+Q +W IVG+     CL ILN+  +++  N T IALIPK+  P+KV ++RPISLCNV Y+IV+K IANRLK +L
Subjt:  REEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVL

Query:  KDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKA
          IISP+QSAFI  RLI+DN++IG+ECLH I+ ++     + A+KLD+SKA+DRVEW FL++ M+ LGF + W +LI++CI+T  FSV IN    G  K 
Subjt:  KDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKA

Query:  SRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHP
         RGLRQG PLSPYLF+L AE  S+L++ A  E K+  +  +     I+HLLFADDSLVF KA+  +  ++K +   Y KASG+  NF KSSM FS K   
Subjt:  SRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHP

Query:  DRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGS
        ++   + S+  ++ V     YLG+P     +K   F  V  K+   +      LFS  GKEILIK+V  A+P+Y MSVFK+PK LC++I K  ++FWWG+
Subjt:  DRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGS

Query:  SSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNG
          +K  +HW  W+ MS  K  GGL FRD+  FN+ALVAKQ WR++  PNSL+A  M++ Y+K     +A++G NPS++WRS+LWG +++  G+R ++ +G
Subjt:  SSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNG

Query:  QNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGSWNQDLLRIAVAPEDVQIIRRIPIDLSK-EDKYIWHYDKCGNYSVKSGYNIFM-KNKISAG
        + +  + D WIP+   F+PI  +       V+  I     W  D L      ED++ I +I +   K ED+ +WH+DK G YSVKSGY + + +N  +  
Subjt:  QNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGSWNQDLLRIAVAPEDVQIIRRIPIDLSK-EDKYIWHYDKCGNYSVKSGYNIFM-KNKISAG

Query:  SSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLN
         S+NS + +W+  W L +P KVK F WRA+ + +P+  N+ +R    + +C RC   +ET  H+LI C  A+++W L        +D N  F      + 
Subjt:  SSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLN

Query:  SCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICR
        S  +  +   + V CW IWS RN          S   +         Y +V+   +           +KW PP    LK+NVDAA        G+G I R
Subjt:  SCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICR

Query:  NEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIA
        +  G I+              +AE +AI  GL  A +     ++VESDC +   L++     R+E+  ++  +R  +  F  ++ S+  R  N  A A+A
Subjt:  NEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIA

Query:  KWARSEKNCCVWTHVYP
        K+A    +  VW   +P
Subjt:  KWARSEKNCCVWTHVYP

XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber]1.6e-25538.97Show/hide
Query:  KKWRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGT--FIQERL
        +KWRFTG YG  DT++++++W LL+ L ++ + PW+  GDFNE++   EKEGG SR    M NF + IN C+L+DL +IG+ FTW S R G   +++ERL
Subjt:  KKWRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGT--FIQERL

Query:  DRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWG--DSSSSSLSKDLNSCASVLKKWGKEV
        DR + ++ + + F +  +YH     SDH  + +    S  RRKR   KPFRFE  W++   C+ ++  +   G    S SSL   L  C   L  W K V
Subjt:  DRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWG--DSSSSSLSKDLNSCASVLKKWGKEV

Query:  YINRKNKITACKNALKE-AYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIES
        + +   KI + +  L+    + G  +  E IH    EL+ LLE EE+ W QRSR  WLK GDKNT  FH KAS R + N I  I D+NG W  D   I  
Subjt:  YINRKNKITACKNALKE-AYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIES

Query:  TFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWN
         F+ YF  +F++SNP  S  D  L  +Q +V+ +MN  LL  F   EVER++KQMFPT +PGPDG P +FYQ+YW  V        LD LN G     +N
Subjt:  TFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWN

Query:  QTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVE
        +T+I LIPK+ +PK V+D+RP SLCNV YKI SKTIA+RLK +L  ++  +QSAF+  RLI+DN+++  E +H I   R       A+KLD+SKA+DRVE
Subjt:  QTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVE

Query:  WPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDS
        W  L +IM KLGF   W  LI+ C+S+  ++V IN   +G+   SRGLRQGDPLSPYLFL  AEGLS++   A+   +L  ++ S SGP +SHL FADDS
Subjt:  WPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDS

Query:  LVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFS
        L+F +A   E   I+ +LK+YE +SG+ +N  K+S+ FS     + +  + +L G + +     YLG+PS    SKT  F+ +  K+ K L G K  L S
Subjt:  LVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFS

Query:  VAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFM
         AGKE+LIK+V  A+P+Y MS FKIP  +CDE+T   SQFWWG   E+RK+ W SW+K+ +PK  GG+ FRD++ FN+AL+AKQ WR+ + PNSL     
Subjt:  VAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFM

Query:  RSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFAL---EKVSFFIS-PSGSWNQDLLRIAVAP
        ++ YF   N   A +G+NPSY WRS++  +E++  G++ +V NG +I  ++D W+P     K I S     L    KVS  I      WN+ L+R  +  
Subjt:  RSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFAL---EKVSFFIS-PSGSWNQDLLRIAVAP

Query:  EDVQIIRRIPIDLS-KEDKYIWHYDKCGNYSVKSGYNIFMKNKISAG----SSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDC
        E+  ++  IP+ L    D+ IW  +  G ++V+  Y   M+  I+      S +  M ++W+ +W +  P+K++ F W+A    +P+K N+ RR +  D 
Subjt:  EDVQIIRRIPIDLS-KEDKYIWHYDKCGNYSVKSGYNIFMKNKISAG----SSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDC

Query:  LCVRCGENLETTDHILISCPKAQEVWS---LTFTHDVLLQDFNHCFAD-RWFVLNS-CLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKY
        LC  CG+  E   H+   C KA EVWS   L F   +   +    F D  W ++     N + L K    CW IW +RN+  H           +   + 
Subjt:  LCVRCGENLETTDHILISCPKAQEVWS---LTFTHDVLLQDFNHCFAD-RWFVLNS-CLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKY

Query:  TTDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMA----ELKAILEGLIFAERRGL
          +Y   N   SS    I      KW PP   + K+NVD A   G   +G+G++ R+  G ++ A +      +P+P+A    E K +   +IFA   G 
Subjt:  TTDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMA----ELKAILEGLIFAERRGL

Query:  RQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPD
        R++  E+D L     +    E  S + T++ SI SL   F     S+ +R  N  A  +A++A+   +  VW    P+
Subjt:  RQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPD

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]4.2e-25339.16Show/hide
Query:  AVDMDVALDKKWRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRG
        AV  + + D KWR TG YG  +T+++  +W LL  L++    PWL  GDFNEIL   EK GG +R++H MD F+  +N C   DL + G  +TW + + G
Subjt:  AVDMDVALDKKWRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRG

Query:  -TFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENS-GFWGD-SSSSSLSKDLNSCASV
           I  RLDR +    + + F    V+HL     DH  +   L++    R R   K F FE  W +  +C  +IE S GF  D S+   +S++L  CA  
Subjt:  -TFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENS-GFWGD-SSSSSLSKDLNSCASV

Query:  LKKWGKEVYINRKNKITACKNALKEAYEHGLTINFE-AIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWL
        L KW   +Y     KI   ++ L       L  +    I+ L  E++ LL++EE YW QR++  WLK GD+NT+ FH +AS R K N I GI D  G W 
Subjt:  LKKWGKEVYINRKNKITACKNALKEAYEHGLTINFE-AIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWL

Query:  CDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNK
         +   I    ISYF +I+SSS+P  SQI+     I  +V++ MN SL+  FT+EEV  ++KQ+ P K+PGPDG  A+F+Q YWSIVG+      L++LN 
Subjt:  CDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNK

Query:  GKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDL
           I + N+TNI+LIPK NNPK+++D+RPISLCNV YK++SK +ANRLK +L  IIS +QSAF   RLI+DN+++  E +H + +     +   AIKLD+
Subjt:  GKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDL

Query:  SKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYIS
        SKAFDRVEW F+ K+M ++GF + W +L++ CI++  +S+ IN  + GN   SRGLRQGDPLSP LFLL AEGLS+LI+ A     ++ +S +   P ++
Subjt:  SKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYIS

Query:  HLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQ
        HL FADDS++FCKA   E   ++ +L  YE+ASG+ IN  KSS+ FS     + R  + ++LG         YLG+PS    SK++ F+ +  K+G  L 
Subjt:  HLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQ

Query:  GCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKP
        G K  L S+ GKEILIK+V  AIP+Y MS F +P+ LCD++ +    FWWG  +++ K+ W SW++M   K+ GGL FR+++ FN A++AKQ WRIL  P
Subjt:  GCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKP

Query:  NSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSE-ERFALEKVSFFISPSGS-WNQDLL
        NSLV   +++ YF  G++L+A++G +PSY WRS+    E++  G R +V NG+ I  + D W+P    +K I  +   F    VS  I P    W  + L
Subjt:  NSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSE-ERFALEKVSFFISPSGS-WNQDLL

Query:  RIAVAPEDVQIIRRIPIDLS-KEDKYIWHYDKCGNYSVKSGY----NIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRR
        R    P +V+ I RIP+  +  EDK IW  +K G +SVKS Y    +I   N+    S+ +    +W+KLW L +P K+K F WRA  D +P+  NI +R
Subjt:  RIAVAPEDVQIIRRIPIDLS-KEDKYIWHYDKCGNYSVKSGY----NIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRR

Query:  GIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHN-KPIPPSLIRSNWIK
        GI     C  CG   E  +H L+ C  A  VW     +    Q  N  F D    L      + L    V  WAIW +RN   HN  P+ PS +   W+ 
Subjt:  GIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHN-KPIPPSLIRSNWIK

Query:  KYTT--DYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGL
           T  D+ K     S    P   + I +W  PP+G  K+NVD A      ++ IG+I R+  G +V A   Y    + +   E  A+ +G++ A    L
Subjt:  KYTT--DYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGL

Query:  RQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQISSA
         ++++E D L     ++ D    +ELG ++  IRS++ SF      +  R  NVVA  +A+ AR E++ C+W  V    +  LV +D +  A
Subjt:  RQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQISSA

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis]5.0e-25437.15Show/hide
Query:  SDGAVDM---DVALDKKWRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTW
        SD  +D+    V    +WRFTG+YG +    +  TW L+  +  + + PWL+GGDFNEIL+  EKEGGP R    M+ F++ +  C L DL+F+G  FTW
Subjt:  SDGAVDM---DVALDKKWRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTW

Query:  HSTRRGTFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIEN--SGFWGDSSSSSLSKDLN
           R G  I+ RLDRF+   S+  LF  + V HL    SDH PI + + S++PR++R   + FRFEE W+   EC++++++      G+    ++   + 
Subjt:  HSTRRGTFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIEN--SGFWGDSSSSSLSKDLN

Query:  SCASVLKKWGKEVYINRKNKITACKNALKEAYEHGLTINFEAIH-NLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDN
             L  W  + + + K +I   +  L   Y+  L+   E     LE +L++LL  E  YW+QRSR  WL  GD NTR FH +AS R K N I G+ +N
Subjt:  SCASVLKKWGKEVYINRKNKITACKNALKEAYEHGLTINFEAIH-NLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDN

Query:  NGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECL
        +G W  +  ++E+  + YF  +FS+S+P    ++L        V+  MN  L+  F  EE+ +++ QM P K+PGPDGF  +FYQ YWS+VG   ++   
Subjt:  NGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECL

Query:  DILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAA
          +N    + + N T + LIPK+   + +   RPISLCNV YK+ SK +ANRLK +L+DII+P+QSAF+ GR ISDN ++  E  H +K          A
Subjt:  DILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAA

Query:  IKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSIS
        +KLD+SKA+DRVEW F++ +M  +GFD  W   I+ C++T  +S  +N E +G+   +RGLRQGD +SPYLFLL AEGLS ++S    + +L  ++ ++ 
Subjt:  IKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSIS

Query:  GPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKL
         P I+HL FADDS VF KA   E   +KE+LK YE ASG+ +NF KS + FS  V    +  L+ + GVE VD    YLG+P+  S SKT+ F F++ K 
Subjt:  GPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKL

Query:  GKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWR
           ++  K    SVAGKE++IKSV  ++P+Y+MS F++PK LC E+ +  ++FWWG S + RK+HW +W+KM +PK  GGL FR++E FN+AL+AKQ WR
Subjt:  GKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWR

Query:  ILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICS-EERFALEKVSFFISP-SGSW
        IL  P+SL+   +++ YF   + + A + +  SY WRSL+ G+ LL  G+R +V +G  IS + DPWIP+ + F+P  +  E      V+  I P S  W
Subjt:  ILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICS-EERFALEKVSFFISP-SGSW

Query:  NQDLLRIAVAPEDVQIIRRIPIDL-SKEDKYIWHYDKCGNYSVKSGYNI--FMKNKISAGSSNNSMA--EVWQKLWKLTVPSKVKHFCWRAINDFIPSKV
          D L      ++V +IR+IP+ L + ED+ IWH+DK G YSVKSGY++   + +  S  S++NS    ++W+++W   V  KV++F WR + + +P+KV
Subjt:  NQDLLRIAVAPEDVQIIRRIPIDL-SKEDKYIWHYDKCGNYSVKSGYNI--FMKNKISAGSSNNSMA--EVWQKLWKLTVPSKVKHFCWRAINDFIPSKV

Query:  NIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHC--FADRWFV-LNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSL
        N+ RR    + +C  C    ETT H+ + C     +W  +    + L+  NH       W + +   LN   +    +  WAIWS+RN    N      +
Subjt:  NIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHC--FADRWFV-LNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSL

Query:  IRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFA
            W     ++Y + +   S+ + P  R    KW  PP GRLKINVD A+K      GIG++ R+E+G   GA +        +   E +A   GL+ A
Subjt:  IRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFA

Query:  ERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQISSAHVA
           G +Q+ +E+DC      ++  +E  SE+  ++D  ++    F  I++ +  R +N VA+ +A +A  +    +     P ++  ++++D  ++  +A
Subjt:  ERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQISSAHVA

XP_030936391.1 uncharacterized protein LOC115961572 [Quercus lobata]1.0e-25438.21Show/hide
Query:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRR-GTFIQERLDRF
        W FTG YG  DT Q+ ++W  L+ L  + + PWL  GDFNEI    EKEGG  R R  M+NF   IN C  +++DFIG K+TW   R  G  I+ERLDR 
Subjt:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRR-GTFIQERLDRF

Query:  ICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSSSSS--LSKDLNSCASVLKKWGKEVYIN
        + N  +  LF    +YHL    SDH P+ +HL+    R+K++  K FRFE  W++ + C  +++ +   G+ S +   L   L  C   L+KW KE + +
Subjt:  ICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSSSSS--LSKDLNSCASVLKKWGKEVYIN

Query:  RKNKITACKNALK--EAYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTF
           KI+  +  L+  E       I  E +      L+  LE+E+  W+QRSR  W + GD+NT  FH KAS R++ N I GI D  G W  D ++IE   
Subjt:  RKNKITACKNALK--EAYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTF

Query:  ISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQT
        ++YF+ +F+SS P   +    L  +QP+V+  MN  L   +T +EV  ++KQM+P K+PGPDG P LF+Q++W+  G+   S  LD LN G S  ++N+T
Subjt:  ISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQT

Query:  NIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWP
        +I LIPKIN PK VSDYRPISLCNV YKI SK IANRLK  L  IIS +QSAF+ GRLI+DN+++  E +H I   +       AIKLD+SKA+DRVEW 
Subjt:  NIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWP

Query:  FLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLV
        F++KIM KLGFD N  +LI+ CI+T  +++ IN   +G    SRG+RQGDPLSPYLFLL AEGLS+LI  ++  G +  ++    GP +SHL FADDSL+
Subjt:  FLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLV

Query:  FCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVA
        FCKA   E   ++ +L +YE+ASG+ +N AK+S+ FSS    + +  +    G + +     YLG+PS    +K   F+ +  KLGK L G K  L S A
Subjt:  FCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVA

Query:  GKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRS
        GKEILIK+V LA+P+Y MS FK+P  LCDE+T    +FWWG    + ++ W SW+KM   KS GG+ F++++ FN AL+AKQ WR+    +SLV   +++
Subjt:  GKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRS

Query:  IYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPIC------SEERFALEKVSFFISPSGSWNQDLLRIAVAP
         YF     + A +G NPSY WRS++  + L+  G++ +V NG +I  + D W+P     K I       S+ R A        S  G W  +++     P
Subjt:  IYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPIC------SEERFALEKVSFFISPSGSWNQDLLRIAVAP

Query:  EDVQIIRRIPIDLS-KEDKYIWHYDKCGNYSVKSGYN-----IFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTD
         +   I+ IPI      DK IW     G ++V+S Y      + M NK  A S  + M   W+++W + VP K++HF WRA  + +P+K N+ RR I  D
Subjt:  EDVQIIRRIPIDLS-KEDKYIWHYDKCGNYSVKSGYN-----IFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTD

Query:  CLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFAD-RW-FVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKK---
         +C  C E  E+  H+L  C KA+E    +      L   +  F D  W  ++   +  E + + A T WAIW +RN            +R   ++K   
Subjt:  CLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFAD-RW-FVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKK---

Query:  ----YTTDYLKVNSGPSSSRCPIIRAPIK--KWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAER
            + T+YL+     +    P++  P +  +W PP  G  KINVD A  +     G+G++ R+  G +  A +    +   +   E KA+  GL+FA+ 
Subjt:  ----YTTDYLKVNSGPSSSRCPIIRAPIK--KWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAER

Query:  RGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQIS
         G+R I++E D     N +       S +  +++ I+ +   F  I+ S+ +R  N+ A  +AK A S  +   W    P  + + +  D  S
Subjt:  RGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQIS

TrEMBL top hitse value%identityAlignment
A0A2I4E9H8 uncharacterized protein LOC1089875821.2e-24538.06Show/hide
Query:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRG-TFIQERLDRF
        W  TG YG  +T ++  +WDLL+L+       WL  GDFNEI    EK GG +R    M+ F++ I +C LK     G KFTW + RRG  F +E+LDR 
Subjt:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRG-TFIQERLDRF

Query:  ICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKP--FRFEESWIRHNECSHLIE---NSGFWGDSSSSSLSKDLNSCASVLKKWGKEV
        + N S+ + F  +    L  + SDH P+ I + +   +      KP  FRFE +W    ECS++I+   N    G   +  +++ L  C+  LKKW    
Subjt:  ICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKP--FRFEESWIRHNECSHLIE---NSGFWGDSSSSSLSKDLNSCASVLKKWGKEV

Query:  YINRKNKITACKNALKEAYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIEST
         +     ITA    L+E  +       EAI  L+ EL+  L E+++ WKQR+++ WL+ GDKNT+ FH  A+ R K N+I  + D+N   + +  EI + 
Subjt:  YINRKNKITACKNALKEAYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIEST

Query:  FISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQ
        F S+F D+F+SSNP +  I+  L  + P+VS  MN  LL  FT EEV+ ++ QM    SPGPDGFPALFYQ++W I+G +     L+ILN G S+E  N+
Subjt:  FISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQ

Query:  TNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEW
        T I LIPK+  PK+V DYRPISLCNV YKIV+K ++NR+K VL DIIS +QSAF+ GR IS+NI++ +E LH +     +     A+KLD+SKA+DRVEW
Subjt:  TNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEW

Query:  PFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSL
         FL  +M++LGFD  W +++LNCIS+A +S+ +N E++  FK SRGLRQGDPLSPYLF+L  E LS++I  A   G +SSV        +SHL FADDSL
Subjt:  PFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSL

Query:  VFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSV
        +FCK+N LE  ++  +L IYE ASG+ +N  KSS+ FS     + +  + S+ GV        YLG+PS    +KT  F  +++K    +   K +  S 
Subjt:  VFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSV

Query:  AGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMR
        AGKEIL+KSV  AIP+Y M +F++P+ +   + +   +FWWGS+  + K+ W  W  +S  K  GGL FR    FN AL+AKQ W IL  PNSL A  ++
Subjt:  AGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMR

Query:  SIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSF-FISPSGSWNQDLLRIAVAPEDVQ
          YF   N+L A++G  PS +WRS++ G ELL  G+  ++ NG  +  +   W+PK  L      + R  LE+V++ FI     WN  LL+   + ++++
Subjt:  SIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSF-FISPSGSWNQDLLRIAVAPEDVQ

Query:  IIRRIPIDL-SKEDKYIWHYDKCGNYSVKSGYNI---FMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRC
        +I  IPI L ++EDK  W   + G +SV+S Y++    ++ +++  S  + ++  W+ LW+L VP+ V+ F WRA N+ IP+  N+ +R ++   LC  C
Subjt:  IIRRIPIDL-SKEDKYIWHYDKCGNYSVKSGYNI---FMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRC

Query:  GENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPI--PPSLIRSNW-------IKKYT
            ETT H+L  C  AQ+VW       + +   +  F + W  L+  L   +L+        +W  RN   H K +  P S++   +       + K  
Subjt:  GENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPI--PPSLIRSNW-------IKKYT

Query:  TDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILV
         + L VN  P              W  PP G+ KIN DAA  +     GIG I R+  G ++G       +      AE  A+L  +IF +  G   I++
Subjt:  TDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILV

Query:  ESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAK
        E D LQ    +   V+  S+ G +++  +++  S P   I + +R +N+ A  +AK
Subjt:  ESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAK

A0A7N2L6Z9 Reverse transcriptase domain-containing protein3.1e-24939.52Show/hide
Query:  PWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGT-FIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIH
        PW   GDFNEI+   EK GG +R +  MD F+  IN C  KDL + G  +TW + + G   I  RLDR +    +   F    V+HL    SDH  +   
Subjt:  PWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGT-FIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIH

Query:  LLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSSS----SSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEH------GLTIN
         +S+    KR  S+ F FE  W +  +C  +IE+   WG +S       ++  L  CAS L  W           I   +  L +  E       G+ IN
Subjt:  LLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSSS----SSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEH------GLTIN

Query:  FEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFI
              +  EL+ LL++EE++W QRS+  WLK GD+NT+ FH +AS R K N I G+ D  G W  D   I +  ++YF+DI+S+SNP  S +D     I
Subjt:  FEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFI

Query:  QPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNV
           +++ MN  L   FTREE+  ++KQ+ PTKSPGPDG  A+F+Q YW IVG    +  L++LN G S++  N+TNI LIPK +NPK+++D+RPISLCNV
Subjt:  QPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNV

Query:  NYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCIST
         YK++SKT+ANRLK  L  II+ +QSAF   RLI+DN++I +E +H +K+ +       A KLD+SKAFDRVEW F++++M K+GF+  W +LI+ CIS+
Subjt:  NYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCIST

Query:  AKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGE
          +SV IN E  GN   +RGLRQGDPLSPYLFLL AEGLS+L+  A     L+ +S     P I+HL FADDSL+FCKAN  E   +KE+L+ YE ASG+
Subjt:  AKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGE

Query:  SINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPK
         +N  KSS+ FS    P+ +  + ++LG         YLG+PS    SK   F+ +  ++G  L G K  L S  GKEILIK+V  AIP+Y MS F +PK
Subjt:  SINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPK

Query:  RLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLL
         LCDE+ K    FWWG  +++ K+ W SW KM  PKSLGGL FR++  FN AL+AKQ WRIL+ P SL A  +++ YF  G++L+A +G NPSY WRS+ 
Subjt:  RLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLL

Query:  WGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFK----PICSEERFALEKVSFFISPSGS-WNQDLLRIAVAPEDVQIIRRIPIDLS-KEDKYIWHYDK
           E+L  G R +V NG+ I  ++D W+P    +K    P  +E+      VS  I P    W  D +R    P D + I +IP+  +  +D+ IW  +K
Subjt:  WGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFK----PICSEERFALEKVSFFISPSGS-WNQDLLRIAVAPEDVQIIRRIPIDLS-KEDKYIWHYDK

Query:  CGNYSVKSGY----NIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVW
         G +SVKS Y    N+    +I   SS +    +W+ LWKL +P KVK F WRA  + +P+  N+H RG+  +  C  C E +E  +H L++C  A  VW
Subjt:  CGNYSVKSGY----NIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVW

Query:  SL--TFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKKWCPP
        +L       +LL+  +       F+ NS      L       WAIW +RN   H++     L      ++   DY     G      P +++ ++ W  P
Subjt:  SL--TFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKKWCPP

Query:  PVGRLKINVDAAWKLG-FPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDS
        P G  K+NVD A  +     +G+G++ R+E G ++ A        +P+   EL AI +GL+ A+   L QI++ESD L A   I+       E G LV+ 
Subjt:  PVGRLKINVDAAWKLG-FPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDS

Query:  IRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDD
        I      F     SY +R+ N+VA  +A++A+S     VW  V P ++  L+  D
Subjt:  IRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDD

A0A803PIB6 Uncharacterized protein8.4e-24737.58Show/hide
Query:  KWRFTGIYGFADTNQKKKTWDLL-RLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRR-GTFIQERLD
        +W F+ +YGF +   KK TW L+ RL D S  DPWLL GD NEI  +  K GGP R+ + M  F+ T++ C L ++  +G++FTW   R+  + ++ERLD
Subjt:  KWRFTGIYGFADTNQKKKTWDLL-RLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRR-GTFIQERLD

Query:  RFICNNSFDSLFFRTDVYHLDWLFSDHR----PIEIHLL--SSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSS----SSSLSKDLNSCASVL
            N+ +   F    + HLD+  SDHR     I+ +L   +  PR+ R     FRFE+ W++  EC  +I NS  W  SS    +S L   L  CA+ L
Subjt:  RFICNNSFDSLFFRTDVYHLDWLFSDHR----PIEIHLL--SSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSS----SSSLSKDLNSCASVL

Query:  KKWGKEVYINRKNKITACKNALKEAYEHGLTI------NFEA-IHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDN
          W    +   K  I   +  +     HGL        +F A +H+ E  LD LL  EE YW+QRSR +WL+ GD+NT+ FH KAS R+  N IK ++D+
Subjt:  KKWGKEVYINRKNKITACKNALKEAYEHGLTI------NFEA-IHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDN

Query:  NGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECL
        +GN +     I      YF+++F++SN     +   L  I   +SD  N  L   FT  EV  ++K +   KSPG DG  A+FY + W+IVG+      L
Subjt:  NGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECL

Query:  DILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAA
        D+LN G + E +N+T I LIPKI  PK + D+RPISLCNV YKI+SK +A R K VL  +IS +QSAF+  RLI+DNI++  E +H++K+     K  AA
Subjt:  DILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAA

Query:  IKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSIS
        +KLD+SKAFDRVEW FL  +M K+GF     +LI+ C+ T  FS  IN E  G+    RGLRQGDPLSPYLFL+ +EGLS L+      G+L  ++ S  
Subjt:  IKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSIS

Query:  GPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKL
         P I+HLLFADDSL+FC+AN+     IK  L +Y +ASG+ +N  KS M FS       +     +LG+        YLG+P+     K++ F+ +  ++
Subjt:  GPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKL

Query:  GKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWR
         K +      +FS+ GKE+L+K+V  AIP+Y MS F++  + C +I    ++FWWGSS++ +K+HW +W+ +   K  GGL FR    FN+A +AKQ WR
Subjt:  GKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWR

Query:  ILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERF---ALEKVSFFISPSGS
        I   PNSL++  ++  Y+   + ++A++    S  W+ ++WGRELL  G+ +K+ +G  ++  +D WIP    FKP+    RF       V+ +I+ +  
Subjt:  ILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERF---ALEKVSFFISPSGS

Query:  WNQDLLRIAVAPEDVQIIRRIPIDL-SKEDKYIWHYDKCGNYSVKSGYNI--FMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVN
        W+ +LL    +P D+  I  IP+   S  D++ WHYD  G+Y+VKSGYN+   ++NK    SS+ S    WQ  W L +PSKV+ F WR IN  +P   N
Subjt:  WNQDLLRIAVAPEDVQIIRRIPIDL-SKEDKYIWHYDKCGNYSVKSGYNI--FMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVN

Query:  IHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVW-----SLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPS
        +  R + T   C  C    E+  H L SC  A+ VW      L FT    ++D      D    L++ L   +L K+  T W IWSDRN++ H K +   
Subjt:  IHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVW-----SLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPS

Query:  LIRSNWIKKYTTDYLKVNSGPSSS-RCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLI
        +  S+  + Y  ++  V S  + +  C    A   KW PP    LK+NVDAA        GIG+I R+  G ++ A +     ++ S   E KA+  GL 
Subjt:  LIRSNWIKKYTTDYLKVNSGPSSS-RCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLI

Query:  FAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQI
        +A++  L+   VE+DCL   + +       S    LV+ I     SF +  IS+ +R++N  A  +AK A    N C+W    P  +  +V +D +
Subjt:  FAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDDQI

A0A803PV25 Uncharacterized protein4.3e-25137.9Show/hide
Query:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFI
        W FTG YG  D  Q++ +W LLR L      PWL  GDFNEI+  +EK GG  R   +MD FK+ ++DC+  D      + TW +    + I ERLDR +
Subjt:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFI

Query:  CNNSFDSLFFRTDVYHLDWLFSDHRPIEIHL---LSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSSSS----SLSKDLNSCASVLKKWGKE
        C   +   F   D+  LDW  SDHR + + +   L      K +    F FEE+W +  EC+ +++    W + + S    S    +N C   L+ W ++
Subjt:  CNNSFDSLFFRTDVYHLDWLFSDHRPIEIHL---LSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSSSS----SLSKDLNSCASVLKKWGKE

Query:  VYINRKNKITACKNALKEAYEHGLTIN-----FEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDP
            +K+K+ +    LK+A  H LT+      +E I  +E +L+ LLE++E YW+QRSR  WL+WGD+NT+ FH KAS R K NEIKG+ D+ G W  D 
Subjt:  VYINRKNKITACKNALKEAYEHGLTIN-----FEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDP

Query:  MEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKS
        + +      Y++ +F+SS+  +S ++  L  +QP+VS  MN  LL  F  EEV R++K+M PTK+PG DG PALFYQ +WS +    ++  L++LN G  
Subjt:  MEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKS

Query:  IEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKA
        ++  N T +ALIPK++ P+++ ++RPISLCNV YKIVSK +ANR++  L  ++S SQSAF++GRLI DN ++G+E LH ++ +R    +  A+KLD++KA
Subjt:  IEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKA

Query:  FDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLL
        +DRVEW FL+ +M KLG+   W   I+NC+++ +FS  IN E +G     RGLRQGDPLSP+LFLL AE  S LI  A  +G+L  V        +SHL 
Subjt:  FDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLL

Query:  FADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCK
        FADDSLVF  A E E    +ELL+ Y  ASG+ +NF KS M F   V    R  L++ +GV+ VD+ G YLG+PS    +K + F F +NK+   L+G K
Subjt:  FADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCK

Query:  RSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSL
         S FS AGKE+LIK++  AIP+Y MS F++PK+  + I    ++FWWGSS +  K+HWC W  +   K  GGL FRD+  FN+AL+AKQ+WR +  PNSL
Subjt:  RSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSL

Query:  VASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGSWNQDLLRIAVA
         +  +++ Y+    +L A+ G + S++WRSL+WG++++  G R ++ NG ++   +DPW+P+   FK            V      +G W+++ +R    
Subjt:  VASFMRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGSWNQDLLRIAVA

Query:  PEDVQIIRRIPI-DLSKEDKYIWHYDKCGNYSVKSGYNIFMKNKISAGSSNNSMA-EVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLC
        P D ++I ++   +   EDK +WHY K G YSV+SGY +    ++    SN       W++LWKL +P KVKHF W+  + +IP+   +  R +Q +  C
Subjt:  PEDVQIIRRIPI-DLSKEDKYIWHYDKCGNYSVKSGYNIFMKNKISAGSSNNSMA-EVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLC

Query:  VRCGEN-LETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNS--HFHNKPIPPSLIRSNWIKKYTTDY
         RC     E   H L SC    +VW ++     + +            ++S L  ED     V  W +W  RNS  H  +KP+  +++   W  K+  ++
Subjt:  VRCGEN-LETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNS--HFHNKPIPPSLIRSNWIKKYTTDY

Query:  LKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESD
           N    S +    RA   +W  P  G   INVDA  K+G     +  + R+  G +  A+    + +     AEL AI +G+    ++ L    VE+D
Subjt:  LKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESD

Query:  CLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYP
        CLQA NL+  D     ++  LV  IR L        IS+  R +N  A  +A  A   K   +W  V P
Subjt:  CLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYP

A0A803PWX1 Uncharacterized protein1.3e-24737.94Show/hide
Query:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFI
        W  TG YG  + + +K +W LLR L      PWL  GDFNEI+  AEK GG  R    MD FK+ I+DCR  D      + TW +    + I ERLDR +
Subjt:  WRFTGIYGFADTNQKKKTWDLLRLLDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFI

Query:  CNNSFDSLFFRTDVYHLDWLFSDHRPIEIHL---LSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFW----GDSSSSSLSKDLNSCASVLKKWGKE
        CN  +   F   DV  LDW  SDHR + I++   +      K +    F FEE+W +  EC+ +I+N   W    G     S    +N C   L+ W K+
Subjt:  CNNSFDSLFFRTDVYHLDWLFSDHRPIEIHL---LSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFW----GDSSSSSLSKDLNSCASVLKKWGKE

Query:  VYINRKNKITACKNALKEAYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIES
              N+I   K  L E         +EAI ++E +L+ LLE++E YW+QRSR  WL+WGD+NT+ FH KAS R K NEIKG+ D  G W  D + +  
Subjt:  VYINRKNKITACKNALKEAYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIES

Query:  TFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWN
            Y+K +F  S+     +   L  +QP+VS  MN  L++ F+ EEV +++K M PTK+PG DG PALFYQ +WS + D+ ++ CL++LN G  +   N
Subjt:  TFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWN

Query:  QTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVE
         T +ALIPK++ P+K+ ++RPISLCNV YKIVSK +ANRL+  L  ++S SQSAF++GRLI DN ++G+ECLH ++ NR       A+KLD++KA+DRVE
Subjt:  QTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVE

Query:  WPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDS
        W FL+ +M KLG+D  W + I+ C+++ +FS  IN E +G     RGLRQGDPLSP+LFLL AE  S LI  A   G+L  +     G  +SHL FADDS
Subjt:  WPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDS

Query:  LVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFS
        LVF  ANE E    KELL+ Y KASG+ +NF KS M F  KV    R  L++++GV+ VD+ G YLG+PS    +K + F F+ NK+   L+G K S FS
Subjt:  LVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFS

Query:  VAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFM
         AGKE+LIK+V  AIP+Y MS F++PK+  + I    ++FWWGSS +  K+HWC W+ +   K  GGL FRD+  FN+AL+AKQVWR +  PNSL +  +
Subjt:  VAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFM

Query:  RSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGSWNQDLLRIAVAPEDVQ
        ++ Y+  G ++ A+ G + S++WRSL+WG++++  G R ++ NG ++   +DPW+P+   FK               +  P              P+ + 
Subjt:  RSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGSWNQDLLRIAVAPEDVQ

Query:  IIRRIPIDLSKEDKYIWHYDKCGNYSVKSGYNIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGEN-
        +     IDL K    +  + + G +   + Y           S   +  + W KLWKL +P KVKHF W+  + +IP+   +  R IQ +  C RC    
Subjt:  IIRRIPIDLSKEDKYIWHYDKCGNYSVKSGYNIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGEN-

Query:  LETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSS
         E   H L  C   +EVW+    +  L +            ++S    ++     +  W +W  RNS  H    P +    +W  K+  ++ + N     
Subjt:  LETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSS

Query:  SRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLIS
         +    R    KW PP  G  K+NVDA  K G    G+  + R+  G ++ A+  + +        EL+AIL G+    +R L    VESDCLQA NL+ 
Subjt:  SRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLIS

Query:  MDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYP
         + E   ++  L+  I+ L        + +  R +N VA  +A  A   K   +W  V P
Subjt:  MDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYP

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein8.8e-4424.49Show/hide
Query:  DLLRLLDNSGNDPWLLGGDFN---EILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFICNNSFDSLFFRTDVYH
        DL R LD+      L+ GDFN    IL+ + ++      + L     QT      + L     ++T+ S    T+   ++D  + + +  S   RT++  
Subjt:  DLLRLLDNSGNDPWLLGGDFN---EILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFICNNSFDSLFFRTDVYH

Query:  LDWLFSDHRPIEIHL-LSSLPRRKREFSK--PFRFEESWIRHNECSHLIENSGFWGDSSSSSLSKDLNSCASVLKKWGKEVYIN------RKNKITACKN
        +    SDH  I++ L + +L + +    K       + W+ HNE    I+      ++  ++     ++  +V +  GK + +N       ++KI    +
Subjt:  LDWLFSDHRPIEIHL-LSSLPRRKREFSK--PFRFEESWIRHNECSHLIENSGFWGDSSSSSLSKDLNSCASVLKKWGKEVYIN------RKNKITACKN

Query:  ALKEAYE----HGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIF
         LKE  +    H      + I  +  EL  +  ++ +     SR  + +  +K  R   +    + + N+I  I ++ G+   DP EI++T   Y+K ++
Subjt:  ALKEAYE----HGLTINFEAIHNLEFELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIF

Query:  SSSNPLDSQIDLAL-RFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPK
        ++      ++D  L  +  PR++     SL  P T  E+   I  +   KSPGPDGF A FYQ Y   +    L     I  +G     + + +I LIPK
Subjt:  SSSNPLDSQIDLAL-RFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPK

Query:  I-NNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKY-NRTTIKNMAAIKLDLSKAFDRVEWPFLKKI
           +  K  ++RPISL N++ KI++K +ANR++  +K +I   Q  FI G     NI    + ++ I++ NR   KN   I +D  KAFD+++ PF+ K 
Subjt:  I-NNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKY-NRTTIKNMAAIKLDLSKAFDRVEWPFLKKI

Query:  MTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKAN
        + KLG D  +  +I         ++ +N +    F    G RQG PLSP LF ++ E L+  I     E ++  +        +S  LFADD +V+ +  
Subjt:  MTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKAN

Query:  ELEILHIKELLKIYEKASGESINFAKSSMVF---SSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGK
         +   ++ +L+  + K SG  IN  KS       + +      G L   +  + +  LG  L       +   +++  ++ ++ +     K    S  G+
Subjt:  ELEILHIKELLKIYEKASGESINFAKSSMVF---SSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGK

Query:  EILIKSVGLAIPSYIMSV--FKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVW
          ++K   L    Y  +    K+P     E+ KT  +F W     +      +   +S     GG+   D + + KA V K  W
Subjt:  EILIKSVGLAIPSYIMSV--FKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVW

P08548 LINE-1 reverse transcriptase homolog2.4e-4124Show/hide
Query:  GDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDL--DFIGEK--FTWHSTRRGTFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLL
        GDFN  L   ++      ++ ++D    TI    L D+   F   K  +T+ S+  GT+   ++D  + + S  S F + ++  +  +FSDH  I++ L 
Subjt:  GDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDL--DFIGEK--FTWHSTRRGTFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLL

Query:  SSLPRRKREFSKPFR-----FEESWI---RHNECSHLIENS--------GFWGDSSSSSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHG
        ++  R     +K ++      +++W+      E +  +E +          W D++ + L     +  + LKK  +E   N    +   K   KE + + 
Subjt:  SSLPRRKREFSKPFR-----FEESWI---RHNECSHLIENS--------GFWGDSSSSSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHG

Query:  LTINFEAIHNLEFELDNLLEEEEVYWKQRSREEW----LKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIFSSSNPLDSQ
             + I  +  EL N +E + +  +    + W    +   DK      +K  +++ ++ I+   + N     DP EI+     Y+K ++S       +
Subjt:  LTINFEAIHNLEFELDNLLEEEEVYWKQRSREEW----LKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIFSSSNPLDSQ

Query:  IDLALRFIQ-PRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKI-NNPKKVS
        ID  L     PR+S +    L  P +  E+  +I+ +   KSPGPDGF + FYQ +   +    L+   +I  +G     + + NI LIPK   +P +  
Subjt:  IDLALRFIQ-PRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKI-NNPKKVS

Query:  DYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKY-NRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSN
        +YRPISL N++ KI++K + NR++  +K II   Q  FI G     NI    + ++ I++ N+   K+   + +D  KAFD ++ PF+ + + K+G +  
Subjt:  DYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKY-NRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSN

Query:  WTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKE
        +  LI    S    ++ +N     +F    G RQG PLSP LF ++ E    +++ AI E K +     I    I   LFADD +V+ +        + E
Subjt:  WTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKE

Query:  LLKIYEKASGESINFAKS-SMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLG--MPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGL
        ++K Y   SG  IN  KS + ++++    ++   +   +    V     YLG  +     +   +++  +  ++ + +   K    S  G+  ++K   L
Subjt:  LLKIYEKASGESINFAKS-SMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLG--MPSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGL

Query:  --AIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVW
          AI ++     K P     ++ K    F W      +K    +   +S     GG+   D+  + K++V K  W
Subjt:  --AIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVW

P0C2F6 Putative ribonuclease H protein At1g657506.8e-4423.81Show/hide
Query:  FSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKA
        F  ++ ++   + G +    S AG+  L K+V  ++P + MS   +P+ + + + +    F WGS++EK+K H   W K+  PK  GGL  R  +  N+A
Subjt:  FSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKA

Query:  LVAKQVWRILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSY--LWRSLLWG-RELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKV-
        L++K  WR+L + NSL    ++  Y  G    S  +    S+   WRS+  G R+++ HG+     +GQ I  + D W+  + L +    E     + V 
Subjt:  LVAKQVWRILSKPNSLVASFMRSIYFKGGNILSAEIGKNPSY--LWRSLLWG-RELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKV-

Query:  -SFFISPSGSWNQDLLRIAVAPEDVQIIRRIPIDL--SKEDKYIWHYDKCGNYSVKSGYNIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAI
              P   W+   +           +R + +DL     D+  W + + G +SV+S Y +   +++       +MA  +  LWK+ VP +VK F W   
Subjt:  -SFFISPSGSWNQDLLRIAVAPEDVQIIRRIPIDL--SKEDKYIWHYDKCGNYSVKSGYNIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAI

Query:  NDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLN-----SCLNLEDLRKVAVTCWAIWSDRNSH
        N  + ++   HRR +    +C  C   +E+  H+L  CP    +W +        Q F       W   N      C ++      AV  W  W  R  +
Subjt:  NDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLN-----SCLNLEDLRKVAVTCWAIWSDRNSH

Query:  FHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKK---WCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPM
           +       R  ++K++  +  + +SG  +    I +  +++   W  P VG +K+N D A +        G + R+  G   G  ++ +     +P 
Subjt:  FHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKK---WCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPM

Query:  AELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCK
        AEL  +  GL FA  + + ++ +E D       +   +     L  LV            ++I +  R +N +AD +A +A S         + PD +  
Subjt:  AELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCK

Query:  LVFDDQISS
        L+ +D + S
Subjt:  LVFDDQISS

P11369 LINE-1 retrotransposable element ORF2 protein4.5e-4027.12Show/hide
Query:  IKGISDNNGNWLCDPMEIESTFISYFKDIFSSS-NPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVG
        I  I +  G+   DP EI++T  S++K ++S+    LD       R+  P+++      L  P + +E+E  I  +   KSPGPDGF A FYQ +     
Subjt:  IKGISDNNGNWLCDPMEIESTFISYFKDIFSSS-NPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVG

Query:  DKTLSECLDILNK--------GKSIEDWNQTNIALIPK-INNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHE
             + + IL+K        G     + +  I LIPK   +P K+ ++RPISL N++ KI++K +ANR++  +K II P Q  FI G     NI     
Subjt:  DKTLSECLDILNK--------GKSIEDWNQTNIALIPK-INNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHE

Query:  CLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLI
         +H I  N+   KN   I LD  KAFD+++ PF+ K++ + G    + N+I    S    ++ +N E         G RQG PLSPYLF ++ E    ++
Subjt:  CLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLI

Query:  SGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLG--M
        + AI + K       I    +   L ADD +V+    +     +  L+  + +  G  IN +  SM F    +      +        V +   YLG  +
Subjt:  SGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLG--M

Query:  PSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGL--AIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLG
             +   K+F  +  ++ + L+  K    S  G+  ++K   L  AI  +     KIP +  +E+     +F W +   +      +   +   ++ G
Subjt:  PSSFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEILIKSVGL--AIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLG

Query:  GLNFRDVEGFNKALVAKQVW
        G+   D++ + +A+V K  W
Subjt:  GLNFRDVEGFNKALVAKQVW

P14381 Transposon TX1 uncharacterized 149 kDa protein2.1e-3724.31Show/hide
Query:  LLGGDFNEILEDAEKEGGPSRN------RHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPI
        ++GGDFN  L+  ++     R+      R L+ +F   ++  R ++ + +   FT+   R G   Q R+DR   ++   S    + +      FSDH  +
Subjt:  LLGGDFNEILEDAEKEGGPSRN------RHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPI

Query:  EIH--LLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGF-------W-GDSSSSSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHGL
         +   +  SLP+              W  +N    L+E+ GF       W G  +       LN    V K   K +       ++  +NA  EA  +G 
Subjt:  EIH--LLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGF-------W-GDSSSSSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHGL

Query:  TINFEAIHNLEFELDNLLEEEEVYWKQ---------------RSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDI
         ++ E    L    D  L+ E +  K+               RSR + L   D+ +R F+     +    +I  +   +G  L DP  I     S+++++
Subjt:  TINFEAIHNLEFELDNLLEEEEVYWKQ---------------RSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDI

Query:  FSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPK
        F S +P+       L    P VS+R    L  P T +E+ ++++ M   KSPG DG    F+Q +W  +G        +   KG+      +  ++L+PK
Subjt:  FSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPK

Query:  INNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMT
          + + + ++RP+SL + +YKIV+K I+ RLK VL ++I P QS  + GR I DN+ +  + LH   + R T  ++A + LD  KAFDRV+  +L   + 
Subjt:  INNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHECLHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMT

Query:  KLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANEL
           F   +   +    ++A+  V IN          RG+RQG PLS  L+ L  E    L+       +L+ +        +    +ADD ++    + +
Subjt:  KLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSSVSCSISGPYISHLLFADDSLVFCKANEL

Query:  EILHIKELLKIYEKASGESINFAKSSMVFSSKVHPD--RRGFLSSLLGVECVDDLGFYLGMPS-SFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEI
        ++   +E  ++Y  AS   IN++KSS +    +  D     F       + +  LG YL       S +  +    V+ +LGK     K  + S+ G+ +
Subjt:  EILHIKELLKIYEKASGESINFAKSSMVFSSKVHPD--RRGFLSSLLGVECVDDLGFYLGMPS-SFSNSKTKDFSFVVNKLGKTLQGCKRSLFSVAGKEI

Query:  LIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRIL-SKPN----SLVASFMR
        +I  +  +   Y +      +    +I +    F W         HW S    S+P   GG     +         +Q+ R L + P+    +L +SF R
Subjt:  LIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRIL-SKPN----SLVASFMR

Query:  SI
         +
Subjt:  SI

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein7.5e-3025.32Show/hide
Query:  SGGRRSKSDGAVDMDVALDKKWRFTGIYGFADTNQKKKTWD--LLRLLDNSGNDPWLLGGDFNEILEDAE-----KEGGPSRNRHLMDNFKQTINDCRLK
        SGG  S+S+    +D  L   WR    Y  ++  +    WD  +  L+    +   +L GDF++I   ++     +   P R    ++ F+  + D  L 
Subjt:  SGGRRSKSDGAVDMDVALDKKWRFTGIYGFADTNQKKKTWD--LLRLLDNSGNDPWLLGGDFNEILEDAE-----KEGGPSRNRHLMDNFKQTINDCRLK

Query:  DLDFIGEKFTWHSTRRGTFIQERLDRFICNNS-FDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGD-
        D+   G  +TW + +    I  +LDR I N   F S      V+ L  + SDH P  I +L +LP+R ++  + F F  +       + L+  +  W + 
Subjt:  DLDFIGEKFTWHSTRRGTFIQERLDRFICNNS-FDSLFFRTDVYHLDWLFSDHRPIEIHLLSSLPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGD-

Query:  ----SSSSSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHGLTINFEAIHNLEF----ELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFH
            S   SL + L +     K   ++ + N ++K     ++L+      LT   +++  +E     + +      E +++Q+SR +WL+ GD NTR FH
Subjt:  ----SSSSSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHGLTINFEAIHNLEF----ELDNLLEEEEVYWKQRSREEWLKWGDKNTRCFH

Query:  KKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIF-SSSNPLDSQIDLALRFIQP-RVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFP
        K        N IK +  ++   + +  +++   ++Y+  +  S S+ L       ++ I P R +D +   L    + +E+  ++  M   K+PGPD F 
Subjt:  KKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIF-SSSNPLDSQIDLALRFIQP-RVSDRMNRSLLMPFTREEVERSIKQMFPTKSPGPDGFP

Query:  ALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVS
        A F+   W +V D T++   +    G  ++ +N T I LIPK+    ++S +RP+S C V YKI++
Subjt:  ALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVS

AT2G02650.1 Ribonuclease H-like superfamily protein1.6e-2728.74Show/hide
Query:  EVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSC---LN
        EV Q +WKL V  K+KHF WR +   + +   +  R I  D +C RC    ET  HI+ +CP  Q VW          +  N    ++W   +S    LN
Subjt:  EVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSC---LN

Query:  -LEDLRKVAVT-----------CWAIWSDRNSH-FHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIK-------KWCPPPVGRLKINVDA
         L  L K   T            W +W  RN   F  K   P       I+   T++L  N    ++   +   PI+       +W PPP G +K N D+
Subjt:  -LEDLRKVAVT-----------CWAIWSDRNSH-FHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIK-------KWCPPPVGRLKINVDA

Query:  AWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIK
         +  G P T  G   R   G IV            S  AE    L  L      GLR +  ESD      LI+ + E  S LGTL+  IR   L  P   
Subjt:  AWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIK

Query:  ISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVF
        + +  R  N  ADA+A    +           P WL   ++
Subjt:  ISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVF

AT3G09510.1 Ribonuclease H-like superfamily protein8.2e-4528.22Show/hide
Query:  MRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGS---WNQDLLRIAVAP
        M++ YFK  +IL A++ K  SY W SLL G  LL  G R  + +GQNI    D  I   H  +P+ +EE +    ++      GS   W+   +   V  
Subjt:  MRSIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGS---WNQDLLRIAVAP

Query:  EDVQIIRRIPIDLSKE-DKYIWHYDKCGNYSVKSGYNIFM---KNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCL
         D   I RI +  SK+ DK IW+Y+  G Y+V+SGY +        I A +  +   ++  ++W L +  K+KHF WRA++  + +   +  RG++ D  
Subjt:  EDVQIIRRIPIDLSKE-DKYIWHYDKCGNYSVKSGYNIFM---KNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCL

Query:  CVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLN------LEDLRKVAVT--CWAIWSDRNSHFHNK--PIPPSLIRSNW
        C RC    E+ +H L +CP A   W L+ +  +  Q  ++ F +    +++ LN      + D  K+      W IW  RN+   NK    P   + S  
Subjt:  CVRCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDFNHCFADRWFVLNSCLN------LEDLRKVAVT--CWAIWSDRNSHFHNK--PIPPSLIRSNW

Query:  IKKYTTDYLKVNSGPSSSRCPIIRAPIKK--WCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERR
         K  T D+L        +  P  +    K  W  PP   +K N DA + +   +   G I RN  G  +   ++          AE KA+L  L     R
Subjt:  IKKYTTDYLKVNSGPSSSRCPIIRAPIKK--WCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERR

Query:  GLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDD
        G  Q+ +E DC    NLI+  +   S L   ++ I   A  F  I+  + +R  N +A  +AK+  +       +   P WL +   +D
Subjt:  GLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKNCCVWTHVYPDWLCKLVFDD

AT4G29090.1 Ribonuclease H-like superfamily protein3.1e-6827.46Show/hide
Query:  LAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNILS
        +A+P+Y M+ F +PK +C +I    + FWW +  E + +HW +W+ +S  K+ GG+ F+D+E FN AL+ KQ+WR+LS+P SL+A   +S YF   + L+
Subjt:  LAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNILS

Query:  AEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERF------------ALEKVSFFISPSG-SWNQDLLRIAVAPEDV
        A +G  PS++W+S+   +E+L  G R  V NG++I  +   W+      KP  +  R             ++ KVS  I  SG  W +D++ +     + 
Subjt:  AEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERF------------ALEKVSFFISPSG-SWNQDLLRIAVAPEDV

Query:  QIIRRI-PIDLSKEDKYIWHYDKCGNYSVKSGY----NIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCV
        ++I  + P      D Y W Y   G+Y+VKSGY     I  K       S  S+  ++QK+WK     K++HF W+ +++ +P    +  R +  +  C+
Subjt:  QIIRRI-PIDLSKEDKYIWHYDKCGNYSVKSGY----NIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCV

Query:  RCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDF-NHCFADRWFVLNSCLNLEDLRK----VAVTCWAIWSDRNS-HFHNKPIPPSLIRSNWIKKYTT
        RC    ET +H+L  C  A+  W+++     L  ++ +  + + ++V N         K    V    W +W +RN   F  +      +          
Subjt:  RCGENLETTDHILISCPKAQEVWSLTFTHDVLLQDF-NHCFADRWFVLNSCLNLEDLRK----VAVTCWAIWSDRNS-HFHNKPIPPSLIRSNWIKKYTT

Query:  DYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDI--VGASAIYDDLDYPSPMAELKAILEGLIFA----ERRGL
          ++  +    ++  + R+   +W PPP   +K N DA W       GIG + RNE G++  +GA A+      P   + L+A LE + +A     R   
Subjt:  DYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLGFPDTGIGIICRNEVGDI--VGASAIYDDLDYPSPMAELKAILEGLIFA----ERRGL

Query:  RQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKN
          ++ ESD      +++ D E+   L   +  ++ L   F ++K  +  R  N +A+ +A+ + S  N
Subjt:  RQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAIAKWARSEKN

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.1e-3239.87Show/hide
Query:  LAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPK-SLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNIL
        +A+P Y MS F++ K LC ++T   ++FWW S   KRK+ W +W+K+   K   GGL FRD+  FN+AL+AKQ +RI+ +P++L++  +RS YF   +++
Subjt:  LAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPK-SLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMRSIYFKGGNIL

Query:  SAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPI
           +G  PSY WRS++ GRELL  G+   + +G +   + D WI  E    P+
Subjt:  SAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGATTTTCCAGAGACGACAGCCCATGTGGATCTACAACGGACGACCTCCAGAAAGATTCAACACCACCTTTACAGAAGAAGATGAATTGGAAACGCAGAGCTCG
TTTGGGGCAATCAAGTGAACCGGTGCAAAGGCTAACAATTGGGGATTCAATTCCAGGGAAAAGAAAAGTCAGAGAGGAGAATCCTAGTGGTGGAAGGAGATCCAAAAGCG
ATGGAGCAGTTGATATGGATGTTGCACTTGACAAGAAATGGAGATTCACTGGCATCTATGGCTTTGCTGATACAAACCAGAAGAAAAAGACCTGGGATTTGTTACGACTG
TTGGATAATTCGGGTAATGATCCATGGCTTCTTGGTGGGGATTTCAACGAAATCCTTGAAGATGCAGAAAAAGAAGGGGGTCCAAGCAGAAACAGGCATCTTATGGATAA
CTTTAAACAAACTATTAATGATTGCAGGCTGAAAGATCTAGATTTTATTGGTGAAAAATTCACATGGCACAGTACTAGACGAGGAACTTTCATTCAAGAACGGCTGGATA
GATTTATCTGTAATAATTCATTTGATTCTTTGTTTTTCAGGACAGATGTGTACCATTTGGACTGGTTATTTTCTGACCACAGACCGATTGAAATCCATCTTCTTAGTTCG
CTCCCCAGAAGAAAAAGGGAGTTCTCTAAACCCTTTAGATTTGAAGAATCTTGGATTAGACATAATGAATGTTCTCATCTTATTGAAAATTCTGGGTTCTGGGGAGATTC
CTCTAGTTCTTCTCTGTCAAAAGACTTGAACTCATGTGCATCTGTGTTAAAAAAATGGGGTAAAGAAGTTTACATTAATCGTAAAAACAAAATTACTGCCTGTAAGAATG
CTCTTAAAGAGGCTTATGAACATGGTCTGACCATTAACTTTGAGGCAATCCATAATCTAGAGTTTGAACTAGATAATCTATTAGAGGAGGAAGAAGTATATTGGAAGCAA
AGATCCCGTGAAGAATGGCTTAAATGGGGTGATAAAAACACCAGGTGCTTTCACAAGAAAGCTTCCATTCGTAATAAAGTCAATGAAATTAAAGGCATTTCTGATAACAA
TGGAAATTGGTTATGTGATCCTATGGAGATTGAAAGCACCTTTATCTCATATTTCAAAGATATTTTCTCTTCCTCTAACCCCCTGGACTCTCAAATTGATTTGGCTTTAA
GATTTATTCAACCTAGAGTCTCAGATAGGATGAATCGTAGTCTGCTTATGCCTTTTACGAGAGAAGAAGTAGAAAGATCCATAAAACAAATGTTCCCCACGAAATCTCCA
GGCCCAGATGGATTTCCAGCCCTCTTTTACCAGAATTATTGGAGTATTGTAGGTGACAAAACATTATCTGAGTGCCTTGATATTTTAAATAAGGGTAAAAGCATTGAAGA
TTGGAACCAAACTAATATAGCTCTAATTCCAAAGATCAATAACCCTAAGAAAGTAAGTGATTATAGGCCCATAAGTCTTTGTAATGTCAATTATAAAATAGTATCAAAGA
CCATAGCCAACCGTCTTAAAGGGGTATTGAAAGACATTATTTCTCCATCTCAATCTGCTTTTATTCAGGGGAGGCTGATTTCTGACAATATTATGATAGGTCATGAGTGC
CTTCATGCTATTAAGTATAATAGAACTACTATAAAAAACATGGCTGCTATTAAACTTGACTTAAGCAAGGCATTTGACAGAGTTGAATGGCCTTTCCTCAAGAAAATCAT
GACTAAATTAGGATTTGATTCAAATTGGACAAACCTGATTTTGAACTGCATTTCAACTGCTAAATTTTCGGTCTTTATTAATAAGGAATCAAAAGGCAATTTTAAAGCTA
GTAGGGGTCTTAGGCAAGGTGACCCATTGTCCCCTTATTTGTTCTTACTAATTGCTGAGGGCTTATCTAGTTTAATTTCAGGTGCTATTGCGGAAGGTAAGCTTTCCAGT
GTATCTTGTTCTATTTCAGGTCCTTATATTTCTCATCTCCTTTTTGCAGATGACAGTTTAGTCTTTTGTAAAGCCAATGAGCTGGAGATTCTGCACATCAAAGAACTTCT
AAAGATCTATGAAAAGGCCTCTGGGGAAAGCATTAACTTTGCAAAATCATCTATGGTTTTTTCAAGCAAGGTCCACCCTGACAGAAGAGGTTTCCTGAGTAGCCTTCTCG
GTGTGGAGTGTGTTGATGACCTTGGATTCTACTTGGGGATGCCGTCCTCTTTTTCGAATAGCAAGACGAAAGACTTCAGCTTTGTTGTGAACAAATTAGGCAAGACCCTT
CAAGGATGTAAGCGTTCCTTGTTTTCAGTGGCTGGGAAGGAGATTCTAATCAAGAGTGTTGGCTTGGCGATCCCTTCTTACATCATGAGCGTCTTCAAAATTCCCAAACG
CTTATGTGACGAGATTACTAAAACTTTTTCTCAGTTTTGGTGGGGCTCATCATCGGAAAAAAGAAAGTTGCACTGGTGTAGCTGGGAGAAGATGTCTATGCCTAAAAGTC
TTGGTGGCCTAAACTTTAGAGATGTAGAAGGATTCAATAAAGCCTTGGTAGCTAAGCAAGTGTGGAGGATACTTTCGAAGCCAAATTCATTAGTGGCTAGCTTTATGAGA
AGCATATATTTTAAGGGAGGTAATATTCTATCTGCAGAGATTGGTAAAAATCCGTCTTATTTGTGGAGAAGTTTGCTTTGGGGTAGAGAGCTTCTTTGTCATGGTATCCG
GCTCAAGGTTTGCAATGGTCAAAATATCTCCGCTTTCAATGACCCCTGGATTCCCAAAGAACATCTTTTTAAGCCAATTTGTTCAGAGGAGAGGTTTGCTCTGGAAAAGG
TATCTTTCTTCATTTCTCCTTCGGGTTCCTGGAACCAAGATCTTCTTAGGATAGCAGTTGCGCCAGAGGATGTCCAAATTATTAGAAGAATCCCTATTGATTTATCTAAG
GAAGACAAATATATTTGGCACTATGACAAATGCGGCAACTATTCAGTGAAGAGTGGCTACAACATTTTTATGAAAAATAAGATCAGTGCTGGATCTAGTAATAACTCTAT
GGCTGAGGTATGGCAGAAGCTATGGAAACTGACTGTCCCTTCTAAAGTTAAGCATTTTTGTTGGCGAGCTATTAATGACTTCATCCCATCTAAAGTTAACATTCACAGAA
GGGGTATCCAAACAGACTGTTTGTGTGTGAGATGTGGAGAGAACCTGGAGACCACAGATCATATTCTTATTAGTTGCCCGAAAGCCCAGGAAGTTTGGAGTCTTACCTTT
ACCCATGATGTCCTGCTTCAAGATTTCAACCACTGCTTTGCTGATAGATGGTTTGTCCTCAACTCTTGTTTGAATTTAGAGGATCTTAGAAAAGTTGCAGTCACTTGTTG
GGCAATTTGGTCAGACAGGAACTCTCATTTTCATAACAAGCCAATCCCTCCTAGTCTAATCAGAAGCAATTGGATTAAAAAATATACTACCGATTACCTTAAAGTAAATT
CGGGTCCTTCTTCAAGCCGTTGTCCAATCATCAGAGCTCCGATCAAGAAATGGTGCCCTCCTCCAGTGGGAAGATTGAAGATAAATGTCGACGCAGCCTGGAAATTGGGC
TTCCCTGATACTGGTATTGGTATTATCTGCAGGAATGAAGTAGGCGACATTGTTGGAGCTTCTGCGATATATGACGATCTTGACTATCCCTCTCCGATGGCTGAGTTGAA
GGCAATCTTGGAAGGCTTGATTTTCGCAGAAAGGAGAGGCTTAAGGCAAATTTTGGTTGAATCCGATTGCCTCCAGGCTACCAATTTAATTTCCATGGATGTTGAAGTTA
GAAGCGAATTGGGGACTTTAGTTGATTCCATTAGAAGCTTAGCTCTTTCTTTCCCTGATATCAAAATCTCTTATAGCCAGCGTAATTCCAACGTGGTTGCTGATGCTATC
GCCAAATGGGCTAGATCTGAAAAGAATTGTTGTGTGTGGACCCACGTTTACCCAGATTGGCTTTGTAAGTTGGTCTTTGATGACCAGATTTCTTCTGCCCATGTGGCATT
TTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGATTTTCCAGAGACGACAGCCCATGTGGATCTACAACGGACGACCTCCAGAAAGATTCAACACCACCTTTACAGAAGAAGATGAATTGGAAACGCAGAGCTCG
TTTGGGGCAATCAAGTGAACCGGTGCAAAGGCTAACAATTGGGGATTCAATTCCAGGGAAAAGAAAAGTCAGAGAGGAGAATCCTAGTGGTGGAAGGAGATCCAAAAGCG
ATGGAGCAGTTGATATGGATGTTGCACTTGACAAGAAATGGAGATTCACTGGCATCTATGGCTTTGCTGATACAAACCAGAAGAAAAAGACCTGGGATTTGTTACGACTG
TTGGATAATTCGGGTAATGATCCATGGCTTCTTGGTGGGGATTTCAACGAAATCCTTGAAGATGCAGAAAAAGAAGGGGGTCCAAGCAGAAACAGGCATCTTATGGATAA
CTTTAAACAAACTATTAATGATTGCAGGCTGAAAGATCTAGATTTTATTGGTGAAAAATTCACATGGCACAGTACTAGACGAGGAACTTTCATTCAAGAACGGCTGGATA
GATTTATCTGTAATAATTCATTTGATTCTTTGTTTTTCAGGACAGATGTGTACCATTTGGACTGGTTATTTTCTGACCACAGACCGATTGAAATCCATCTTCTTAGTTCG
CTCCCCAGAAGAAAAAGGGAGTTCTCTAAACCCTTTAGATTTGAAGAATCTTGGATTAGACATAATGAATGTTCTCATCTTATTGAAAATTCTGGGTTCTGGGGAGATTC
CTCTAGTTCTTCTCTGTCAAAAGACTTGAACTCATGTGCATCTGTGTTAAAAAAATGGGGTAAAGAAGTTTACATTAATCGTAAAAACAAAATTACTGCCTGTAAGAATG
CTCTTAAAGAGGCTTATGAACATGGTCTGACCATTAACTTTGAGGCAATCCATAATCTAGAGTTTGAACTAGATAATCTATTAGAGGAGGAAGAAGTATATTGGAAGCAA
AGATCCCGTGAAGAATGGCTTAAATGGGGTGATAAAAACACCAGGTGCTTTCACAAGAAAGCTTCCATTCGTAATAAAGTCAATGAAATTAAAGGCATTTCTGATAACAA
TGGAAATTGGTTATGTGATCCTATGGAGATTGAAAGCACCTTTATCTCATATTTCAAAGATATTTTCTCTTCCTCTAACCCCCTGGACTCTCAAATTGATTTGGCTTTAA
GATTTATTCAACCTAGAGTCTCAGATAGGATGAATCGTAGTCTGCTTATGCCTTTTACGAGAGAAGAAGTAGAAAGATCCATAAAACAAATGTTCCCCACGAAATCTCCA
GGCCCAGATGGATTTCCAGCCCTCTTTTACCAGAATTATTGGAGTATTGTAGGTGACAAAACATTATCTGAGTGCCTTGATATTTTAAATAAGGGTAAAAGCATTGAAGA
TTGGAACCAAACTAATATAGCTCTAATTCCAAAGATCAATAACCCTAAGAAAGTAAGTGATTATAGGCCCATAAGTCTTTGTAATGTCAATTATAAAATAGTATCAAAGA
CCATAGCCAACCGTCTTAAAGGGGTATTGAAAGACATTATTTCTCCATCTCAATCTGCTTTTATTCAGGGGAGGCTGATTTCTGACAATATTATGATAGGTCATGAGTGC
CTTCATGCTATTAAGTATAATAGAACTACTATAAAAAACATGGCTGCTATTAAACTTGACTTAAGCAAGGCATTTGACAGAGTTGAATGGCCTTTCCTCAAGAAAATCAT
GACTAAATTAGGATTTGATTCAAATTGGACAAACCTGATTTTGAACTGCATTTCAACTGCTAAATTTTCGGTCTTTATTAATAAGGAATCAAAAGGCAATTTTAAAGCTA
GTAGGGGTCTTAGGCAAGGTGACCCATTGTCCCCTTATTTGTTCTTACTAATTGCTGAGGGCTTATCTAGTTTAATTTCAGGTGCTATTGCGGAAGGTAAGCTTTCCAGT
GTATCTTGTTCTATTTCAGGTCCTTATATTTCTCATCTCCTTTTTGCAGATGACAGTTTAGTCTTTTGTAAAGCCAATGAGCTGGAGATTCTGCACATCAAAGAACTTCT
AAAGATCTATGAAAAGGCCTCTGGGGAAAGCATTAACTTTGCAAAATCATCTATGGTTTTTTCAAGCAAGGTCCACCCTGACAGAAGAGGTTTCCTGAGTAGCCTTCTCG
GTGTGGAGTGTGTTGATGACCTTGGATTCTACTTGGGGATGCCGTCCTCTTTTTCGAATAGCAAGACGAAAGACTTCAGCTTTGTTGTGAACAAATTAGGCAAGACCCTT
CAAGGATGTAAGCGTTCCTTGTTTTCAGTGGCTGGGAAGGAGATTCTAATCAAGAGTGTTGGCTTGGCGATCCCTTCTTACATCATGAGCGTCTTCAAAATTCCCAAACG
CTTATGTGACGAGATTACTAAAACTTTTTCTCAGTTTTGGTGGGGCTCATCATCGGAAAAAAGAAAGTTGCACTGGTGTAGCTGGGAGAAGATGTCTATGCCTAAAAGTC
TTGGTGGCCTAAACTTTAGAGATGTAGAAGGATTCAATAAAGCCTTGGTAGCTAAGCAAGTGTGGAGGATACTTTCGAAGCCAAATTCATTAGTGGCTAGCTTTATGAGA
AGCATATATTTTAAGGGAGGTAATATTCTATCTGCAGAGATTGGTAAAAATCCGTCTTATTTGTGGAGAAGTTTGCTTTGGGGTAGAGAGCTTCTTTGTCATGGTATCCG
GCTCAAGGTTTGCAATGGTCAAAATATCTCCGCTTTCAATGACCCCTGGATTCCCAAAGAACATCTTTTTAAGCCAATTTGTTCAGAGGAGAGGTTTGCTCTGGAAAAGG
TATCTTTCTTCATTTCTCCTTCGGGTTCCTGGAACCAAGATCTTCTTAGGATAGCAGTTGCGCCAGAGGATGTCCAAATTATTAGAAGAATCCCTATTGATTTATCTAAG
GAAGACAAATATATTTGGCACTATGACAAATGCGGCAACTATTCAGTGAAGAGTGGCTACAACATTTTTATGAAAAATAAGATCAGTGCTGGATCTAGTAATAACTCTAT
GGCTGAGGTATGGCAGAAGCTATGGAAACTGACTGTCCCTTCTAAAGTTAAGCATTTTTGTTGGCGAGCTATTAATGACTTCATCCCATCTAAAGTTAACATTCACAGAA
GGGGTATCCAAACAGACTGTTTGTGTGTGAGATGTGGAGAGAACCTGGAGACCACAGATCATATTCTTATTAGTTGCCCGAAAGCCCAGGAAGTTTGGAGTCTTACCTTT
ACCCATGATGTCCTGCTTCAAGATTTCAACCACTGCTTTGCTGATAGATGGTTTGTCCTCAACTCTTGTTTGAATTTAGAGGATCTTAGAAAAGTTGCAGTCACTTGTTG
GGCAATTTGGTCAGACAGGAACTCTCATTTTCATAACAAGCCAATCCCTCCTAGTCTAATCAGAAGCAATTGGATTAAAAAATATACTACCGATTACCTTAAAGTAAATT
CGGGTCCTTCTTCAAGCCGTTGTCCAATCATCAGAGCTCCGATCAAGAAATGGTGCCCTCCTCCAGTGGGAAGATTGAAGATAAATGTCGACGCAGCCTGGAAATTGGGC
TTCCCTGATACTGGTATTGGTATTATCTGCAGGAATGAAGTAGGCGACATTGTTGGAGCTTCTGCGATATATGACGATCTTGACTATCCCTCTCCGATGGCTGAGTTGAA
GGCAATCTTGGAAGGCTTGATTTTCGCAGAAAGGAGAGGCTTAAGGCAAATTTTGGTTGAATCCGATTGCCTCCAGGCTACCAATTTAATTTCCATGGATGTTGAAGTTA
GAAGCGAATTGGGGACTTTAGTTGATTCCATTAGAAGCTTAGCTCTTTCTTTCCCTGATATCAAAATCTCTTATAGCCAGCGTAATTCCAACGTGGTTGCTGATGCTATC
GCCAAATGGGCTAGATCTGAAAAGAATTGTTGTGTGTGGACCCACGTTTACCCAGATTGGCTTTGTAAGTTGGTCTTTGATGACCAGATTTCTTCTGCCCATGTGGCATT
TTAA
Protein sequenceShow/hide protein sequence
MERFSRDDSPCGSTTDDLQKDSTPPLQKKMNWKRRARLGQSSEPVQRLTIGDSIPGKRKVREENPSGGRRSKSDGAVDMDVALDKKWRFTGIYGFADTNQKKKTWDLLRL
LDNSGNDPWLLGGDFNEILEDAEKEGGPSRNRHLMDNFKQTINDCRLKDLDFIGEKFTWHSTRRGTFIQERLDRFICNNSFDSLFFRTDVYHLDWLFSDHRPIEIHLLSS
LPRRKREFSKPFRFEESWIRHNECSHLIENSGFWGDSSSSSLSKDLNSCASVLKKWGKEVYINRKNKITACKNALKEAYEHGLTINFEAIHNLEFELDNLLEEEEVYWKQ
RSREEWLKWGDKNTRCFHKKASIRNKVNEIKGISDNNGNWLCDPMEIESTFISYFKDIFSSSNPLDSQIDLALRFIQPRVSDRMNRSLLMPFTREEVERSIKQMFPTKSP
GPDGFPALFYQNYWSIVGDKTLSECLDILNKGKSIEDWNQTNIALIPKINNPKKVSDYRPISLCNVNYKIVSKTIANRLKGVLKDIISPSQSAFIQGRLISDNIMIGHEC
LHAIKYNRTTIKNMAAIKLDLSKAFDRVEWPFLKKIMTKLGFDSNWTNLILNCISTAKFSVFINKESKGNFKASRGLRQGDPLSPYLFLLIAEGLSSLISGAIAEGKLSS
VSCSISGPYISHLLFADDSLVFCKANELEILHIKELLKIYEKASGESINFAKSSMVFSSKVHPDRRGFLSSLLGVECVDDLGFYLGMPSSFSNSKTKDFSFVVNKLGKTL
QGCKRSLFSVAGKEILIKSVGLAIPSYIMSVFKIPKRLCDEITKTFSQFWWGSSSEKRKLHWCSWEKMSMPKSLGGLNFRDVEGFNKALVAKQVWRILSKPNSLVASFMR
SIYFKGGNILSAEIGKNPSYLWRSLLWGRELLCHGIRLKVCNGQNISAFNDPWIPKEHLFKPICSEERFALEKVSFFISPSGSWNQDLLRIAVAPEDVQIIRRIPIDLSK
EDKYIWHYDKCGNYSVKSGYNIFMKNKISAGSSNNSMAEVWQKLWKLTVPSKVKHFCWRAINDFIPSKVNIHRRGIQTDCLCVRCGENLETTDHILISCPKAQEVWSLTF
THDVLLQDFNHCFADRWFVLNSCLNLEDLRKVAVTCWAIWSDRNSHFHNKPIPPSLIRSNWIKKYTTDYLKVNSGPSSSRCPIIRAPIKKWCPPPVGRLKINVDAAWKLG
FPDTGIGIICRNEVGDIVGASAIYDDLDYPSPMAELKAILEGLIFAERRGLRQILVESDCLQATNLISMDVEVRSELGTLVDSIRSLALSFPDIKISYSQRNSNVVADAI
AKWARSEKNCCVWTHVYPDWLCKLVFDDQISSAHVAF