CuGenDBv2

Lag0008712 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008712
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:28497194..28502932
RNA-Seq Expression: Lag0008712
Synteny: Lag0008712
Gene Ontology terms:
    GO:0003676 - nucleic acid binding (molecular function)
    GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
    IPR000477 - Reverse transcriptase domain
    IPR002156 - Ribonuclease H domain
    IPR012337 - Ribonuclease H-like superfamily
    IPR036397 - Ribonuclease H superfamily
    IPR043502 - DNA/RNA polymerase superfamily
    IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits    e value    % identity    Alignment
XP_006491472.1 uncharacterized protein LOC102626455 [Citrus sinensis]    4.0e-173    34.41
Query:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK
        ++  +L +EE YWK R+R +WLK GDKNTK+FHSKA+ R+++N   G+ + +  WV+  + I      +F+ L  SS P+   I    + +  K+S E  
Subjt:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK

Query:  IKLDAPFSKDELEKAIKGM-------------------------------LGYSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL
          L+ PF+ +++ +A+  M                               L     QG   +      ALIPK ++P+++ EFRPISLCNV+Y+++AK +
Subjt:  IKLDAPFSKDELEKAIKGM-------------------------------LGYSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL

Query:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING
        ANR+K +L+ IIS +QS+F+P R I+DN ++G+EC+H +   +  + G VALKLD+SKAYDRVEW F+E+ M  +GFS+ W   +M CI++  FS+LING
Subjt:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING

Query:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF
         P    KP RGLRQG PLSPYLF+LCAE FS LL + E    + G K        THL FADDSLVF KA   +     G+   Y +ASGQ  N +KSS 
Subjt:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF

Query:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL
          S   + E I+  + I  +K       YLG+P   G+NK   FK +K +V   +  W   LFS GGKE+LIKAVAQA+P Y MS F+LP  +C    + 
Subjt:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL

Query:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG
         ++FWWG+   K   HW  W  M + K +GG+GFR L +F QA++AKQ WRL+R PNSL+ ++++ RY+K   F  A +G++ S  WR I+WG  +  KG
Subjt:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG

Query:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLAT
         RWR+G+G ++ + KD WI R    + +S         V DL+D  NKW+   + + F  +D   IL +   S    DE++WH +KKG +SVKS Y+LA 
Subjt:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLAT

Query:  REQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDW------LP------------------------------------------------------K
         +   NE   SN+S  +  W   W  ++  + K+  W      LP                                                      +
Subjt:  REQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDW------LP------------------------------------------------------K

Query:  DYWG-----WMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDAAWI
        D++      W + S ++ EL   I+  W +W  RN+  +     + ++ + +  ++ L  ++      +             W+ P  N  KLN DAA  
Subjt:  DYWG-----WMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDAAWI

Query:  EKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRL-GITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSM
         K+ + GLG +VRD+ G ++  G+KQ      +   EA A+  G++      N++   +L VESD  EVV +LN      +++     +++  +     +
Subjt:  EKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRL-GITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSM

Query:  RFLHYNCLLNTDAHCVARSA
        +F       NT AH +A+ A
Subjt:  RFLHYNCLLNTDAHCVARSA

XP_023878301.1 uncharacterized protein LOC111990748 [Quercus suber]    7.5e-180    38.36
Query:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK
        E++ LL++EE+YW  RA+  WLK GD+NTK+FH++A++R+K+N   GI++ + +W +  + I  AA  YF ++  SS P+   I  VTEAI  K++ E  
Subjt:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK

Query:  IKLDAPFSKDELEKAIK-----------GMLG-----YSQRQGRGRAD----------------KQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKV
          L   F+K+E+  A+K           GM       Y    G    D                K N+ +LIPK+  PKRM++FRPISLCNV+YK+I+K+
Subjt:  IKLDAPFSKDELEKAIK-----------GMLG-----YSQRQGRGRAD----------------KQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKV

Query:  LANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILIN
        LANR+K +L  IIS +QS+F   R I+DN LV FE +H L +K  GK G++A+KLDMSKA+DRVEW FI + M++MGF + W   VM+CI+SVS+SILIN
Subjt:  LANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILIN

Query:  GEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSS
        G       PSRGLRQGDPLSP LFLLCAEG S L+ +      ++G  IN  CP  THLFFADDS++F KA          +L  YEEASGQ IN DKSS
Subjt:  GEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSS

Query:  FMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDR
           S N  +ET  +   ILG    +    YLG+PS  G++K  VF  +K++V   L GWK  L S+GGKE+LIKAVAQAIP YTMSCF LP  +C   +R
Subjt:  FMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDR

Query:  LCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLK
        +   FWWG    +TK  W+SWK+MC +K+ GG+GFR L AF  AMLAKQ+WR++ NPNSL+ ++L+ RYF   + L A LG+S S +WR I     +  +
Subjt:  LCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLK

Query:  GFRWRVGNGVQISIDKDPWISREGNPRVLSTH-NSFKGLRVRDLLDDNNK-WKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYR
        G RWRVGNG QI I +D W+      +V+S   ++F+   V  L+D + K WK   +  +F   +   IL +P       D++IW  NKKG FSVKSAY 
Subjt:  GFRWRVGNGVQISIDKDPWISREGNPRVLSTH-NSFKGLRVRDLLDDNNK-WKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYR

Query:  LATREQSHNEASQ-SNASKEASFWNSIWKANVLLRTKVEDW--------------------------------------------------------LPK
        +A      NE  + SN       W  +W  N+  + K+  W                                                         P+
Subjt:  LATREQSHNEASQ-SNASKEASFWNSIWKANVLLRTKVEDW--------------------------------------------------------LPK

Query:  DYWGWMKD-------SLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKI-IQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDA
         + G   D       S + + L    +L W +W  RN+  +++ P + +++ + +N   +L +++ +      P  PR   S + WE PP   +K+N D 
Subjt:  DYWGWMKD-------SLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKI-IQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDA

Query:  AWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAP
        A  ++     +G ++RDSNG ++    K +   ++   +EA A+ +GI    D   +L   + +E DAL V+  LN  S   ++L      I+S++    
Subjt:  AWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAP

Query:  SMRFLHYNCLLNTDAHCVARSA
           F H N   N  AH +A+ A
Subjt:  SMRFLHYNCLLNTDAHCVARSA

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]    1.4e-178    37.02
Query:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK
        E+++LL+ EE  W+ R+R +WL  GD+NTK+FH+KA+ R++RN   GI +    W ++++ I   A  YF+++  SS+P    I  V +AI   ++ E  
Subjt:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK

Query:  IKLDAPFSKDELEKAIKGM----------------------LG----------YSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKV
          L   F+++E+E A+  M                      +G           +        +K N+  L+PK K P +MS+FRPISLCNV+YK+I+KV
Subjt:  IKLDAPFSKDELEKAIKGM----------------------LG----------YSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKV

Query:  LANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILIN
        LANR+K +L  IIS +QS+F+ GR I+DN LV FE +H L +K++GK G+ A+KLDMSKAYDRVEW FI++ M+KMGF   W + VM CI+SVS+SIL+N
Subjt:  LANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILIN

Query:  GEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSS
        G       P+RGLRQGDP+SPY+FLLCA+GFS+LL        +SG  I   CP  THLFFADDSL+F KA  +   +   +L+LYE+ASGQ IN+DKSS
Subjt:  GEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSS

Query:  FMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDR
           S N  +E   +  R+LG         YLG+PS  GK+K  +F  +K+RVE+ L GWKE L S+GG+E+LIKAVAQAIP YTMSCF++P  +C   + 
Subjt:  FMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDR

Query:  LCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLK
        +  +FWWG  G ++K  W+SWKK+C+ K  GGMGFR L AF  AMLAKQ WRLI NPNSL+ +I + RY+   +  +A LG S S TWR I  G  +  +
Subjt:  LCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLK

Query:  GFRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGL-RVRDLLD-DNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYR
        G RWRVGNG +I I +D W+      +V+S    F    RV  L+D +  +WK+  + ++F   +A  IL++P       D+IIW  N+KG FSVKSAY 
Subjt:  GFRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGL-RVRDLLD-DNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYR

Query:  LATREQSHNEASQSNASKEAS-FWNSIWKANVLLRTKVEDW------------------------------------------LPKDYWG-WMK------
        +A     + E  +S++    S  W  +W  N+  + ++  W                                          + K  W  W+       
Subjt:  LATREQSHNEASQSNASKEAS-FWNSIWKANVLLRTKVEDW------------------------------------------LPKDYWG-WMK------

Query:  --------------DSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDAA
                      D  +  +L    ++ W +W  RN+  +      +   +   I    I++   +    +      P S   W  PPP  +K+N D A
Subjt:  --------------DSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDAA

Query:  WIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPS
          E      +G ++RD+ G +       +   +S++ +EA AM  G+    +   +L   + +ESDAL VV+ +N  ++    L      I SL +    
Subjt:  WIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPS

Query:  MRFLHYNCLLNTDAHCVARSA
         +  H     N  AH +A+ A
Subjt:  MRFLHYNCLLNTDAHCVARSA

XP_030939975.1 uncharacterized protein LOC115964883 [Quercus lobata]    9.9e-172    36.32
Query:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK
        E+ +L+ +EE  W  R+R +WLKSGD NT +FHS+ATQR KRN    +  +    V   K+IG A  DYFK +  S+MP+N     + + I  K++    
Subjt:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK

Query:  IKLDAPFSKDELEKAIK--------GMLGYSQ----------------------RQGRGRADKQNLQ-ALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL
          L   F+ DE+E A+K        G+ G S                         G   A   +   +LIPK K P++ ++FRPISLCNV+YK+++K +
Subjt:  IKLDAPFSKDELEKAIK--------GMLGYSQ----------------------RQGRGRADKQNLQ-ALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL

Query:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING
        ANR+K++L  ++S SQS+F+  R ISDN LV FE +H L  K KGK+G++A+KLDMSKAYDRVEW F+E+ M+K+GF + W   V  CI SVSFS+L+NG
Subjt:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING

Query:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF
        EP   F P+RGLRQGDPLSPYLFLLCAEG  +L+++ E    + G  + +  P  +HLFFADDSL+F +A  +   S   +LK YEEASGQ IN +K+  
Subjt:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF

Query:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL
          S N +     + + +LG+ +T +   YLG+PS  G+ K   F  I++R+   +QGWKE L S GG+E+LIKAV QA+P +TM CF++P ++C   + L
Subjt:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL

Query:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG
          KFWWG  G   K HW+ WKK+C++KS GG+GF+ +  F  AML KQ WRLI N +SL +K+ + ++F   + L+  +  + S  W+ I+  RG+   G
Subjt:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG

Query:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSF-KGLRVRDLLDDNNK-WKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL
         +WR+G+G  + I  D W+    + RV+S   +F    RV  L+D+ N+ W E  I E F   +A  IL++P       D +IW     G ++ KSAYRL
Subjt:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSF-KGLRVRDLLDDNNK-WKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL

Query:  ATREQSHNEASQSNASKEASFWNSIW----------------------KANVLLRTKVEDWLPKDYWGWMKDSL--------------------------
          +         SN + +  FW  +W                      K N+L R  ++D   +   G ++D +                          
Subjt:  ATREQSHNEASQSNASKEASFWNSIW----------------------KANVLLRTKVEDWLPKDYWGWMKDSL--------------------------

Query:  --SKEELAKSII-------------LMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASH-VHWEKPPPNSWKLNFDAAW
          S  +L + I+             + W +W  RN     +    T KI +  +E  L E+ S          P+ P  H  HW  P P+ +K+NFD A 
Subjt:  --SKEELAKSII-------------LMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASH-VHWEKPPPNSWKLNFDAAW

Query:  IEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGI-TLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPS
               GLG ++RDS G +I    ++I    ++  LEA A    I  VF     LG+  +  E D+  V  +L      ++      DE +SLAA   S
Subjt:  IEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGI-TLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPS

Query:  MRFLH
          F H
Subjt:  MRFLH

XP_030946812.1 uncharacterized protein LOC115971195 [Quercus lobata]    1.4e-173    36.26
Query:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK
        E+ +L+ +EE  W  R+R +WLKSGD NT +FHS+ATQR KRN    +  +    V   K+IG A  +YFK +  S+MP+N     + + I  K++    
Subjt:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK

Query:  IKLDAPFSKDELEKAIKGMLGYSQRQGRGRA-----------------------DKQNLQA--------LIPKSKEPKRMSEFRPISLCNVIYKVIAKVL
          L   F+ DE+E A+K M   +     G +                       +  N+ A        LIPK K P++ ++FRPISLCNV+YK+++K +
Subjt:  IKLDAPFSKDELEKAIKGMLGYSQRQGRGRA-----------------------DKQNLQA--------LIPKSKEPKRMSEFRPISLCNVIYKVIAKVL

Query:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING
        ANR+K++L  ++S SQS+F+  R ISDN LV FE +H L  K KGK G++A+KLDMSKAYDRVEW F+E+ M+K+GF + W   V  CI SVSFS+L+NG
Subjt:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING

Query:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF
        EP   F P+RGLRQGDPLSPYLFLLCAEG  +L+++ E    + G  + +  P  +HLFFADDSL+F +A  + + S   +LK YEEASGQ IN +K+  
Subjt:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF

Query:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL
          S N +     + + +LG+ +T +   YLG+PS  G+ K   F  I++RV + +QGWKE L S GG+E+LIKAV QA+P +TM CF+LP ++C   + L
Subjt:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL

Query:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG
          KFWWG  G   K HW+ WKK+C++KS+GG+GF+ +  F  AML KQ WRLI N +SL +K+ + +YF   + L+  +  + S  W+ I+  RG+   G
Subjt:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG

Query:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSF-KGLRVRDLLDDNNK-WKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL
         +WR+G+G  + I  D W+    + RV+S   +F    RV  L+D+ N+ W E  I E F   +A  IL++P       D +IW     G ++ KSAYRL
Subjt:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSF-KGLRVRDLLDDNNK-WKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL

Query:  ATREQSHNEASQSNASKEASFWNSIWKANV------LLRTKVEDWLP------------------------------------KDYWGW----MKDSLSK
          +       S SN++ E  FW  +W  NV       L     D LP                                    K  W W     KD L++
Subjt:  ATREQSHNEASQSNASKEASFWNSIWKANV------LLRTKVEDWLP------------------------------------KDYWGW----MKDSLSK

Query:  -----EELAKSII-------------LMWKLWEFRNRAEYHNHPAATAKIIQSNIEA-----SLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLN
              +L + I+             + W +W  RN     +H     KI +  +E      S+ E + + L  H P          HW    P+ +K+N
Subjt:  -----EELAKSII-------------LMWKLWEFRNRAEYHNHPAATAKIIQSNIEA-----SLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLN

Query:  FDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGI-TLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLA
        FD A        GLG +VRDS G +I    ++I    ++  LEA A    I  VF     LG+  +  E D+  +  +L      +S      +E +SL+
Subjt:  FDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGI-TLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLA

Query:  ARAPSMRFLHYNCLLNTDAHCVARSA
        +   S  F H     N  A  +A+ A
Subjt:  ARAPSMRFLHYNCLLNTDAHCVARSA

TrEMBL top hits    e value    % identity    Alignment
A0A2N9GPY1 Reverse transcriptase domain-containing protein    2.5e-173    35.1
Query:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK
        +L  LL ++E  W+  +R EWL+ GD+NT++FHSKATQR++RN    + ++   W     ++     +Y+ SL  ++ P +  +  VT+ I   ++ E  
Subjt:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK

Query:  IKLDAPFSKDELEKAIKGMLGYSQRQGRG-------------------------------RADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL
          L   F+ DE+ KA+K M         G                               +A       LIPK K P+ + EFRPISLCNVIYK+++KVL
Subjt:  IKLDAPFSKDELEKAIKGMLGYSQRQGRG-------------------------------RADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL

Query:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING
         NR+K +L  I+S SQS+FVPGR I+DN LV FE +H + +++KGK G +ALKLDMSKAYDRVEW +++  M+++GF S W   +M+CI++VS+SIL+NG
Subjt:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING

Query:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF
        EP    KPSRGLRQGD LSPYLFL CAEGF +L++R +    L G  I+   P  THLFFADDSL+F KA  +++     +L +YE ASGQ IN  K++ 
Subjt:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF

Query:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL
          S++  +        +LG+ +      YLG+PS  G+ K   F +IK+RV   L+GWKE L S  G+E+LIKAVAQAIP Y MSCFRLP  +    + L
Subjt:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL

Query:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG
          +FWWG SG + K HW+ W  +C++K  GG+G R L  F +A+LAKQ WRL+ N +SL +++ + +YF   + LEA      SL W+ I+    L  KG
Subjt:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG

Query:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGLR-VRDLLDD-NNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL
          WRVG+G +I I +D W+    +  +LS       +  VR L+D+ +N WKE  +  +F   +A  IL +P  SK  +D  +W   K G++SV+S Y  
Subjt:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGLR-VRDLLDD-NNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL

Query:  ATREQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDW------LPK------------------DYWG-------WMKDS------------------
           E  H+    SN S EA  WN+IW   +  + +   W      LP                   +YW        W  ++                  
Subjt:  ATREQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDW------LPK------------------DYWG-------WMKDS------------------

Query:  --------------LSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDAAWI
                      LS  EL       W +W  +N    +  PA   +++    +  L E++++     S +G   P + V W+ PPP  +K N+D A+ 
Subjt:  --------------LSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKLNFDAAWI

Query:  EKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGI-TLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSM
        E     G+G ++R+  G+++    ++I    S++ +EA A     +   +  N LG+  +++E D+  +V  L       +      ++ + +A  + S+
Subjt:  EKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGI-TLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSM

Query:  RFLHYNCLLNTDAHCVARSA
         FLH     N  AH +A+ A
Subjt:  RFLHYNCLLNTDAHCVARSA

A0A803P4U9 Uncharacterized protein    2.4e-179    36.91
Query:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK
        +L+ + E+ E YWK R+R  WLK GD+NTK+FH KA+QRK++N  +G+++ R  W  + ++I   A +YF++L   S    E   ++   +  +IS+E+ 
Subjt:  ELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQK

Query:  IKLDAPFSKDELEKAI-----------KGMLGYSQRQG----------------RGRAD----KQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL
          L  PF + E+  A+            G+ G   ++                   +AD     + L  LIPK+K P ++SEFRPISLCNV+YKV++K L
Subjt:  IKLDAPFSKDELEKAI-----------KGMLGYSQRQG----------------RGRAD----KQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVL

Query:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING
        ANRMK  L+  IS +QS+F+ GR I DNA++GFE +H +   R G    +ALKLDMSKAYDRVEW F+E  M  +G+   W  KVM C+ SVSFSILING
Subjt:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING

Query:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF
          Q  F P RGLRQGDPLSP+LFLLC+EG + LL   E    + G +  N   N +HL FADDSLVFL A      +   VL  Y   SGQ INLDKS  
Subjt:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSF

Query:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL
           R +N+E+       LG++   +   YLGMP+  GKNK  +F +I+DRVE  LQGWK  LFS  GKE+LIKAV QA+P Y MSCFR+   I    + +
Subjt:  MASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRL

Query:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG
         ++FWWG +  K K HW SW+KMC+ K  GGMGFR L  F QA+LAKQ W+++ NP+ LL ++L+  YF   NF+EA LG+ SS  WRGI+WGR L LKG
Subjt:  CSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKG

Query:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSF-KGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLA
        +RW +GNG  + I++DPWI R G P  L T     K + V+ L++ N +WK   I   F+ +D   +L + T  ++ ND I W L   G+++V S Y+L 
Subjt:  FRWRVGNGVQISIDKDPWISREGNPRVLSTHNSF-KGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLA

Query:  TREQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDWLPKDYWGWMKDSLSKEELAKSII---------------------------------------
         R+ +  E S ++ S+  ++W  +W + +  + K+  W    +W  +K  L+K  ++ ++                                        
Subjt:  TREQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDWLPKDYWGWMKDSLSKEELAKSII---------------------------------------

Query:  --------------------LMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASH-----VHWEKPPPNSWKLNFDAAWI
                            + W +W  RN+  + N      K I+ +I    I W +  ++ H  +   +   +     V W  PPP ++ +N DA+ I
Subjt:  --------------------LMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSYLKTHSPNGPRSPASH-----VHWEKPPPNSWKLNFDAAWI

Query:  EKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSMR
        E +   GLG ++RD  G+L+    + I    S+   EA A+   +K       RL   + + SD+  V+  L G +   +D     D+    + +  ++ 
Subjt:  EKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSMR

Query:  FLHYNCLLNTDAHCVA
        F+      N+ AHC+A
Subjt:  FLHYNCLLNTDAHCVA

A0A803QC75 Uncharacterized protein    4.8e-180    36.3
Query:  LDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQKI
        L+ LLE+EE YW+ R+R +WL  GD+NTK+FH+KA+ RK  N  K + NS  + V +  +I      ++  L  S+  + E +      I   +SAE   
Subjt:  LDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQKI

Query:  KLDAPFSKDELEKAIKGM-------------------------------LGYSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVLA
         L  PF+ DE+  A+  M                               L          A  + L  LIPK K+P+ + E+RPISLCNVI K++ KVL 
Subjt:  KLDAPFSKDELEKAIKGM-------------------------------LGYSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVLA

Query:  NRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGE
        +R K  L  +IS  QS+F+P R I+DN LV FE +HA+ NK  G+ G  + KLDMSKA+DRVEW FIEE M+KMGF+  W   +M C+++ +FS +INGE
Subjt:  NRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGE

Query:  PQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSFM
              PSRGLRQG PLSPYLFL+C+EGFS LL+ E+  ++L GFK+  + P  THLFFADDSL+F +A ER+  +   VL  Y +ASGQ +NLDKS   
Subjt:  PQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSFM

Query:  ASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLC
         S N          + L +        YLG+PS +G++K  +F  IK+R+ K++  W E +FS GGKE+L+KAV Q+IP Y MSCFRLP   C   + + 
Subjt:  ASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLC

Query:  SKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKGF
        + FWWG + + ++ HW SW  +C++K +GGMGFR    F QA+LAKQ+WR+   P+SLL +IL+ RYF   NFLEA LG+S SLTW+GI W R L +KG 
Subjt:  SKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKGF

Query:  RWRVGNGVQISIDKDPWISREGNPRVLSTHNS--FKGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLA
        RW+VG+G  I    DPWI   G+   L  H S    G+ V +L+ D  +W    + + FS  D   IL++P     S D +IWH +  G+++VKS Y LA
Subjt:  RWRVGNGVQISIDKDPWISREGNPRVLSTHNS--FKGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLA

Query:  TREQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDW------LP------------------------------------------------------
              +++S SN  +++ +W   W   +  + K+  W      LP                                                      
Subjt:  TREQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDW------LP------------------------------------------------------

Query:  ---KDYWGWMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESS--------------------------YLKTHSPNGPR
            DY   +    SK E+ +    +W +W  RNR   H H A  AK + S     L+ + ++                           + + SP    
Subjt:  ---KDYWGWMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESS--------------------------YLKTHSPNGPR

Query:  SPASHVHWEKPPPNSWKLNFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGV
         PA+   W  P  NS+KLN DAA    +   G+G ++RDSNG+++    KQ   ++S   +EA A+ + +  V     +L I+L VESDAL VV  L   
Subjt:  SPASHVHWEKPPPNSWKLNFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGV

Query:  SDDLSDLKVFTDEIQSLAARAPSMRFLHYNCLLNTDAHCVARSA
         + +S       ++  L +  P +   H     N  AHC+A+ A
Subjt:  SDDLSDLKVFTDEIQSLAARAPSMRFLHYNCLLNTDAHCVARSA

A0A803QH07 Uncharacterized protein    2.5e-173    35.28
Query:  LDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQKI
        LD+LL  EE YW  R+R +WL+ GDKNTK+FH+KA+ RK  N  K + N     V   + +      Y++ L  S   +++ ++ V  AI   I +    
Subjt:  LDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQKI

Query:  KLDAPFSKDELEKAIKGM----------------------LGYSQRQ--------GRGRADKQN-LQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVLA
         L APFS  E+  A++ M                      +G S  Q        G       N +  LIPK   P  M ++RPISLCNVIYK+I+K + 
Subjt:  KLDAPFSKDELEKAIKGM----------------------LGYSQRQ--------GRGRADKQN-LQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVLA

Query:  NRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGE
         R ++VL  +IS +QS+F+  R I+DN LV FE IH L +K +G+ GY ALKLDMSKA+DRVEW ++E  M KMGF+  W   +M CI++ SFS  +NGE
Subjt:  NRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGE

Query:  PQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSFM
             +P RGLRQGDPLSPYLFL+C+EG S LL+ EE+  HL G ++    P+ +HL FADDSL+F +A E++  +    L  Y +ASGQ +N DKS   
Subjt:  PQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSFM

Query:  ASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLC
         S N +          L +  T+    YLG+PS +G++K  +F  IK++V K+L  W E +FS+GGKE+L+KAV Q+IP Y MSCF+L    C+  + + 
Subjt:  ASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLC

Query:  SKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKGF
        + FWWG++ + TK HW  WK +C++K +GGMGFR    F QA+LAKQ+WR+   P+SLL ++L+ RYF   +FL+A +G+S S TW+ I WGR L +KG 
Subjt:  SKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKGF

Query:  RWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLATR
        R++VGNG  I   KDPWI    + R +S +     L V  L++DN +W  + + + F   D   IL++P     + D +IWH +  G ++VKS + LAT 
Subjt:  RWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLATR

Query:  EQSHNEASQSNASKE--ASFWN---------------------------------------------------------SIWKAN--VLLRTKVEDWLPK
         +   ++S S+A+++    FWN                                                         +IWK +  ++   K +     
Subjt:  EQSHNEASQSNASKE--ASFWN---------------------------------------------------------SIWKAN--VLLRTKVEDWLPK

Query:  DYWGWMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKII------------QSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKL
        DY  ++    ++E+    I L+W +W  RNR  +      ++ II              ++ +      +    TH  + P++   H  W  P  N +KL
Subjt:  DYWGWMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKII------------QSNIEASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSWKL

Query:  NFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLA
        N DAA    + + G+G ++R  +G ++    K +  ++    +EA A+   +  V  + ++L IT  +E+DAL V T LN    DLS       +I+ L 
Subjt:  NFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGVSDDLSDLKVFTDEIQSLA

Query:  ARAPSMRFLHYNCLLNTDAHCVARSA
        +  PS+   H     N  AH +AR A
Subjt:  ARAPSMRFLHYNCLLNTDAHCVARSA

M5VU98 Reverse transcriptase domain-containing protein    1.1e-179    37.26
Query:  LDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQKI
        LD+LL + E YW  R+RE WLK+GDKNT +FH KAT R++RN  KG+ +S   W  + + I     DYF  L  SS   +  +  +  A+  K++A+ + 
Subjt:  LDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQKI

Query:  KLDAPFSKDELEKA-------------------------------IKGMLGYSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVLA
         L A FS  E++ A                               +  +  + Q     R        LIPK KEP+ M++ RPISLCNV+Y++ AK LA
Subjt:  KLDAPFSKDELEKA-------------------------------IKGMLGYSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVLA

Query:  NRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGE
        NRMK V+ ++IS SQS+FVPGR I+DN++V FE  H L  +R+G+ G +ALKLDMSKAYDRVEW F+E+ M  MGF   W + VM C+++VS+S L+NGE
Subjt:  NRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGE

Query:  PQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSFM
        P     P+RGLRQGDPLSPYLFLLCAEGF+TLL + E    L G  I    P  +HLFFADDS VF KA + N      + ++YE ASGQ IN  KS   
Subjt:  PQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSFM

Query:  ASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLC
         S N++ +T ++   +LG+   +S   YLG+P   G+NK + F+ +K+RV K LQGW+E   SI GKE+L+K VAQ+IP+Y MSCF LP  +C   +++ 
Subjt:  ASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLC

Query:  SKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKGF
        ++FWWG  G   K HWM W+++C+ K++GGMGFR L AF  AMLAKQ WRL+ NP+SL  ++L+ +YF   NF EA LG+  S  W+ I   R +   G 
Subjt:  SKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIWGRGLFLKGF

Query:  RWRVGNGVQISIDKDPWISREGNPRVL-STHNSFKGLRVRDLL--DDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL
        R+++G+G  + I  D W+ R     V+ S  +  +  +V +L+  + + +W    +  +F   D  DI+ +P   +   D I+W+ +K G+F+VKSAYR+
Subjt:  RWRVGNGVQISIDKDPWISREGNPRVL-STHNSFKGLRVRDLL--DDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRL

Query:  ATREQSHNE-ASQSNASKEASFWNSIWKANVLLRTKVEDWLPKDYWGWMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWE
        A R  S +E  S S+ S     W  IW A V  + K+  W            ++ + L     L+ K  + ++   +      +A  + +    ++  W 
Subjt:  ATREQSHNE-ASQSNASKEASFWNSIWKANVLLRTKVEDWLPKDYWGWMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWE

Query:  SSYLKTHSPNG-PRSP-------ASHVH---------------------WEKPPPNSWKLNFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSI
         S L  H+  G  RSP         +VH                     W  PP    K NFD A+    GRG +G + RD++G  +    K + +  S 
Subjt:  SSYLKTHSPNG-PRSP-------ASHVH---------------------WEKPPPNSWKLNFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSI

Query:  KNLEACAMLEGIKKVFDTCNRLGITLEV-ESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSMRFLHYNCLLNTDAHCVAR
        ++ E  A  EG+         LG    + E D+  VV+ +     D S++    ++++ L  + PS  F       N  AH +AR
Subjt:  KNLEACAMLEGIKKVFDTCNRLGITLEV-ESDALEVVTVLNGVSDDLSDLKVFTDEIQSLAARAPSMRFLHYNCLLNTDAHCVAR

SwissProt top hits    e value    %identity    Alignment
O00370 LINE-1 retrotransposable element ORF2 protein    8.5e-25    23.14
Query:  KLLEEEESYWKLRAREEW-LKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAIT-KKISAEQKI
        K +E +++  K+     W  +  +K  +       +++++N    I N +        EI     +Y+K L  + + N E +    +  T  +++ E+  
Subjt:  KLLEEEESYWKLRAREEW-LKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAIT-KKISAEQKI

Query:  KLDAPFSKDELEKAIKGM------------LGYSQR----------------QGRGRADKQNLQA---LIPK-SKEPKRMSEFRPISLCNVIYKVIAKVL
         L+ P +  E+   I  +              + QR                +  G       +A   LIPK  ++  +   FRPISL N+  K++ K+L
Subjt:  KLDAPFSKDELEKAIKGM------------LGYSQR----------------QGRGRADKQNLQA---LIPK-SKEPKRMSEFRPISLCNVIYKVIAKVL

Query:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING
        ANR++Q +  +I H Q  F+PG Q   N       I   IN+ K K  +V + +D  KA+D+++  F+ + + K+G    + + +       + +I++NG
Subjt:  ANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILING

Query:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKL---YEEASGQTINLDK
        +  + F    G RQG PLSP LF +  E  +  + +E+    + G ++       +   FADD +V+L   E  + S   +LKL   + + SG  IN+ K
Subjt:  EPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKL---YEEASGQTINLDK

Query:  SSFMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTG------KNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSC--FRL
        S      N N +T ++    L     +    YLG+           +N   + K IK+   K    WK    S  G+  ++K       +Y  +    +L
Subjt:  SSFMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTG------KNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSC--FRL

Query:  PNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRN
        P    +  ++   KF W     +     +S K    NK+ GG+       + +A + K +W   +N
Subjt:  PNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRN

P08548 LINE-1 reverse transcriptase homolog    1.1e-27    25.54
Query:  DKNTKWFHSKATQ---------RKKRNDT--KGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAI-TKKISAEQKIKLDAPFSKDELE
        +K+  WF  K  +         RKKR  +    I N  D+      EI     +Y+K L      N + I    EA    ++S ++   L+ P S  E+ 
Subjt:  DKNTKWFHSKATQ---------RKKRNDT--KGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAI-TKKISAEQKIKLDAPFSKDELE

Query:  KAIK--------GMLGYSQRQGRGRADK---------QNLQ--------------ALIPK-SKEPKRMSEFRPISLCNVIYKVIAKVLANRMKQVLDTII
          I+        G  G++    +   ++         QN++               LIPK  K+P R   +RPISL N+  K++ K+L NR++Q +  II
Subjt:  KAIK--------GMLGYSQRQGRGRADK---------QNLQ--------------ALIPK-SKEPKRMSEFRPISLCNVIYKVIAKVLANRMKQVLDTII

Query:  SHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGEPQDEFKPSRGL
         H Q  F+PG Q   N       I   INK K K  ++ L +D  KA+D ++  F+   +KK+G    + + +    S  + +I++NG     F    G 
Subjt:  SHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGEPQDEFKPSRGL

Query:  RQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKS-SFMASRNVNEETI
        RQG PLSP LF +  E  +  +  E++   + G  I +         FADD +V+L+    +      V+K Y   SG  IN  KS +F+ + N   E  
Subjt:  RQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKS-SFMASRNVNEETI

Query:  AKCERILGI--KSTNSLGHYL--GMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYF---DRLCSKF
         K      +  K    LG YL   +     +N   + K I + V K    WK    S  G+  ++K       +Y  +   +   + SYF   +++   F
Subjt:  AKCERILGI--KSTNSLGHYL--GMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYF---DRLCSKF

Query:  WWGSSGHKTKTHWMSWKKMCRNKSQ-GGMGFRYLSAFIQAMLAKQSWRLIRN
         W     +        K +  NK++ GG+    L  + ++++ K +W   +N
Subjt:  WWGSSGHKTKTHWMSWKKMCRNKSQ-GGMGFRYLSAFIQAMLAKQSWRLIRN

P0C2F6 Putative ribonuclease H protein At1g65750    2.0e-34    32.79
Query:  FKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQA
        F  I +RV   + GW+E   S  G+  L KAV  ++PV++MS   LP +I +  D+L   F WGS+  K K H + W K+C  K +GG+G R   +  +A
Subjt:  FKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQA

Query:  MLAKQSWRLIRNPNSLLFKILRGRYFKGK---NFLEAPLGNSSSLTWRGIIWG-RGLFLKGFRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFK----
        +++K  WRL++  NSL   +L+ +Y  G+   +    P G+ SS TWR I  G R +   G  W  G+G QI    D W+S  G P +L   N  +    
Subjt:  MLAKQSWRLIRNPNSLLFKILRGRYFKGK---NFLEAPLGNSSSLTWRGIIWG-RGLFLKGFRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFK----

Query:  -GLRVRDLLDDNNKWKESTILEVFSHQDAYD----ILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLATREQSHNEASQSNASKEASFWNSIWKANVLL
          +  +DL      W  + I    ++    +    +L++ TG++   D + W  ++ G FSV+SAY + T +    E  + N    ASF+N +WK  V  
Subjt:  -GLRVRDLLDDNNKWKESTILEVFSHQDAYD----ILNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLATREQSHNEASQSNASKEASFWNSIWKANVLL

Query:  RTKVEDWL
        R K   WL
Subjt:  RTKVEDWL

P11369 LINE-1 retrotransposable element ORF2 protein    3.3e-29    26.21
Query:  LIPK-SKEPKRMSEFRPISLCNVIYKVIAKVLANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFI
        LIPK  K+P ++  FRPISL N+  K++ K+LANR+++ +  II   Q  F+PG Q   N       IH  INK K K  ++ + LD  KA+D+++  F+
Subjt:  LIPK-SKEPKRMSEFRPISLCNVIYKVIAKVLANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFI

Query:  EECMKKMGFSSNWFQKVMKCISSVSFSILINGEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFL
         + +++ G    +   +    S    +I +NGE  +      G RQG PLSPYLF +  E  +  + +++    + G +I       + L  ADD +V++
Subjt:  EECMKKMGFSSNWFQKVMKCISSVSFSILINGEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDSLVFL

Query:  KAEERNLCSFNGVLKLYEEASGQTINLDKS-SFMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIV---FKRIKDRVEKMLQGWKENLFS
           + +      ++  + E  G  IN +KS +F+ ++N   E   +      I + N    YLG+ + T + K +    FK +K  +++ L+ WK+   S
Subjt:  KAEERNLCSFNGVLKLYEEASGQTINLDKS-SFMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIV---FKRIKDRVEKMLQGWKENLFS

Query:  IGGKEMLIKAVAQAIPVYTMSC--FRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRN
          G+  ++K       +Y  +    ++P    +  +    KF W +   +     +  K     ++ GG+    L  + +A++ K +W   R+
Subjt:  IGGKEMLIKAVAQAIPVYTMSC--FRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRN

P93295 Uncharacterized mitochondrial protein AtMg00310    2.3e-30    43.15
Query:  AIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNK-SQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLE
        A+PVY MSCFRL   +C       ++FWW S  +K K  W++W+K+C++K   GG+GFR L  F QA+LAKQS+R+I  P++LL ++LR RYF   + +E
Subjt:  AIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNK-SQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLE

Query:  APLGNSSSLTWRGIIWGRGLFLKGFRWRVGNGVQISIDKDPWISRE
          +G   S  WR II GR L  +G    +G+G+   +  D WI  E
Subjt:  APLGNSSSLTWRGIIWGRGLFLKGFRWRVGNGVQISIDKDPWISRE

Arabidopsis top hits    e value    %identity    Alignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    1.2e-10    24.47
Query:  YLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKS
        YLG+P  T K     +  + +++   +  W     S  G+  LI +V  ++  + MS FRLP+      D +CS F W      TK   ++W  +C  K 
Subjt:  YLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEMLIKAVAQAIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKS

Query:  QGGMGFRYLSAFIQAMLAKQSWRLIRNP--NSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIW---GRGLFLKGFRWRVGNGVQI
        +GG+G R L    +       W +  N    S ++K +         F++  + N S+ ++    W   GR + + G R  +  G+ +
Subjt:  QGGMGFRYLSAFIQAMLAKQSWRLIRNP--NSLLFKILRGRYFKGKNFLEAPLGNSSSLTWRGIIW---GRGLFLKGFRWRVGNGVQI

AT4G20520.1 RNA binding;RNA-directed DNA polymerases    1.7e-12    34.09
Query:  LANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMK
        +  R+K ++  +I  +Q+SF+PGR  +DN +   E +H++  ++KG  G++ LKLD+ KAYDR+ W ++E+ +   GF   W  ++ +
Subjt:  LANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYVALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMK

AT4G29090.1 Ribonuclease H-like superfamily protein    1.3e-41    25.81
Query:  AIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEA
        A+P YTM+CF LP  +C     + + FWW +       HW +W  +   K++GG+GF+ + AF  A+L KQ WR++  P SL+ K+ + RYF   + L A
Subjt:  AIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEA

Query:  PLGNSSSLTWRGIIWGRGLFLKGFRWRVGNGVQISIDKDPWIS----------REGNPRVLSTHNSFKGLRVRDLLDDNNK-WKESTILEVFSHQDAYDI
        PLG+  S  W+ I   + +  +G R  VGNG  I I +  W+           +   P+  ++ +S   L+V DL+D++ + W++  I  +F   +   I
Subjt:  PLGNSSSLTWRGIIWGRGLFLKGFRWRVGNGVQISIDKDPWIS----------REGNPRVLSTHNSFKGLRVRDLLDDNNK-WKESTILEVFSHQDAYDI

Query:  LNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLATREQSHNEASQSNASKEAS---FWNSIWKANVLLRTKVEDWLPKDYWGWMKDS-----------LSK
          +  G +   D   W     G ++VKS Y + T  Q  N+ S      E S    +  IWK+      K++ +L    W  + +S           LSK
Subjt:  LNMPTGSKDSNDEIIWHLNKKGIFSVKSAYRLATREQSHNEASQSNASKEAS---FWNSIWKANVLLRTKVEDWLPKDYWGWMKDS-----------LSK

Query:  E--------------------------------------ELAKSII-------------------------LMWKLWEFRNRAEYHNHPAATAKIIQSNI
        E                                      E A SI                          L+W+LW+ RN   +        ++++   
Subjt:  E--------------------------------------ELAKSII-------------------------LMWKLWEFRNRAEYHNHPAATAKIIQSNI

Query:  EASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSW-KLNFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCN
        E  L EW            P+   S     +PPP+ W K N DA W     R G+GW++R+  G +   G + + K  S+   E    LE ++    + +
Subjt:  EASLIEWESSYLKTHSPNGPRSPASHVHWEKPPPNSW-KLNFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCN

Query:  RLGITLEV-ESDALEVVTVLNGVSDDL-SDLKVFTDEIQSLAARAPSMRFLHYNCLLNTDAHCVARSAA-----DCGLSSSVRSSLRSS
        R      + ESD+  ++ +LN  +D++   LK    ++Q L ++   ++F+      NT A  VAR +      D  L S V S  RSS
Subjt:  RLGITLEV-ESDALEVVTVLNGVSDDL-SDLKVFTDEIQSLAARAPSMRFLHYNCLLNTDAHCVARSAA-----DCGLSSSVRSSLRSS

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    1.6e-31    43.15
Query:  AIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNK-SQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLE
        A+PVY MSCFRL   +C       ++FWW S  +K K  W++W+K+C++K   GG+GFR L  F QA+LAKQS+R+I  P++LL ++LR RYF   + +E
Subjt:  AIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNK-SQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLE

Query:  APLGNSSSLTWRGIIWGRGLFLKGFRWRVGNGVQISIDKDPWISRE
          +G   S  WR II GR L  +G    +G+G+   +  D WI  E
Subjt:  APLGNSSSLTWRGIIWGRGLFLKGFRWRVGNGVQISIDKDPWISRE

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)    2.1e-15    54.41
Query:  LINGEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDS
        +ING PQ    PSRGLRQGDPLSPYLF+LC E  S L  R +    L G +++N  P   HL FADD+
Subjt:  LINGEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDS
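
The %identity values listed for each hit can be cross-checked against the alignment blocks themselves. The sketch below assumes the usual BLAST-style convention that the unlabeled middle line shows a letter for an identical residue, `+` for a conservative substitution, and a space otherwise; the function names and the 8-column prefix width are illustrative assumptions, not part of the database. For the ATMG01250.1 block, counting letters in the middle line gives 37 of 68 columns, consistent with the listed 54.41.

```python
# Width of the "Query:  " / "Subjt:  " label; the middle line is indented by the same amount.
PREFIX_WIDTH = len("Query:  ")


def block_identity(query_line, match_line):
    """Return (identities, positives, columns) for one alignment block."""
    query = query_line.rstrip("\n")[PREFIX_WIDTH:]
    # Trailing spaces on the middle line are often lost, so pad it back to the query width.
    match = match_line.rstrip("\n")[PREFIX_WIDTH:].ljust(len(query))[: len(query)]
    identities = sum(1 for ch in match if ch.isalpha())
    positives = sum(1 for ch in match if ch.isalpha() or ch == "+")
    return identities, positives, len(query)


def percent_identity(blocks):
    """Pool several (query_line, match_line) blocks of one hit into a single percentage."""
    identities = columns = 0
    for query_line, match_line in blocks:
        ident, _, cols = block_identity(query_line, match_line)
        identities += ident
        columns += cols
    return round(100.0 * identities / columns, 2)


if __name__ == "__main__":
    # The single block of the ATMG01250.1 hit, copied from the alignment above.
    q = "Query:  LINGEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFFADDS"
    m = "        +ING PQ    PSRGLRQGDPLSPYLF+LC E  S L  R +    L G +++N  P   HL FADD+"
    print(block_identity(q, m), percent_identity([(q, m)]))
```

For hits that span several blocks, pass all of that hit's (Query, middle line) pairs to `percent_identity`, since the listed value refers to the whole alignment rather than to any single block.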


Sequences
CDS sequence
ATGGAGTTGGATAAACTACTTGAGGAAGAAGAAAGCTATTGGAAATTGAGAGCTAGGGAAGAGTGGCTAAAAAGTGGTGATAAGAATACCAAATGGTTTCATTCAAAAGC
TACACAAAGAAAGAAAAGAAATGATACAAAAGGCATTTTCAACAGCAGGGACAAGTGGGTTGAAGCTTCCAAGGAGATAGGCTATGCAGCTACTGACTACTTCAAATCCC
TCCTTTGCTCGAGTATGCCCAACAACGAAGGCATAAGAGTAGTCACTGAGGCTATCACGAAAAAGATATCAGCAGAACAAAAGATCAAACTAGATGCCCCTTTCTCCAAA
GATGAACTAGAAAAAGCTATCAAAGGTATGCTTGGGTATTCTCAACGACAAGGAAGAGGTAGAGCCGATAAACAAAACCTACAAGCCCTTATCCCCAAATCTAAGGAGCC
TAAAAGAATGAGCGAGTTTAGACCGATTAGCCTTTGCAATGTGATATACAAAGTGATTGCAAAAGTCCTAGCGAACCGTATGAAACAAGTTCTTGACACCATTATCTCTC
ATTCCCAATCATCTTTTGTCCCTGGTAGGCAAATATCGGATAATGCATTGGTGGGTTTTGAATGCATCCATGCTTTGATCAACAAAAGGAAAGGGAAAGCAGGGTATGTA
GCCTTAAAACTCGACATGAGCAAGGCCTACGATCGAGTTGAATGGGTGTTCATTGAGGAATGCATGAAGAAGATGGGTTTTAGCTCCAATTGGTTCCAAAAAGTGATGAA
GTGTATCTCTTCAGTGAGCTTCTCAATTCTCATAAATGGGGAGCCTCAAGACGAGTTCAAACCTAGCAGAGGTCTTAGACAAGGAGATCCCCTGTCTCCTTATCTTTTCC
TGCTCTGTGCAGAGGGCTTCTCAACTCTTCTAGAAAGGGAAGAATCTCTATCTCACTTATCTGGTTTCAAGATTAACAACTATTGCCCGAACTTTACCCATCTTTTTTTT
GCAGATGACAGCTTGGTCTTTCTCAAAGCTGAAGAAAGGAATCTTTGCTCTTTCAATGGTGTGCTGAAGCTCTATGAGGAAGCATCCGGTCAAACAATCAATCTGGACAA
ATCTTCATTTATGGCTAGCAGAAATGTCAATGAGGAGACTATAGCAAAATGTGAAAGGATCCTTGGAATCAAAAGCACGAACTCCTTAGGCCATTATTTGGGGATGCCTT
CCCAAACAGGAAAGAATAAAGGCATTGTTTTTAAAAGAATTAAAGACAGGGTTGAAAAAATGCTCCAAGGGTGGAAGGAGAATCTTTTCTCCATAGGAGGCAAAGAAATG
CTCATCAAAGCCGTGGCACAAGCGATCCCGGTATACACTATGAGTTGCTTCCGATTACCCAATAATATTTGTTCTTACTTTGACAGGTTATGCTCCAAATTCTGGTGGGG
ATCTTCGGGTCATAAAACCAAAACCCACTGGATGAGCTGGAAGAAGATGTGTAGGAATAAAAGCCAAGGAGGCATGGGATTTCGTTACCTATCAGCGTTTATCCAAGCCA
TGCTCGCCAAACAAAGTTGGAGACTTATCAGGAACCCCAATAGCCTGCTTTTCAAGATCCTGAGAGGAAGATATTTCAAAGGAAAAAATTTCCTCGAGGCCCCTTTAGGT
AACTCTTCGTCGCTCACTTGGAGAGGCATCATATGGGGTCGTGGCCTGTTCCTCAAAGGCTTCCGTTGGAGAGTAGGCAATGGAGTCCAAATATCTATTGATAAAGATCC
GTGGATAAGTAGAGAGGGCAACCCTAGAGTCCTATCAACTCATAACTCTTTCAAAGGATTAAGAGTGAGGGATCTTCTAGACGACAACAACAAGTGGAAAGAAAGCACTA
TTTTGGAAGTGTTCTCCCACCAAGATGCTTATGACATTCTCAACATGCCGACCGGTAGTAAAGACTCAAATGATGAAATTATCTGGCATCTAAACAAAAAGGGCATCTTC
TCTGTGAAGAGCGCCTATCGTCTAGCTACAAGGGAGCAATCCCATAATGAAGCATCTCAATCGAACGCCAGCAAAGAAGCCTCTTTTTGGAACAGTATTTGGAAGGCTAA
TGTGCTCCTTAGAACCAAAGTGGAGGATTGGTTACCTAAAGACTATTGGGGATGGATGAAGGACAGTTTAAGCAAGGAAGAGCTAGCTAAAAGCATCATTCTCATGTGGA
AACTATGGGAATTCAGGAACAGAGCAGAATATCACAACCATCCAGCAGCAACAGCAAAAATCATTCAGAGCAACATTGAAGCAAGCTTAATAGAATGGGAGTCCTCTTAC
CTTAAGACCCATTCTCCGAATGGGCCAAGGAGCCCCGCGAGTCACGTCCACTGGGAGAAACCACCGCCGAACTCCTGGAAACTAAATTTCGACGCTGCCTGGATTGAGAA
GGAAGGTCGAGGAGGGCTCGGATGGCTCGTTCGCGACTCGAATGGATCCTTGATCTGTTGTGGTTTGAAGCAAATCAGCAAGAATTGGTCCATAAAGAATTTGGAAGCGT
GTGCAATGCTAGAAGGCATCAAAAAGGTTTTTGATACCTGTAATCGGTTGGGGATTACGCTGGAAGTCGAATCAGATGCACTCGAGGTCGTCACTGTTCTCAACGGCGTG
TCGGATGATCTTTCAGATTTGAAGGTTTTTACGGATGAAATCCAATCCCTTGCTGCCCGTGCGCCTTCCATGCGTTTTCTTCATTATAATTGCCTTTTGAACACAGATGC
GCACTGTGTTGCGAGAAGCGCCGCCGACTGTGGTCTATCGTCTTCGGTGAGATCTAGCCTCCGGTCGTCTTCTTCGCGGGAAAGGGAGATGTTTTTTTGGGCTCCCCATA
TTCCTTTTTGTTTTTTCCCTCCTATTGATGAGGAACCTATCAATTCCAACATTGTTCGGGAATTTTACGCTAATCTTGATGTTAAGGATGACTTTGAGGTTATAGTTCGA
GGAGTGCCTGTACAGTGGAGCCCAAAAGCCATTAATGATTTGTTTAATCTCCAGGATTTTCCGCATGCAGTTTTTAATGAGATGATGGTTGCACCATCTAGCGACCAATT
AAGTGCAGCTGTCCGAGAGGTTGGCATTGAGGGTGCTCAATGGAGGGTGTCGCAGACGCGGAAGCACACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGA
TGGGTTTCATTAGGCTACGCTTACTGCCGACAACACACGACTCCACAGTATCTTGGGACAGGGTATTGCTTGCCTTTGCCATTCTTCGCTCGATGAGTAATGATGTAGGA
AAAATAATTTCTTATGAGATTGTTGACTGTTGGAAAAAGAAGGTGGGGAAGTTGTTCTTCCCAAATACAATTACTATGCTGTGCAGAAACGCAGGGGTTCCAGTGGATGA
GAATGATGTAATTTTATTTGATAAGGGAATCATTGATACGTCCAATTTGGCGCGACTCCAGCGTACGCAGGAGATACGACAAGGAGGGCTAATCTACGACATGAATACAA
TTCTAGAACAACTGGCACTGTTGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGGACCTATGTTAAGAATCGTGATGCTGGTTTGAAGTGGGCACTA
CAGGAGAATTTTTCAAAGCCATATCCAACCCTCTCTGCATTCCCCGAAGACTTACTGAACCCCTGGATTCCACCGCCGCCTGCTGAGAAGGAAGATGAAGAGGAAGATTT
AGGTCAGGAAGATTGA
mRNA sequence
ATGGAGTTGGATAAACTACTTGAGGAAGAAGAAAGCTATTGGAAATTGAGAGCTAGGGAAGAGTGGCTAAAAAGTGGTGATAAGAATACCAAATGGTTTCATTCAAAAGC
TACACAAAGAAAGAAAAGAAATGATACAAAAGGCATTTTCAACAGCAGGGACAAGTGGGTTGAAGCTTCCAAGGAGATAGGCTATGCAGCTACTGACTACTTCAAATCCC
TCCTTTGCTCGAGTATGCCCAACAACGAAGGCATAAGAGTAGTCACTGAGGCTATCACGAAAAAGATATCAGCAGAACAAAAGATCAAACTAGATGCCCCTTTCTCCAAA
GATGAACTAGAAAAAGCTATCAAAGGTATGCTTGGGTATTCTCAACGACAAGGAAGAGGTAGAGCCGATAAACAAAACCTACAAGCCCTTATCCCCAAATCTAAGGAGCC
TAAAAGAATGAGCGAGTTTAGACCGATTAGCCTTTGCAATGTGATATACAAAGTGATTGCAAAAGTCCTAGCGAACCGTATGAAACAAGTTCTTGACACCATTATCTCTC
ATTCCCAATCATCTTTTGTCCCTGGTAGGCAAATATCGGATAATGCATTGGTGGGTTTTGAATGCATCCATGCTTTGATCAACAAAAGGAAAGGGAAAGCAGGGTATGTA
GCCTTAAAACTCGACATGAGCAAGGCCTACGATCGAGTTGAATGGGTGTTCATTGAGGAATGCATGAAGAAGATGGGTTTTAGCTCCAATTGGTTCCAAAAAGTGATGAA
GTGTATCTCTTCAGTGAGCTTCTCAATTCTCATAAATGGGGAGCCTCAAGACGAGTTCAAACCTAGCAGAGGTCTTAGACAAGGAGATCCCCTGTCTCCTTATCTTTTCC
TGCTCTGTGCAGAGGGCTTCTCAACTCTTCTAGAAAGGGAAGAATCTCTATCTCACTTATCTGGTTTCAAGATTAACAACTATTGCCCGAACTTTACCCATCTTTTTTTT
GCAGATGACAGCTTGGTCTTTCTCAAAGCTGAAGAAAGGAATCTTTGCTCTTTCAATGGTGTGCTGAAGCTCTATGAGGAAGCATCCGGTCAAACAATCAATCTGGACAA
ATCTTCATTTATGGCTAGCAGAAATGTCAATGAGGAGACTATAGCAAAATGTGAAAGGATCCTTGGAATCAAAAGCACGAACTCCTTAGGCCATTATTTGGGGATGCCTT
CCCAAACAGGAAAGAATAAAGGCATTGTTTTTAAAAGAATTAAAGACAGGGTTGAAAAAATGCTCCAAGGGTGGAAGGAGAATCTTTTCTCCATAGGAGGCAAAGAAATG
CTCATCAAAGCCGTGGCACAAGCGATCCCGGTATACACTATGAGTTGCTTCCGATTACCCAATAATATTTGTTCTTACTTTGACAGGTTATGCTCCAAATTCTGGTGGGG
ATCTTCGGGTCATAAAACCAAAACCCACTGGATGAGCTGGAAGAAGATGTGTAGGAATAAAAGCCAAGGAGGCATGGGATTTCGTTACCTATCAGCGTTTATCCAAGCCA
TGCTCGCCAAACAAAGTTGGAGACTTATCAGGAACCCCAATAGCCTGCTTTTCAAGATCCTGAGAGGAAGATATTTCAAAGGAAAAAATTTCCTCGAGGCCCCTTTAGGT
AACTCTTCGTCGCTCACTTGGAGAGGCATCATATGGGGTCGTGGCCTGTTCCTCAAAGGCTTCCGTTGGAGAGTAGGCAATGGAGTCCAAATATCTATTGATAAAGATCC
GTGGATAAGTAGAGAGGGCAACCCTAGAGTCCTATCAACTCATAACTCTTTCAAAGGATTAAGAGTGAGGGATCTTCTAGACGACAACAACAAGTGGAAAGAAAGCACTA
TTTTGGAAGTGTTCTCCCACCAAGATGCTTATGACATTCTCAACATGCCGACCGGTAGTAAAGACTCAAATGATGAAATTATCTGGCATCTAAACAAAAAGGGCATCTTC
TCTGTGAAGAGCGCCTATCGTCTAGCTACAAGGGAGCAATCCCATAATGAAGCATCTCAATCGAACGCCAGCAAAGAAGCCTCTTTTTGGAACAGTATTTGGAAGGCTAA
TGTGCTCCTTAGAACCAAAGTGGAGGATTGGTTACCTAAAGACTATTGGGGATGGATGAAGGACAGTTTAAGCAAGGAAGAGCTAGCTAAAAGCATCATTCTCATGTGGA
AACTATGGGAATTCAGGAACAGAGCAGAATATCACAACCATCCAGCAGCAACAGCAAAAATCATTCAGAGCAACATTGAAGCAAGCTTAATAGAATGGGAGTCCTCTTAC
CTTAAGACCCATTCTCCGAATGGGCCAAGGAGCCCCGCGAGTCACGTCCACTGGGAGAAACCACCGCCGAACTCCTGGAAACTAAATTTCGACGCTGCCTGGATTGAGAA
GGAAGGTCGAGGAGGGCTCGGATGGCTCGTTCGCGACTCGAATGGATCCTTGATCTGTTGTGGTTTGAAGCAAATCAGCAAGAATTGGTCCATAAAGAATTTGGAAGCGT
GTGCAATGCTAGAAGGCATCAAAAAGGTTTTTGATACCTGTAATCGGTTGGGGATTACGCTGGAAGTCGAATCAGATGCACTCGAGGTCGTCACTGTTCTCAACGGCGTG
TCGGATGATCTTTCAGATTTGAAGGTTTTTACGGATGAAATCCAATCCCTTGCTGCCCGTGCGCCTTCCATGCGTTTTCTTCATTATAATTGCCTTTTGAACACAGATGC
GCACTGTGTTGCGAGAAGCGCCGCCGACTGTGGTCTATCGTCTTCGGTGAGATCTAGCCTCCGGTCGTCTTCTTCGCGGGAAAGGGAGATGTTTTTTTGGGCTCCCCATA
TTCCTTTTTGTTTTTTCCCTCCTATTGATGAGGAACCTATCAATTCCAACATTGTTCGGGAATTTTACGCTAATCTTGATGTTAAGGATGACTTTGAGGTTATAGTTCGA
GGAGTGCCTGTACAGTGGAGCCCAAAAGCCATTAATGATTTGTTTAATCTCCAGGATTTTCCGCATGCAGTTTTTAATGAGATGATGGTTGCACCATCTAGCGACCAATT
AAGTGCAGCTGTCCGAGAGGTTGGCATTGAGGGTGCTCAATGGAGGGTGTCGCAGACGCGGAAGCACACGTTTCAAGCTGCTTATTTGAAGAGTGAAGCCAACACTTGGA
TGGGTTTCATTAGGCTACGCTTACTGCCGACAACACACGACTCCACAGTATCTTGGGACAGGGTATTGCTTGCCTTTGCCATTCTTCGCTCGATGAGTAATGATGTAGGA
AAAATAATTTCTTATGAGATTGTTGACTGTTGGAAAAAGAAGGTGGGGAAGTTGTTCTTCCCAAATACAATTACTATGCTGTGCAGAAACGCAGGGGTTCCAGTGGATGA
GAATGATGTAATTTTATTTGATAAGGGAATCATTGATACGTCCAATTTGGCGCGACTCCAGCGTACGCAGGAGATACGACAAGGAGGGCTAATCTACGACATGAATACAA
TTCTAGAACAACTGGCACTGTTGGCCAGTAGGCAGGAGTTTGCTGAAAGGCAAACTTTAACCTTCTGGACCTATGTTAAGAATCGTGATGCTGGTTTGAAGTGGGCACTA
CAGGAGAATTTTTCAAAGCCATATCCAACCCTCTCTGCATTCCCCGAAGACTTACTGAACCCCTGGATTCCACCGCCGCCTGCTGAGAAGGAAGATGAAGAGGAAGATTT
AGGTCAGGAAGATTGA
Protein sequence
MELDKLLEEEESYWKLRAREEWLKSGDKNTKWFHSKATQRKKRNDTKGIFNSRDKWVEASKEIGYAATDYFKSLLCSSMPNNEGIRVVTEAITKKISAEQKIKLDAPFSK
DELEKAIKGMLGYSQRQGRGRADKQNLQALIPKSKEPKRMSEFRPISLCNVIYKVIAKVLANRMKQVLDTIISHSQSSFVPGRQISDNALVGFECIHALINKRKGKAGYV
ALKLDMSKAYDRVEWVFIEECMKKMGFSSNWFQKVMKCISSVSFSILINGEPQDEFKPSRGLRQGDPLSPYLFLLCAEGFSTLLEREESLSHLSGFKINNYCPNFTHLFF
ADDSLVFLKAEERNLCSFNGVLKLYEEASGQTINLDKSSFMASRNVNEETIAKCERILGIKSTNSLGHYLGMPSQTGKNKGIVFKRIKDRVEKMLQGWKENLFSIGGKEM
LIKAVAQAIPVYTMSCFRLPNNICSYFDRLCSKFWWGSSGHKTKTHWMSWKKMCRNKSQGGMGFRYLSAFIQAMLAKQSWRLIRNPNSLLFKILRGRYFKGKNFLEAPLG
NSSSLTWRGIIWGRGLFLKGFRWRVGNGVQISIDKDPWISREGNPRVLSTHNSFKGLRVRDLLDDNNKWKESTILEVFSHQDAYDILNMPTGSKDSNDEIIWHLNKKGIF
SVKSAYRLATREQSHNEASQSNASKEASFWNSIWKANVLLRTKVEDWLPKDYWGWMKDSLSKEELAKSIILMWKLWEFRNRAEYHNHPAATAKIIQSNIEASLIEWESSY
LKTHSPNGPRSPASHVHWEKPPPNSWKLNFDAAWIEKEGRGGLGWLVRDSNGSLICCGLKQISKNWSIKNLEACAMLEGIKKVFDTCNRLGITLEVESDALEVVTVLNGV
SDDLSDLKVFTDEIQSLAARAPSMRFLHYNCLLNTDAHCVARSAADCGLSSSVRSSLRSSSSREREMFFWAPHIPFCFFPPIDEEPINSNIVREFYANLDVKDDFEVIVR
GVPVQWSPKAINDLFNLQDFPHAVFNEMMVAPSSDQLSAAVREVGIEGAQWRVSQTRKHTFQAAYLKSEANTWMGFIRLRLLPTTHDSTVSWDRVLLAFAILRSMSNDVG
KIISYEIVDCWKKKVGKLFFPNTITMLCRNAGVPVDENDVILFDKGIIDTSNLARLQRTQEIRQGGLIYDMNTILEQLALLASRQEFAERQTLTFWTYVKNRDAGLKWAL
QENFSKPYPTLSAFPEDLLNPWIPPPPAEKEDEEEDLGQED
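
As a consistency check, the CDS sequence can be translated and compared with the protein sequence. The following is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); the `translate` helper is a hypothetical name introduced here for illustration. The first codons of the CDS (`ATGGAGTTGGATAAACTACTTGAG`) translate to `MELDKLLE`, and the last five (`GGTCAGGAAGATTGA`) to `GQED` followed by a stop, matching the start and end of the protein sequence.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated in
# T, C, A, G order for each position, first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS string to protein; '*' marks a stop, 'X' an unrecognised codon."""
    # Drop line breaks and spaces so the wrapped sequence can be pasted as-is.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3))


if __name__ == "__main__":
    print(translate("ATGGAGTTGGATAAACTACTTGAG"))  # first eight codons of the CDS
    print(translate("GGTCAGGAAGATTGA"))           # last five codons, including the stop
```

To check the full record, paste the complete CDS into `translate`, strip the trailing `*`, and compare the result with the protein sequence after joining its lines.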