; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015314 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015314
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr12:10178083..10182519
RNA-Seq ExpressionLag0015314
SyntenyLag0015314
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR005135 - Endonuclease/exonuclease/phosphatase
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4262994.1 unnamed protein product [Prunus armeniaca]4.9e-8125.44Show/hide
Query:  LVDTMGRTKGWRRDVEGVELGGKRSKGGDEEERVRCSWSDCIGGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSV
        +V+  G  K  +   + + + G R +  + E      ++   GG     PPR M ++ WNV+G G+PRTF  L  +++EK+P ++FL ETKV+A++M + 
Subjt:  LVDTMGRTKGWRRDVEGVELGGKRSKGGDEEERVRCSWSDCIGGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSV

Query:  KRLLG-----------------------------------------------WWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEK
        +  LG                                               ++ TGFYG P T Q + +W LL RL       W         + +   
Subjt:  KRLLG-----------------------------------------------WWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEK

Query:  DGAGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV
        DG+        + ERLDR      + D Y     NHL    SDH  +++         S    R   FEE W ++PD  ++++ +W       G      
Subjt:  DGAGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV

Query:  IANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREG------------IAIRDAAV
        + N    C   +  W     GN   ++K A +++ +A+    TTD   L  + E  + E+L++ ++ W+QRSR  WL+EG               +   V
Subjt:  IANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREG------------IAIRDAAV

Query:  C---------------------------FIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRITGALLG----
        C                           F     Q  +  L ++ P ++   N  LL++FT EE+   L Q  P KA   NG+P +F +    ++G    
Subjt:  C---------------------------FIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRITGALLG----

Query:  --------------AVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGG--
                        N TL+ LIPK K    ++  + ISLC   YK+I+K + NR+K ILP +I++NQS F+P R ++DN +  FE +H ++    G  
Subjt:  --------------AVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGG--

Query:  --------------------------------------------------------GL---------------------------------NGANGNEAS
                                                                G+                                  GA  +   
Subjt:  --------------------------------------------------------GL---------------------------------NGANGNEAS

Query:  VIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG-----------------------------KILLDSGKEVLLKSIV
         +  L   YE+ SGQ +NY KS  + SPN        I+  L VP   CH++  G                             K+L  +GKE+L+K++ 
Subjt:  VIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG-----------------------------KILLDSGKEVLLKSIV

Query:  QAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGSV-------FSFVFGT-------EGPLFSPVRVLGGKSWFATVVCL
        QAIP YSM+CF +PKGL KE++  MA+FWW   +D R I W+ WE LC  K  G +       F+    T       + P     R+   +   +     
Subjt:  QAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGSV-------FSFVFGT-------EGPLFSPVRVLGGKSWFATVVCL

Query:  A-----------QLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQIQSGPSLPATSTVSDLF
        A            L  G+ELL +G RWR+GNG +   Y + WLP     +I S P LP ++ V DLF
Subjt:  A-----------QLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQIQSGPSLPATSTVSDLF

PRQ56718.1 putative RNA-directed DNA polymerase [Rosa chinensis]8.3e-8125.13Show/hide
Query:  MSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLGW-----------------------------------------------
        M+++ WN +G G+P T   L  +V    P V+FLS+T+ +A  M  V+  LGW                                               
Subjt:  MSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLGW-----------------------------------------------

Query:  ----------W-FTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGAGIS--------------------------------RWGE
                  W FTG YGFP   + S+TW+LL +L    + PW++GGD+N I  + +K G  +                                 R GE
Subjt:  ----------W-FTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGAGIS--------------------------------RWGE

Query:  TIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSW----GAGPGDPGDITPLVIANKAK
         +  RLDR   +  W DL+P+  V HLD S SDH P+ L +        +  ++  +FEE WL     +++V SSW    G GP         ++ N+ +
Subjt:  TIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSW----GAGPGDPGDITPLVIANKAK

Query:  SCMHSMAGWGRSKSGNFSRRI-KIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREG-------------------------
        S   ++  W     G   + I KI NQ      + L +    E  +  + +L ++L +++V+W+QR++  WL++G                         
Subjt:  SCMHSMAGWGRSKSGNFSRRI-KIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREG-------------------------

Query:  --------------IAIRDAAVCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIR---------ITGAL--
                      I +      F     +DF   L  +   VS+  N+AL R   +EE+  A++  HP+K+P P+G    F +         I GA+  
Subjt:  --------------IAIRDAAVCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIR---------ITGAL--

Query:  -------LGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGG------
               L AVN T V LIPK +  +++  +RPISLCNV YK+ SK L NR+K +L  LIS  QSAF+PGR + DN+++ FE  H L++   G       
Subjt:  -------LGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGG------

Query:  ---------------------------------------------LNG----------------------------------------------------
                                                     +NG                                                    
Subjt:  ---------------------------------------------LNG----------------------------------------------------

Query:  -----------------ANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG---------------------
                         A  ++  V++ LL+ YE+ASGQ VNY+KS ++FS N +   Q+ I+ IL V     H +  G                     
Subjt:  -----------------ANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG---------------------

Query:  --------KILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGS-------VFSFVFGTEGP---
                K L  +GKEVL+K++ QAIP Y M+CF +P+ L  E+HR +A+FWW    D R+IHW++WE LC PKC G        +F+    T+     
Subjt:  --------KILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGS-------VFSFVFGTEGP---

Query:  --------------------LFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQ
                             F    + GG+S+         +++GR++L +G R+++G+G +   +   W+P  +S +
Subjt:  --------------------LFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQ

XP_030945781.1 uncharacterized protein LOC115970260 [Quercus lobata]2.2e-8131.34Show/hide
Query:  WSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGA-----------------------GIS---------RWGETIYERLDRVFGTTAWTDLYPSYVVNHLD
        W++L  L      PW   GDFN +L   EK G                        G S         R GE I+ERLDR      W + +P+  V HL+
Subjt:  WSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGA-----------------------GIS---------RWGETIYERLDRVFGTTAWTDLYPSYVVNHLD

Query:  YSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPL-VIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSA
           SDHRP+ L L      + +  ++  RFE  W+  P+ +++V  +W   P      TP+ V A K K C   +  W R   GN  ++IK    ++  A
Subjt:  YSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPL-VIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSA

Query:  IADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREGIAIRDAAVCFIGPSVQ----DFDVALRDLNPS---------VSDDMNQALLRLFTEE
         A    T   E +++ + +L  +  ++E  W QRSR  WL+ G         F G + Q    +F   LRD N           V++ M++ L+R FT E
Subjt:  IADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREGIAIRDAAVCFIGPSVQ----DFDVALRDLNPS---------VSDDMNQALLRLFTEE

Query:  EILVALRQTHPNKAPDPNGLPGVFIR---------ITGALLG---------AVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPL
        E+ VA+++  P KAP P+G+P +F +         +T A+L          ++N T + LIPK      V+D RPISLCNV YK+ISK + NR+K +L  
Subjt:  EILVALRQTHPNKAPDPNGLPGVFIR---------ITGALLG---------AVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPL

Query:  LISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG-----ANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYIS---QILYVPH
        +IS+ QSAFI  R + DN ++ FE +H + K S  G  G     +   E + I++LL +YE ASGQ +N EK+ + FS NT+E TQ+ +     +L + H
Subjt:  LISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG-----ANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYIS---QILYVPH

Query:  ---------------CPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCL
                         C  Q            K K+L  +G+EV++K++VQ+IP YSMN F LP GL K+I   + KFWW GS D ++I W+ W +LC 
Subjt:  ---------------CPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCL

Query:  PKCMG-----------------SVFSFVFGTEGPL---FSPVRVLGGKSWFATV-----VCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSL
         K +G                  V+  +   E  L   FS      G    A+V          ++  R ++ +G  WR+GN +    +   WLP+    
Subjt:  PKCMG-----------------SVFSFVFGTEGPL---FSPVRVLGGKSWFATV-----VCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSL

Query:  QIQSGPSLPATSTVSDLF
        +I S  +  +   V DLF
Subjt:  QIQSGPSLPATSTVSDLF

XP_030963604.1 uncharacterized protein LOC115984727 [Quercus lobata]4.4e-8227.24Show/hide
Query:  WNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLL-----------------------------------------------GWWFTGF
        WN RG G+ +T + L  ++  + P V+FL+ET +  +R++S+   L                                                W FTGF
Subjt:  WNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLL-----------------------------------------------GWWFTGF

Query:  YGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGAGISRWGET--------------------------------IYERLDRVFGTTAWT
        YG P T   S +W+LL  L    D PWL GGDFN +L   EK G     +G+                                 ++ERLDR   T  W 
Subjt:  YGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGAGISRWGET--------------------------------IYERLDRVFGTTAWT

Query:  DLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV-IANKAKSCMHSMAGWGRSKSGNFSR
        D +P+  V      QSDH P+ ++    P       QR  RFE+ WL +      V  SW A   D    +P+  +      C   +  W ++  GN +R
Subjt:  DLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV-IANKAKSCMHSMAGWGRSKSGNFSR

Query:  RIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREGIA--------------------IRDA-AVCFIGPS-----VQDFD
         +    + +  A        S +  +Q + ++ E L+ +E  W+QRSR  WL  G +                    +RD+  V F G       + DF 
Subjt:  RIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREGIA--------------------IRDA-AVCFIGPS-----VQDFD

Query:  VAL-RDLNPS------------VSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIR------------------ITGALLGAVNDTLVVLIP
          L    NPS            VS DMN  L+  FT  E+  AL+Q  P KAP PNG+P +F +                   T  +L  +N T + LIP
Subjt:  VAL-RDLNPS------------VSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIR------------------ITGALLGAVNDTLVVLIP

Query:  KTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLN------------------------
        K K+   + +  PI+LCNV YKL+SK L NR+K +LP +IS++QSAF   + + DN ++ FE +H ++ +  G +                         
Subjt:  KTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLN------------------------

Query:  ----------------------GANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG---------------
                               A   +   I+ LL  YEKASGQ +N  K+ + FS N  + T++ I  +L V     +++  G               
Subjt:  ----------------------GANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG---------------

Query:  --------------KILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMG----------------
                      K+L  +GKEVLLK +VQAIP ++M+CF LP GL ++I   + KFWW    D R IHW  WE LC PK  G                
Subjt:  --------------KILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMG----------------

Query:  ---------------SVFSFVFGTEGPLFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPN
                        VF   +   G +F      G  +W         ++  + L+ R  RWRIGNG+    +   WLPN
Subjt:  ---------------SVFSFVFGTEGPLFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPN

XP_042939444.1 uncharacterized protein LOC122274474 [Carya illinoinensis]1.7e-8127.01Show/hide
Query:  MSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLG----WWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILC
        M+ + WN RG G+PRT + L+ LV+ K+P ++FLSETK +  R+  +K  LG    +    FYG P + +   +W LL  L+   + PWL  GDFN I  
Subjt:  MSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLG----WWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILC

Query:  QDEKDGAGISRW---------------------------------GETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQ
        Q EK GA    +                                 G+   ERLDR  G   W +L+ ++ V HLD +QSDH+   L++           +
Subjt:  QDEKDGAGISRW---------------------------------GETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQ

Query:  RVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLVIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQE
        RV RFE  W ++ +   ++   W       G  T          C   +  W R+K  +  + +K   + ++  + +    +  E + + +  +  ++  
Subjt:  RVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLVIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQE

Query:  DEVYWKQRSREIWLREG-----------------------------IAIRDAAVC----------FIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEE
        + + W+QR+++ WL++G                             +      +C          F        D  L  L   ++DDM  +L  +FTE 
Subjt:  DEVYWKQRSREIWLREG-----------------------------IAIRDAAVC----------FIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEE

Query:  EILVALRQTHPNKAPDPNGLPGVFIR---------ITGALL---------GAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPL
        E+  A    +P  +P P+G P +F +         +T A L           +N+TL+ LIPK      V D RPISLCNV YK+I+K L NR+K ILP 
Subjt:  EILVALRQTHPNKAPDPNGLPGVFIR---------ITGALL---------GAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPL

Query:  LISQNQSAFIPGRYVVDNAILGFECIHELRKESGGG----------------------------------------------LNGANGNEASVIRDLLIW
        +IS  Q+AF+PGR + DN I+ FE +H ++    G                                                  AN  E S ++ LL  
Subjt:  LISQNQSAFIPGRYVVDNAILGFECIHELRKESGGG----------------------------------------------LNGANGNEASVIRDLLIW

Query:  YEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG-----------------------------KILLDSGKEVLLKSIVQAIPCYSM
        YE+ASGQ +N +K+ + FS NT E T++ ++ I  +     +++  G                             K+L  +GKE L+K+++QAIP YSM
Subjt:  YEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKG-----------------------------KILLDSGKEVLLKSIVQAIPCYSM

Query:  NCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGSV----FSF----VFGTEG------PLFSPVRVLGGK----SWFATV-------
          F LP+ L+ E+++ +  FWW   E   RIHW+SW+ +   K  G +    F F    +   +G      P     RVL  K    S F  V       
Subjt:  NCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGSV----FSF----VFGTEG------PLFSPVRVLGGK----SWFATV-------

Query:  VCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQIQSG-PSLPATSTVSDL
              +  R LL  G  WR+G+G++   +   WLP   S ++QS    LP+ + V+ L
Subjt:  VCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQIQSG-PSLPATSTVSDL

TrEMBL top hitse value%identityAlignment
A0A2N9FCL2 Reverse transcriptase domain-containing protein1.1e-9728.73Show/hide
Query:  PRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRL-LGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILC
        P  M  I WN RG G+  T + L  LV+ K P VLFLSET +   R+  ++     W  T FYG P T     +W+LL  L G    PW   GDFN I+ 
Subjt:  PRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRL-LGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILC

Query:  QDEKDGAGIS---------------------------------RWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQ
          EK+G  +                                  R   T + RLDR   T  W   + + VV+HL+ + SDH+P+   L+ HP+   +  Q
Subjt:  QDEKDGAGIS---------------------------------RWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQ

Query:  RVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLVIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQE
        R+ RFE+ W   P    ++  +W   P   G  T  V A K + C  ++  W RS+ GN ++ +K   ++++ A  D       E +I    ++ ++L +
Subjt:  RVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLVIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQE

Query:  DEVYWKQRSREIWLREG--------------------IAIRDAA-------------------VCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEE
        +E  WKQRSR+ WL+EG                    +A+R  A                     F    ++D ++ L  + P ++ +MNQAL+  FTEE
Subjt:  DEVYWKQRSREIWLREG--------------------IAIRDAA-------------------VCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEE

Query:  EILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPL
        E+  A++Q  P KAP P+G+P VF +                   +G LL ++N T V LIPKTK    V + RPISLCNV YKLISK L NR+K ILP 
Subjt:  EILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPL

Query:  LISQNQSAFIPGRYVVDNAILGFECIHELRKESGG-----------------------------------------------------------------
        +IS++QSAF+PGR + DN ++ FE +H +  +  G                                                                 
Subjt:  LISQNQSAFIPGRYVVDNAILGFECIHELRKESGG-----------------------------------------------------------------

Query:  ------------------GLNG-------------------------------------ANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGT
                          GLNG                                     A+  E   I+++L  YEKASGQ +N  K+ + FS NT +  
Subjt:  ------------------GLNG-------------------------------------ANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGT

Query:  QQYISQILYVPHCP------------------CHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSE
        Q ++  IL VP                     C  Q            K K+L  +G+EVL+K++VQAIP Y+MNCF LP  L KEI   + +FWW   +
Subjt:  QQYISQILYVPHCP------------------CHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSE

Query:  DTRRIHWMSWEALCLPKCMG-------------------------------SVFSFVFGTEGPLF-SPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRW
        DTR+IHW+ WE LC  K  G                                VFS  F   G +  +  +  G  +W         ++  ++L+  G  W
Subjt:  DTRRIHWMSWEALCLPKCMG-------------------------------SVFSFVFGTEGPLF-SPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRW

Query:  RIGNGRTTPTYGSNWLPNE
        R+G+G   P  GSNWL +E
Subjt:  RIGNGRTTPTYGSNWLPNE

A0A2N9G219 RNase H domain-containing protein1.1e-10732.5Show/hide
Query:  PRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRL-LGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILC
        P  M  I WN RG G+  T + L KLV+ K P VLFLSET +   R+  ++     W  T FYG P T    Q+W+LL  L G    PW   GDFN I+ 
Subjt:  PRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRL-LGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILC

Query:  QDEKDG---------------------------------AGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQ
          EK G                                     R   T + RLDR   T  W   + S VV+HL+ + SDH+P+   LS  P+  ++  +
Subjt:  QDEKDG---------------------------------AGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQ

Query:  RVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLVIA-NKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQ
        ++ RFE+ W   P     +  +W          +P+V A  K ++C  ++ GW R + G+ ++ ++   ++++ A  D       + +I    ++ ++L 
Subjt:  RVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLVIA-NKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQ

Query:  EDEVYWKQRSREIWLREG--------------------IAIRDA-------------------AVCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTE
        ++E  WKQRSRE WL+EG                     A+R A                      F    ++D +V L  + P V+ +MNQ+L   FTE
Subjt:  EDEVYWKQRSREIWLREG--------------------IAIRDA-------------------AVCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTE

Query:  EEILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILP
        EE+L+A++Q  P KAP P+G+P +F +                   +G LL A+N T V LIPKTK+   V + RPISLCNV YKLISK L NR+K ILP
Subjt:  EEILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILP

Query:  LLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG-ANGNEASVIRDLLIW----------YEKASGQTVNYEKSVVAFSPNTEEGTQQYISQIL
         +IS+ QSAF+PGR + DN ++ FE +H +  +  G +   A   + S   D + W          YEKASGQ +N  K+ + FS NT + TQ+ I +IL
Subjt:  LLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG-ANGNEASVIRDLLIW----------YEKASGQTVNYEKSVVAFSPNTEEGTQQYISQIL

Query:  YVP------------------HCPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWM
         VP                     C  Q            K K+L  +G+E+L+K++VQAIP Y+MNCF LP  L KEI   + +FWW  + D R+IHW+
Subjt:  YVP------------------HCPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWM

Query:  SWEALCLPKCMGSVFSFVFGTEGPLFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQIQSG-PSLPATSTVSDL
         WE LC  K  G                 +  G  +W         ++  + L++ G  WR+G+G   P  GSNWL  E   +I S   +LP  + V +L
Subjt:  SWEALCLPKCMGSVFSFVFGTEGPLFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEFSLQIQSG-PSLPATSTVSDL

A0A2N9HYS7 Uncharacterized protein2.5e-9929.02Show/hide
Query:  GGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGG
        G GWFPA P+ M  I WN RG G+  T + L  LV+ K P VLFLSET               W  T FYG P T     +W+LL  L G    PW   G
Subjt:  GGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGG

Query:  DFNAILCQDEKDG---------------------------------AGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPL
        DFN I+   EK G                                     R   T + RLDR   T  W   + S  V+HL+ + SDH+P+   L+  P+
Subjt:  DFNAILCQDEKDG---------------------------------AGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPL

Query:  CWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV-IANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEA
           +  Q++ RFE+ W   PD   +V  +W   P   G  +P+  +  K + C   +  W R++ GN ++ +K   + ++ A  D       + +I    
Subjt:  CWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV-IANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEA

Query:  QLEEVLQEDEVYWKQRSREIWLREG---------------------IAIRDAAVCFIGPSV------------------QDFDVALRDLNPSVSDDMNQA
        ++ ++L ++E  WKQRSR+ WL+EG                       IRD       P+V                  +D +V L  + P V+ +MNQ 
Subjt:  QLEEVLQEDEVYWKQRSREIWLREG---------------------IAIRDAAVCFIGPSV------------------QDFDVALRDLNPSVSDDMNQA

Query:  LLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVN
        L+  FTE E++ A++Q  P KAP P+G+P +F +                   +G LL ++N T V LIPKTK    V + RPISLCNV YKLISK L N
Subjt:  LLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVN

Query:  RMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG------------------------------------------------ANGNE
        R+K ILP +IS++QSAF+PGR + DN ++ FE +H +     G +                                                   NGN 
Subjt:  RMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG------------------------------------------------ANGNE

Query:  ASVI--------------------------------------------RDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVP---------
           I                                            R     YEKASGQ +N  K+ + FS NT +  Q+ I +IL VP         
Subjt:  ASVI--------------------------------------------RDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVP---------

Query:  ---------HCPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMG
                    C  Q            K K+L  +G+E+L+K++VQAIP Y+MNCF LP  L KEI   + +FWW  + D R+IHW+ WE LC  K  G
Subjt:  ---------HCPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMG

Query:  S-------------------------------VFSFVFGTEGPLF-SPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEF
                                        VFS  F  +G +  +  +  G  +W         ++  ++L+  G  WR+GNG   P  GSNWL +E 
Subjt:  S-------------------------------VFSFVFGTEGPLF-SPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNEF

Query:  SLQIQSG-PSLPATSTVSDL
          ++ S    LP  + V +L
Subjt:  SLQIQSG-PSLPATSTVSDL

A0A2N9IWN7 Uncharacterized protein2.5e-9928.88Show/hide
Query:  EERVRCSWSDCIGGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMS--------------------------------
        E+ ++  +S   GGGW PAPP  MS + WN RG G P T + L +LV+EK P VLF+ E+ +  +R+                                 
Subjt:  EERVRCSWSDCIGGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMS--------------------------------

Query:  -SVKRLL--------------GWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGAGI-------------------------
         S+K                  W FTGFYG   T +  ++WSLL  L   +  PW   GDFN +L  +EK G  I                         
Subjt:  -SVKRLL--------------GWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGGDFNAILCQDEKDGAGI-------------------------

Query:  -------SRWG-ETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDIT
               +R G  T++ERLDRV  TT+W  L+P   V+HL    SDH P+ +  S HPL       R+ RF+E WL     ++ + S+W           
Subjt:  -------SRWG-ETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDIT

Query:  PLVIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREG-----------------
           + +K +SC +S+  W R   GN +R +K     ++ A ++       E     + ++  +L  +E  W+QRSR+ WLR G                 
Subjt:  PLVIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREG-----------------

Query:  ---IAIRDA-------------------AVCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRI-------
             I+D                     + F      +FD AL  ++  V+D+MN  L+R FT EE+  AL+Q  P+ AP P+G+  +F +        
Subjt:  ---IAIRDA-------------------AVCFIGPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRI-------

Query:  -----------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESG
                   +G+LL AVN T + LIPKT+  + V+D RPISLCNV YK++SK + NR+K+ILP +IS+ QSAF+P R + DN ++ FE +H ++    
Subjt:  -----------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESG

Query:  GG---------------------------------------------------LNGANGN-----------EASVIRDLLIWYEKASGQTVNYEKSVVAF
        G                                                    +NGA  +           +   I  +L  YE ASGQ VN +K+ + F
Subjt:  GG---------------------------------------------------LNGANGN-----------EASVIRDLLIWYEKASGQTVNYEKSVVAF

Query:  SPNTEEGTQQYISQILYVPHCPCHQQAKG-----------------------------KILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMA
        S +T       +  +L VP    +++  G                             K+L  +GKE+L+K++VQAIP YSM+CF LP  L  EI   + 
Subjt:  SPNTEEGTQQYISQILYVPHCPCHQQAKG-----------------------------KILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMA

Query:  KFWWNGSEDTRRIHWMSWEALCLPKCMG-----SVFSFVFGTEGPLFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNE
        KFWW+   D ++IHW++WE LC  K  G      + +  F     L   V+  G  +W         ++  R +L  G R  IGNG +   +G NWLP  
Subjt:  KFWWNGSEDTRRIHWMSWEALCLPKCMG-----SVFSFVFGTEGPLFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNE

Query:  FSLQIQSGPS-LPATSTVSDL
           Q+ S  +  PA + VS L
Subjt:  FSLQIQSGPS-LPATSTVSDL

A0A2N9J7E4 Uncharacterized protein1.1e-9128.56Show/hide
Query:  GGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGG
        G GWFPA P+ M  I WN RG G+  T + L  LV+ K P VLFLSET               W  T FYG P T     +W+LL  L G    PW   G
Subjt:  GGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDTPWLIGG

Query:  DFNAILCQDEKDG---------------------------------AGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPL
        DFN I+   EK G                                     R   T + RLDR   T  W   + S  V+HL+ + SDH+P+   L+  P+
Subjt:  DFNAILCQDEKDG---------------------------------AGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPL

Query:  CWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV-IANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEA
           +  Q++ RFE+ W   PD   +V  +W   P   G  +P+  +  K + C   +  W R++ GN ++ +K   + ++ A  D       + +I    
Subjt:  CWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDPGDITPLV-IANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEA

Query:  QLEEVLQEDEVYWKQRSREIWLREG---------------------IAIRDAAVCFIGPSV------------------QDFDVALRDLNPSVSDDMNQA
        ++ ++L ++E  WKQRSR+ WL+EG                       IRD       P+V                  +D +V L  + P V+ +MNQ 
Subjt:  QLEEVLQEDEVYWKQRSREIWLREG---------------------IAIRDAAVCFIGPSV------------------QDFDVALRDLNPSVSDDMNQA

Query:  LLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVN
        L+  FTE E++ A++Q  P KAP P+G+P +F +                   +G LL ++N T V LIPKTK    V + RPISLCNV YKLISK L N
Subjt:  LLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRI------------------TGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVN

Query:  RMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG------------------------------------------------ANGNE
        R+K ILP +IS++QSAF+PGR + DN ++ FE +H +     G +                                                   NGN 
Subjt:  RMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRKESGGGLNG------------------------------------------------ANGNE

Query:  ASVIR--------DLLIWY-------------EKAS------GQTVNYEKSVV-----------AFSPNTEEGTQQYISQILYVP---------------
           I         D +  Y              KAS      G + N+ K  +           + +    + +++ I +IL VP               
Subjt:  ASVIR--------DLLIWY-------------EKAS------GQTVNYEKSVV-----------AFSPNTEEGTQQYISQILYVP---------------

Query:  ---HCPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGS-----
              C  Q            K K+L  +G+E+L+K++VQAIP Y+MNCF LP  L KEI   + +FWW  + D R+IHW+ WE LC  K  G      
Subjt:  ---HCPCHQQA-----------KGKILLDSGKEVLLKSIVQAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGS-----

Query:  --------------------------VFSFVFGTEGPLF-SPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNE
                                  VFS  F  +G +  +  +  G  +W         ++  ++L+  G  WR+GNG   P  GSNWL +E
Subjt:  --------------------------VFSFVFGTEGPLF-SPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWLPNE

SwissProt top hitse value%identityAlignment
P14381 Transposon TX1 uncharacterized 149 kDa protein8.0e-1029.19Show/hide
Query:  LREGIAIRDAAVCFI------GPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRITGALLG-------------
        L +  AIRD A  F        P   D    L D  P VS+   + L    T +E+  ALR    NK+P  +GL   F +     LG             
Subjt:  LREGIAIRDAAVCFI------GPSVQDFDVALRDLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRITGALLG-------------

Query:  -----AVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRK
             +    ++ L+PK    R + + RP+SL +  YK+++KA+  R+K++L  +I  +QS  +PGR + DN  L  + +H  R+
Subjt:  -----AVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQSAFIPGRYVVDNAILGFECIHELRK

P93295 Uncharacterized mitochondrial protein AtMg003101.7e-0744Show/hide
Query:  AIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPK
        A+P Y+M+CF L K L K++   M +FWW+  E+ R+I W++W+ LC  K
Subjt:  AIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPK

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein6.7e-0428.83Show/hide
Query:  LRDLNP-SVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVF------------------IRITGALLGAVNDTLVVLIPKTKAARWVADIRPI
        ++D++P   +D +   L  L +++EI  A+     NKAP P+     F                     TG LL   N T + LIPK      ++  RP+
Subjt:  LRDLNP-SVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVF------------------IRITGALLGAVNDTLVVLIPKTKAARWVADIRPI

Query:  SLCNVSYKLIS
        S C V YK+I+
Subjt:  SLCNVSYKLIS

AT4G29090.1 Ribonuclease H-like superfamily protein2.4e-0926.11Show/hide
Query:  AIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGSV--------------------------------FSFVFGTEGPLFS
        A+P Y+M CF LPK + K+I   +A FWW   ++ + +HW +W+ L   K  G +                                 S  F    PL +
Subjt:  AIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGSV--------------------------------FSFVFGTEGPLFS

Query:  PVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWL---PNEFSLQIQSGP-----SLPATSTVSDL
        P   LG +  F        +   +E+L +G R  +GNG     +   WL   P   +L++Q  P     S+ +   VSDL
Subjt:  PVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTTPTYGSNWL---PNEFSLQIQSGP-----SLPATSTVSDL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-0844Show/hide
Query:  AIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPK
        A+P Y+M+CF L K L K++   M +FWW+  E+ R+I W++W+ LC  K
Subjt:  AIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGGAGGGAGGGGTGGATAACGCAGTGAGTCGGCAACCTGAGGCCCTGAAGCCTTCTGCCGAGGGTCTGGGGAGGTGGTGGAAGCTCTGGCTGGTGGGCAGGGGGAT
CAGGGAGGTAGGGGTAGTGGGAGGGGGTTGCGTTGAGGTAGCTGTGAGTGGGGAGGGGTTGGCTGATAGGGAGAAGGGGAAACAGAAGGTGGGGGATGTCGCGATTGCAA
GTGGGGTCACTGAGGGAGGTGAGGATATGATGCTTGTTGACACCATGGGCAGGACAAAGGGGTGGAGACGGGATGTAGAGGGTGTAGAGCTGGGAGGGAAGAGGTCGAAA
GGAGGGGATGAAGAGGAACGGGTCAGGTGTAGCTGGTCCGACTGTATTGGCGGTGGCTGGTTCCCAGCCCCGCCTAGGATTATGAGTCTGATATTTTGGAATGTTCGGGG
GTCGGGGTCACCCCGAACATTCAAGCGCCTGAACAAGTTGGTTCAGGAGAAACGACCTCAGGTGCTCTTCCTGTCTGAAACGAAAGTGTCTGCTAGTCGGATGTCTTCTG
TGAAACGCTTGCTGGGGTGGTGGTTCACTGGCTTCTATGGCTTCCCTGCAACGGACCAGCACTCTCAGACTTGGTCACTCCTATCCAGGCTGAGGGGTTGTACTGATACG
CCATGGCTGATTGGTGGGGATTTCAACGCCATCTTATGTCAGGATGAGAAGGACGGGGCAGGGATAAGCCGCTGGGGTGAGACGATTTATGAACGGTTGGATCGGGTTTT
TGGCACTACAGCTTGGACTGATCTTTATCCGAGCTATGTGGTGAACCATCTCGATTACAGCCAGTCTGATCATAGGCCGGTGGAACTAATCCTTAGTCCCCATCCTCTGT
GTTGGTCCCAAAGTGGTCAGCGGGTTATGCGTTTCGAGGAAACCTGGCTTAGGCAGCCGGATTTGCGACAGTTGGTTGATAGTTCGTGGGGTGCAGGGCCAGGAGATCCT
GGTGATATAACGCCTTTGGTTATTGCCAATAAGGCTAAGAGTTGCATGCACTCGATGGCTGGTTGGGGTCGATCAAAGTCCGGGAATTTCTCGAGGCGCATTAAAATTGC
CAACCAAAAGGTTCAGTCGGCCATCGCTGATCTTAAAACAACTGATTCTCGTGAGTTGCTTATCCAAGCAGAGGCACAATTGGAAGAGGTACTACAGGAAGATGAGGTAT
ATTGGAAGCAGAGGTCTAGGGAGATATGGCTTCGAGAAGGGATCGCAATACGCGATGCAGCTGTTTGCTTCATCGGGCCGAGTGTTCAGGATTTTGATGTGGCTTTGCGA
GACCTAAATCCGTCTGTGAGTGATGATATGAACCAGGCTCTGTTACGCCTGTTTACCGAAGAGGAGATTCTGGTGGCGTTAAGACAAACGCATCCTAATAAAGCCCCCGA
TCCAAATGGGTTGCCGGGAGTTTTTATAAGAATCACTGGGGCATTGTTGGGTGCAGTTAATGATACCTTAGTTGTGCTCATTCCGAAAACCAAGGCGGCTCGATGGGTTG
CAGACATTAGGCCTATCTCCCTCTGTAATGTGAGTTACAAGCTGATCTCAAAGGCCTTGGTTAATCGTATGAAGAATATTCTGCCACTACTAATTTCACAGAACCAGAGT
GCGTTTATTCCAGGGAGATATGTGGTGGATAATGCCATCCTGGGGTTCGAGTGTATTCATGAGCTCCGCAAAGAGTCAGGGGGAGGGCTAAATGGGGCCAATGGGAATGA
AGCGTCGGTTATTCGGGATTTGTTGATATGGTATGAGAAAGCTTCAGGACAGACTGTCAACTATGAGAAATCTGTTGTTGCTTTTAGTCCAAACACGGAGGAGGGGACAC
AACAGTATATTAGCCAAATCCTATATGTGCCCCATTGCCCTTGCCATCAGCAGGCTAAAGGGAAAATTCTTCTCGATAGCGGGAAGGAAGTTCTGCTTAAATCTATAGTC
CAGGCTATTCCTTGCTATTCCATGAACTGTTTCTGGTTGCCCAAGGGTTTGATCAAAGAGATCCACAGGACAATGGCTAAATTTTGGTGGAATGGGTCCGAGGATACGAG
GCGAATTCATTGGATGAGTTGGGAGGCGCTGTGCCTCCCCAAGTGTATGGGATCCGTCTTCTCTTTTGTGTTCGGTACTGAGGGGCCGTTATTTTCCCCAGTCAGGGTTC
TTGGAGGCAAGTCTTGGTTCGCGACCGTCGTTTGTCTGGCGCAGCTTGTTATGGGGCGGGAGCTCTTGGTTCGAGGATGTCGGTGGAGGATAGGCAATGGCCGGACTACG
CCTACTTATGGTTCAAACTGGTTGCCGAATGAGTTCTCTCTCCAAATACAGTCAGGCCCGTCGCTTCCTGCTACTAGTACAGTTAGTGATCTATTTGCCCATCTGGTGGG
TGGAACGAGGTTGTGCTCAGAGCCCATTTTGATGAGGCAGACTCGGGTATCGACTTGCTCATATGTTGGCTGTGCAGGGACCTTCCCTTCGAATCCTGATAGGATGCGTG
CGTGGTGGTCCGACCTTTGGAGGCTAAATGTGCCTACATGTGTGTTCTGTGCGATGAGTTTGTGGAGGATCGTCGGCATCTGTTCTGGAAGTGCCATGTGGTTGAGAGTA
TGTGGTTGTGCTCCAAGTTTGCCCTCTCCATCGGTCCTTTTCCCAATTCCGGTTCGAGGAAGTCATTTGGGCGTGAAGGATAGTCTTCCAGGGCCGGATTTCGAGCTTGT
GATCATTTCTGGTGGTTGTATGGAATCTCCGGAACAATCTGAGTTGGGGTGGCCTGATTCAGGGGAGGCTAGGGGTGGTTGTGTACTGCGTGGGGCTGATGGTGAGGTTT
TTATGGCGGCCTGTTTGAGCTTACAAAGGTGTTGGAGCGTGGATTTGGCAGAGGGTTGGGCTGTGTATAAAGGGGTCCAGCTTGCTCGTCAGCTGGGGTTTGTTGATTTT
GTGGTGGAACCGACTCTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGGAGGGAGGGGTGGATAACGCAGTGAGTCGGCAACCTGAGGCCCTGAAGCCTTCTGCCGAGGGTCTGGGGAGGTGGTGGAAGCTCTGGCTGGTGGGCAGGGGGAT
CAGGGAGGTAGGGGTAGTGGGAGGGGGTTGCGTTGAGGTAGCTGTGAGTGGGGAGGGGTTGGCTGATAGGGAGAAGGGGAAACAGAAGGTGGGGGATGTCGCGATTGCAA
GTGGGGTCACTGAGGGAGGTGAGGATATGATGCTTGTTGACACCATGGGCAGGACAAAGGGGTGGAGACGGGATGTAGAGGGTGTAGAGCTGGGAGGGAAGAGGTCGAAA
GGAGGGGATGAAGAGGAACGGGTCAGGTGTAGCTGGTCCGACTGTATTGGCGGTGGCTGGTTCCCAGCCCCGCCTAGGATTATGAGTCTGATATTTTGGAATGTTCGGGG
GTCGGGGTCACCCCGAACATTCAAGCGCCTGAACAAGTTGGTTCAGGAGAAACGACCTCAGGTGCTCTTCCTGTCTGAAACGAAAGTGTCTGCTAGTCGGATGTCTTCTG
TGAAACGCTTGCTGGGGTGGTGGTTCACTGGCTTCTATGGCTTCCCTGCAACGGACCAGCACTCTCAGACTTGGTCACTCCTATCCAGGCTGAGGGGTTGTACTGATACG
CCATGGCTGATTGGTGGGGATTTCAACGCCATCTTATGTCAGGATGAGAAGGACGGGGCAGGGATAAGCCGCTGGGGTGAGACGATTTATGAACGGTTGGATCGGGTTTT
TGGCACTACAGCTTGGACTGATCTTTATCCGAGCTATGTGGTGAACCATCTCGATTACAGCCAGTCTGATCATAGGCCGGTGGAACTAATCCTTAGTCCCCATCCTCTGT
GTTGGTCCCAAAGTGGTCAGCGGGTTATGCGTTTCGAGGAAACCTGGCTTAGGCAGCCGGATTTGCGACAGTTGGTTGATAGTTCGTGGGGTGCAGGGCCAGGAGATCCT
GGTGATATAACGCCTTTGGTTATTGCCAATAAGGCTAAGAGTTGCATGCACTCGATGGCTGGTTGGGGTCGATCAAAGTCCGGGAATTTCTCGAGGCGCATTAAAATTGC
CAACCAAAAGGTTCAGTCGGCCATCGCTGATCTTAAAACAACTGATTCTCGTGAGTTGCTTATCCAAGCAGAGGCACAATTGGAAGAGGTACTACAGGAAGATGAGGTAT
ATTGGAAGCAGAGGTCTAGGGAGATATGGCTTCGAGAAGGGATCGCAATACGCGATGCAGCTGTTTGCTTCATCGGGCCGAGTGTTCAGGATTTTGATGTGGCTTTGCGA
GACCTAAATCCGTCTGTGAGTGATGATATGAACCAGGCTCTGTTACGCCTGTTTACCGAAGAGGAGATTCTGGTGGCGTTAAGACAAACGCATCCTAATAAAGCCCCCGA
TCCAAATGGGTTGCCGGGAGTTTTTATAAGAATCACTGGGGCATTGTTGGGTGCAGTTAATGATACCTTAGTTGTGCTCATTCCGAAAACCAAGGCGGCTCGATGGGTTG
CAGACATTAGGCCTATCTCCCTCTGTAATGTGAGTTACAAGCTGATCTCAAAGGCCTTGGTTAATCGTATGAAGAATATTCTGCCACTACTAATTTCACAGAACCAGAGT
GCGTTTATTCCAGGGAGATATGTGGTGGATAATGCCATCCTGGGGTTCGAGTGTATTCATGAGCTCCGCAAAGAGTCAGGGGGAGGGCTAAATGGGGCCAATGGGAATGA
AGCGTCGGTTATTCGGGATTTGTTGATATGGTATGAGAAAGCTTCAGGACAGACTGTCAACTATGAGAAATCTGTTGTTGCTTTTAGTCCAAACACGGAGGAGGGGACAC
AACAGTATATTAGCCAAATCCTATATGTGCCCCATTGCCCTTGCCATCAGCAGGCTAAAGGGAAAATTCTTCTCGATAGCGGGAAGGAAGTTCTGCTTAAATCTATAGTC
CAGGCTATTCCTTGCTATTCCATGAACTGTTTCTGGTTGCCCAAGGGTTTGATCAAAGAGATCCACAGGACAATGGCTAAATTTTGGTGGAATGGGTCCGAGGATACGAG
GCGAATTCATTGGATGAGTTGGGAGGCGCTGTGCCTCCCCAAGTGTATGGGATCCGTCTTCTCTTTTGTGTTCGGTACTGAGGGGCCGTTATTTTCCCCAGTCAGGGTTC
TTGGAGGCAAGTCTTGGTTCGCGACCGTCGTTTGTCTGGCGCAGCTTGTTATGGGGCGGGAGCTCTTGGTTCGAGGATGTCGGTGGAGGATAGGCAATGGCCGGACTACG
CCTACTTATGGTTCAAACTGGTTGCCGAATGAGTTCTCTCTCCAAATACAGTCAGGCCCGTCGCTTCCTGCTACTAGTACAGTTAGTGATCTATTTGCCCATCTGGTGGG
TGGAACGAGGTTGTGCTCAGAGCCCATTTTGATGAGGCAGACTCGGGTATCGACTTGCTCATATGTTGGCTGTGCAGGGACCTTCCCTTCGAATCCTGATAGGATGCGTG
CGTGGTGGTCCGACCTTTGGAGGCTAAATGTGCCTACATGTGTGTTCTGTGCGATGAGTTTGTGGAGGATCGTCGGCATCTGTTCTGGAAGTGCCATGTGGTTGAGAGTA
TGTGGTTGTGCTCCAAGTTTGCCCTCTCCATCGGTCCTTTTCCCAATTCCGGTTCGAGGAAGTCATTTGGGCGTGAAGGATAGTCTTCCAGGGCCGGATTTCGAGCTTGT
GATCATTTCTGGTGGTTGTATGGAATCTCCGGAACAATCTGAGTTGGGGTGGCCTGATTCAGGGGAGGCTAGGGGTGGTTGTGTACTGCGTGGGGCTGATGGTGAGGTTT
TTATGGCGGCCTGTTTGAGCTTACAAAGGTGTTGGAGCGTGGATTTGGCAGAGGGTTGGGCTGTGTATAAAGGGGTCCAGCTTGCTCGTCAGCTGGGGTTTGTTGATTTT
GTGGTGGAACCGACTCTTTGA
Protein sequenceShow/hide protein sequence
MQEGGVDNAVSRQPEALKPSAEGLGRWWKLWLVGRGIREVGVVGGGCVEVAVSGEGLADREKGKQKVGDVAIASGVTEGGEDMMLVDTMGRTKGWRRDVEGVELGGKRSK
GGDEEERVRCSWSDCIGGGWFPAPPRIMSLIFWNVRGSGSPRTFKRLNKLVQEKRPQVLFLSETKVSASRMSSVKRLLGWWFTGFYGFPATDQHSQTWSLLSRLRGCTDT
PWLIGGDFNAILCQDEKDGAGISRWGETIYERLDRVFGTTAWTDLYPSYVVNHLDYSQSDHRPVELILSPHPLCWSQSGQRVMRFEETWLRQPDLRQLVDSSWGAGPGDP
GDITPLVIANKAKSCMHSMAGWGRSKSGNFSRRIKIANQKVQSAIADLKTTDSRELLIQAEAQLEEVLQEDEVYWKQRSREIWLREGIAIRDAAVCFIGPSVQDFDVALR
DLNPSVSDDMNQALLRLFTEEEILVALRQTHPNKAPDPNGLPGVFIRITGALLGAVNDTLVVLIPKTKAARWVADIRPISLCNVSYKLISKALVNRMKNILPLLISQNQS
AFIPGRYVVDNAILGFECIHELRKESGGGLNGANGNEASVIRDLLIWYEKASGQTVNYEKSVVAFSPNTEEGTQQYISQILYVPHCPCHQQAKGKILLDSGKEVLLKSIV
QAIPCYSMNCFWLPKGLIKEIHRTMAKFWWNGSEDTRRIHWMSWEALCLPKCMGSVFSFVFGTEGPLFSPVRVLGGKSWFATVVCLAQLVMGRELLVRGCRWRIGNGRTT
PTYGSNWLPNEFSLQIQSGPSLPATSTVSDLFAHLVGGTRLCSEPILMRQTRVSTCSYVGCAGTFPSNPDRMRAWWSDLWRLNVPTCVFCAMSLWRIVGICSGSAMWLRV
CGCAPSLPSPSVLFPIPVRGSHLGVKDSLPGPDFELVIISGGCMESPEQSELGWPDSGEARGGCVLRGADGEVFMAACLSLQRCWSVDLAEGWAVYKGVQLARQLGFVDF
VVEPTL