; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg026791 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg026791
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationscaffold13:26825323..26836671
RNA-Seq ExpressionSpg026791
SyntenySpg026791
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025836 - Zinc knuckle CX2CX4HX4C
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ABA98491.1 retrotransposon protein, putative, unclassified [Oryza sativa Japonica Group]4.1e-4526.78Show/hide
Query:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKVKMDYGVEL
        ++  +A+G  +GEF+  +++E+    G+ LR+K+++DI+KPL RG  +  G+     W    YE LPDFCY CG +GHT   C+++  EG+  + +   L
Subjt:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKVKMDYGVEL

Query:  RYTQGSKGFYKG---KKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGS--------------------KEEGVGFYALK
        R+    K +  G   K +G R   QR  G  +  +G    SW +    E  D S+G  +  +  D+  S                     ++G    A K
Subjt:  RYTQGSKGFYKG---KKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGS--------------------KEEGVGFYALK

Query:  LVDS----------------ELVSLGLEQRALPSHW--------PERGFLFDGWTTNRLFIRGALVLKET--EVEECVGGAEKEKPAGMEMIKLGKDGRG
         V +                E    G   + + S          P+ G L DG  +        +V+K+T   +     GA+   P      KL +  +G
Subjt:  LVDS----------------ELVSLGLEQRALPSHW--------PERGFLFDGWTTNRLFIRGALVLKET--EVEECVGGAEKEKPAGMEMIKLGKDGRG

Query:  VSDMGTLTRLLKAQVGGGGSQASTGTRPLEKEGF-LGP--VGEARRYL---------------SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSI
          + G   +  + +  GGG +   G +P +K G  + P   G+A                   + P +  GDFNEIL S EK GG  + Q+ M  FR ++
Subjt:  VSDMGTLTRLLKAQVGGGGSQASTGTRPLEKEGF-LGP--VGEARRYL---------------SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSI

Query:  DACNLIDLGCPQGTFTWIKRVRG-GSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEGSWLAFGECREIVK
          C L DLG     FTW          I+ERLDR  AN E  + F   ++ +     SDH P+++ L    + V+ + G    +FE +WL   + +E+VK
Subjt:  DACNLIDLGCPQGTFTWIKRVRG-GSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEGSWLAFGECREIVK

Query:  LHWSNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNL-RLKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKW
          W              +      LS+W+   L G ++  +K  K+E++      + R + V+  V    L+KL ++ +I+WK ++   WL  GDRNT +
Subjt:  LHWSNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNL-RLKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKW

Query:  FHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS
        FH+  +  ++ N I      +G WV+ +E+    I ++F  LF+S+
Subjt:  FHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS

XP_015619246.1 uncharacterized protein LOC107279669 [Oryza sativa Japonica Group]4.1e-4526.78Show/hide
Query:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKVKMDYGVEL
        ++  +A+G  +GEF+  +++E+    G+ LR+K+++DI+KPL RG  +  G+     W    YE LPDFCY CG +GHT   C+++  EG+  + +   L
Subjt:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKVKMDYGVEL

Query:  RYTQGSKGFYKG---KKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGS--------------------KEEGVGFYALK
        R+    K +  G   K +G R   QR  G  +  +G    SW +    E  D S+G  +  +  D+  S                     ++G    A K
Subjt:  RYTQGSKGFYKG---KKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGS--------------------KEEGVGFYALK

Query:  LVDS----------------ELVSLGLEQRALPSHW--------PERGFLFDGWTTNRLFIRGALVLKET--EVEECVGGAEKEKPAGMEMIKLGKDGRG
         V +                E    G   + + S          P+ G L DG  +        +V+K+T   +     GA+   P      KL +  +G
Subjt:  LVDS----------------ELVSLGLEQRALPSHW--------PERGFLFDGWTTNRLFIRGALVLKET--EVEECVGGAEKEKPAGMEMIKLGKDGRG

Query:  VSDMGTLTRLLKAQVGGGGSQASTGTRPLEKEGF-LGP--VGEARRYL---------------SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSI
          + G   +  + +  GGG +   G +P +K G  + P   G+A                   + P +  GDFNEIL S EK GG  + Q+ M  FR ++
Subjt:  VSDMGTLTRLLKAQVGGGGSQASTGTRPLEKEGF-LGP--VGEARRYL---------------SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSI

Query:  DACNLIDLGCPQGTFTWIKRVRG-GSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEGSWLAFGECREIVK
          C L DLG     FTW          I+ERLDR  AN E  + F   ++ +     SDH P+++ L    + V+ + G    +FE +WL   + +E+VK
Subjt:  DACNLIDLGCPQGTFTWIKRVRG-GSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEGSWLAFGECREIVK

Query:  LHWSNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNL-RLKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKW
          W              +      LS+W+   L G ++  +K  K+E++      + R + V+  V    L+KL ++ +I+WK ++   WL  GDRNT +
Subjt:  LHWSNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNL-RLKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKW

Query:  FHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS
        FH+  +  ++ N I      +G WV+ +E+    I ++F  LF+S+
Subjt:  FHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS

XP_024037590.1 uncharacterized protein LOC112097210 [Citrus clementina]1.6e-4435.02Show/hide
Query:  PLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVS
        P +  GDFNEIL   EK+GG +RN + ++AFR +++ CNLIDLGC    FTW  R  G  LI+ERLDRF  + +       + + +L    SDH P+M+ 
Subjt:  PLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVS

Query:  LGEREV---KRKIGSGPIKFEGSWLAFGECREIVKLHW----SNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDV
        + ER      +K  S  + +E  W  +  C+ IVK  W    S  Q      F     +CL +L  W++   +G  +   ++KK+  +   N   R +  
Subjt:  LGEREV---KRKIGSGPIKFEGSWLAFGECREIVKLHW----SNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDV

Query:  KLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSSTVEHQALRRIMEG
        ++   E +++K+L +EE++WK +SR +WLK GD+NTK+FHS+A+S K+ N I G  + + VWVD+ E + ++  +YF++LF++S+   + +   + G
Subjt:  KLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSSTVEHQALRRIMEG

XP_030924745.1 uncharacterized protein LOC115951731 [Quercus lobata]5.5e-4236.52Show/hide
Query:  SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIM
        S P +  GDFNEI   +EK+G  +R + QM  FR ++D C L DLG     FTW  R  G   +  RLDR  A  E + +F   +I HL    SDH PI+
Subjt:  SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIM

Query:  VSLGEREVKRKIGSG-PIKFEGSWLAFGECREIVKLHWSNPQL-SSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKD-VKL
        +   + E+KR    G P +FE  W+    C ++++  W   +L +S   F+ K+      L  WN+    G +++++  K +E++ +  S   + D  ++
Subjt:  VSLGEREVKRKIGSG-PIKFEGSWLAFGECREIVKLHWSNPQL-SSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKD-VKL

Query:  GVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS
         +   E+++L  +EE  WK +SR  WLK GDRNT +FH RAT   K NLI G  +  G WVD +E++GR +  YF N+F+SS
Subjt:  GVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS

XP_030940268.1 uncharacterized protein LOC115965235 [Quercus lobata]3.2e-4236.81Show/hide
Query:  PLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVS
        P V  GDFNEI +  EK+GGA R+  QM  FR  +D C   D+G     FTW      G L+  RLDR  A+ E + KF  +++ HL+   SDH PI + 
Subjt:  PLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVS

Query:  LGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHW-----SNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMK-----KEEIQAIMNSNL-R
          +   +      P +FE  WL    C  +V   W     +NP L   H    K+ +C   L +WNK  + G+I+  +  K     K EI A+   N  +
Subjt:  LGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHW-----SNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMK-----KEEIQAIMNSNL-R

Query:  LKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS
        +KD+K      E++KLL+ EE  W  +++ +WL++GDRN+K+FH RA+   K N I G  +  G+WVD +E +G  +++Y+S+LFSSS
Subjt:  LKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS

TrEMBL top hitse value%identityAlignment
A0A2N9F7A6 Uncharacterized protein9.8e-4527.04Show/hide
Query:  LGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCE-----GKVKMDYG----
        +G ++G  +  +V EN    G  LR ++ +DI KP+ RG  I   S+ +  W+   YE+LP  C++CG +GHT  +C    C      G+V   YG    
Subjt:  LGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCE-----GKVKMDYG----

Query:  -VELRYTQGSKGFYKGKKAGFRDISQRGRGRGNFFQGRGR---ESWNRGRNSEEEDDSEG-------------------SSQSKETPDDNGSKEEGVGFY
          EL   +  +G+ + +K      +   +G  NF   R     ES  RG +S    + E                    +++ KE+      KE+GV F 
Subjt:  -VELRYTQGSKGFYKGKKAGFRDISQRGRGRGNFFQGRGR---ESWNRGRNSEEEDDSEG-------------------SSQSKETPDDNGSKEEGVGFY

Query:  ALK--------------LVDSELVSLGLEQRALP---------SHWPER---GFLFDGWTTNRLFIRGALVLKETEVEECVGGAEK----EKPAG-MEMI
         L               +V+   V+  +  ++LP          H P       +F        + +G  V K   + +      +      PAG  E +
Subjt:  ALK--------------LVDSELVSLGLEQRALP---------SHWPER---GFLFDGWTTNRLFIRGALVLKETEVEECVGGAEK----EKPAG-MEMI

Query:  KLGKDGRGVS-DMGTLTRLLKAQVG---GGGSQASTGTRPLEKE--------GFLGPVGEARR------------YLSAPLVAGGDFNEILTSKEKMGGA
        KL   G       G L  L  ++V       S+       ++KE        GF G     RR            + ++P +  GDFNEIL + E++G  
Subjt:  KLGKDGRGVS-DMGTLTRLLKAQVG---GGGSQASTGTRPLEKE--------GFLGPVGEARR------------YLSAPLVAGGDFNEILTSKEKMGGA

Query:  ERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEG
         R + Q+  FR +I    L DLG     FTW  +  G + +  RLDR  A+   +++++   ++HL    SDH P+++ +  G   VKRK      +FE 
Subjt:  ERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEG

Query:  SWLAFGECREIVKLHWSNPQLSSTHNFSL--KITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDVKLGVAERELDKLLEEEEIFWKFKS
         W    +CR ++   WS      +  F +  K+  C   L  W++ R  GSI ++IK+K+E++Q   N +      +L   + EL+ LLE+EEIFW+ +S
Subjt:  SWLAFGECREIVKLHWSNPQLSSTHNFSL--KITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDVKLGVAERELDKLLEEEEIFWKFKS

Query:  REEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSSTVEHQALRRIMEG
        R  W+  GD+NTK+FH+  T  ++ NLIKG Y+ + +W     ++      YF N+F+SS      +   +EG
Subjt:  REEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSSTVEHQALRRIMEG

A0A7N2LLK4 Uncharacterized protein4.3e-4829.47Show/hide
Query:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGK----VKMDY
        R+ A  +G+ +GE V  E     +     +RVKV L + KPLRRG  I  GS  ER W+   YE+LP  C++CG LGH +H C       K    V+  Y
Subjt:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGK----VKMDY

Query:  GVELRYTQGS-KGFYKGKKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGSKEEGVGFYALKLVDSELVSLGLEQRALPS
        G  L+ T GS +   K       + S  G GRG        ++ N  R S++  D+  +  +   P    S   G    A+K++      LG       +
Subjt:  GVELRYTQGS-KGFYKGKKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGSKEEGVGFYALKLVDSELVSLGLEQRALPS

Query:  HWPERGF--LFDGWTTNRLFIRGALVLKETEVEECVGGAEKEKPAGMEMIKLGKDGRGVS---------DMGTLTR---LLKAQVGGGGSQASTGTRPLE
         W  +    L         F+    + KE   E+C         A   ++K    G G++         D+   T    L K     G     TG     
Subjt:  HWPERGF--LFDGWTTNRLFIRGALVLKETEVEECVGGAEKEKPAGMEMIKLGKDGRGVS---------DMGTLTR---LLKAQVGGGGSQASTGTRPLE

Query:  KEGFLGPVGEARRYL----SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKF
        +        E  R+L      P V  GD+N  L S EK+       NQM AFR +++ C+L DLG     +TW  +  G +  + RLDR     E   KF
Subjt:  KEGFLGPVGEARRYL----SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKF

Query:  EEIKIFHLSRHGSDHHPIMVSLGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHW--SNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKK
            + HLS H SDH PIM+ +      R  G    KFE +WL   EC E+V   W     + +       KI  C  +L  W   R +   +  IK+ +
Subjt:  EEIKIFHLSRHGSDHHPIMVSLGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHW--SNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKK

Query:  EEIQAIMNSNLRLKD-VKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSS
        + ++ + ++ L  +         ++LD+LL ++EI+W  +SR  WLK GD+N K+FHS+A+  ++ N I+G  NSN  WV+  E++ R    YF NLF +
Subjt:  EEIQAIMNSNLRLKD-VKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSS

Query:  STVE
           +
Subjt:  STVE

A0A7N2R0C3 Reverse transcriptase domain-containing protein4.7e-4726.55Show/hide
Query:  RRRKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKV----KM
        R ++   A+G SIG+F+  +V+E     G  LRV+V++D+ + L RG  I      E +W+   YE+LP+FCY CG L H + +C EE  + K      +
Subjt:  RRRKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKV----KM

Query:  DYGVELRYTQGSK-----GFYKGKKAG--------------FRDISQRGRGR------------------GNFFQGRGRESWNRGRNSEEEDDSEGSS--
         YG  LR     K     GF K K  G               RD  Q G  R                  G+   G    +  +G N   E      S  
Subjt:  DYGVELRYTQGSK-----GFYKGKKAG--------------FRDISQRGRGR------------------GNFFQGRGRESWNRGRNSEEEDDSEGSS--

Query:  ------QSKETPDDNGSKEE------------------GVGFYALKLVDSELVSLGL------EQRALPSHWPERGFLFDGWTTNRL---------FIR-
              +++E  + NG KE                   G+G   +K     +V LGL      +      + PE     +GW  N+L          IR 
Subjt:  ------QSKETPDDNGSKEE------------------GVGFYALKLVDSELVSLGL------EQRALPSHWPERGFLFDGWTTNRL---------FIR-

Query:  ------------------GALVLKETEVEECVGGAEKEKPAGMEMIKLGKDGRGVSDMGTLTRL--------------------------LKAQVG----
                          G L L+  E+++ V  +++ K    +M  L  + RG+     +  L                          L+ ++G    
Subjt:  ------------------GALVLKETEVEECVGGAEKEKPAGMEMIKLGKDGRGVSDMGTLTRL--------------------------LKAQVG----

Query:  ---------GGGSQ-----------------------ASTGTRPLEKEGFLGPVGEARRYLS------------APLVAGGDFNEILTSKEKMGGAERNQ
                 GG +                         S G  P    GF G      R +S             P V  GDFNEIL S EK+G  ER+ 
Subjt:  ---------GGGSQ-----------------------ASTGTRPLEKEGFLGPVGEARRYLS------------APLVAGGDFNEILTSKEKMGGAERNQ

Query:  NQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSLGEREVKRKIGSGPIKFEGSWLAFG
         QM  FR  +  C L+DLG     FTW     G      RLDR  AN+E ++ F E K+ H S   SDH  + +S+  RE  RK+      FE  W    
Subjt:  NQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSLGEREVKRKIGSGPIKFEGSWLAFG

Query:  ECREIVKLHWS----NPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDV-KLGVAERELDKLLEEEEIFWKFKSREE
         CRE+++  W     NP+L+  +    ++  C  +L NWN+ R+ G++   +K K+  +Q +   NL  +   ++   ++E+++++  EEI W  +SR  
Subjt:  ECREIVKLHWS----NPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDV-KLGVAERELDKLLEEEEIFWKFKSREE

Query:  WLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS
        W+K+GDRNT++FH+ A + ++ N I+G  +S G W +N+EE+   I +YF  ++SS+
Subjt:  WLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS

A0A803QC75 Uncharacterized protein1.3e-4428.64Show/hide
Query:  DIDKTDKDFQNFMACKILSPRTIKRRRKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCG
        ++ K D  F +F   +I     + + +  A ALGN IGEF+    D   +  G  LRV+VKL + KPL RG  IK   + +  WI   YE++P+FC+ CG
Subjt:  DIDKTDKDFQNFMACKILSPRTIKRRRKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCG

Query:  KLGHTVHEC----DEESCEGKVKMDYGVELRYTQGSKGFYKGKKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGSKEEG
         LGH   +C    +         + YG  L+  +     Y   +  F         +GN +    R      R +        SS+S   P      E  
Subjt:  KLGHTVHEC----DEESCEGKVKMDYGVELRYTQGSKGFYKGKKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGSKEEG

Query:  VGFYALKL-----VDSELVSLGLEQRALPSHWPERGFLFDGWTTNRLFIRGALVLKETEVEECVGGAEKEKPAGMEMIKLGKDGRGV---SDMGTLTRLL
             L+      V   L+S       +    P         TT+ L          T        A      G+E+ ++G  G  +   SD   +T L 
Subjt:  VGFYALKL-----VDSELVSLGLEQRALPSHWPERGFLFDGWTTNRLFIRGALVLKETEVEECVGGAEKEKPAGMEMIKLGKDGRGV---SDMGTLTRLL

Query:  KAQVGGGGSQASTGTRPLEKEGFLGPVGEARRYLS-------------APLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFT
                     G+      GF G    A +  S              P +A GDFNEIL++ +K GG+ R ++ M AFR+S+D C L ++      FT
Subjt:  KAQVGGGGSQASTGTRPLEKEGFLGPVGEARRYLS-------------APLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSIDACNLIDLGCPQGTFT

Query:  WIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVS---LGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHWSNPQLSSTHNFSL
        W K       +KERLD  F N++  S F    + HL    SDH  + VS   L + E   +      +FE  WLA  E  EI+   W++  +S      L
Subjt:  WIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVS---LGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHWSNPQLSSTHNFSL

Query:  -KITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKD--VKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLI
          +++C   L +W+  +  G ++  I   + ++  + NSN    D   +L  AE  L+ LLE+EE++W+ +SR +WL  GDRNTK+FH++A+S K  N I
Subjt:  -KITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKD--VKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLI

Query:  KGFYNSNGVWVDNDEEMGREISKYFSNLFSSSTVEHQAL
        K  +NS G  V +  ++   +  ++S+LFSS++V+ +AL
Subjt:  KGFYNSNGVWVDNDEEMGREISKYFSNLFSSSTVEHQAL

Q2QQV8 Retrotransposon protein, putative, unclassified2.0e-4526.78Show/hide
Query:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKVKMDYGVEL
        ++  +A+G  +GEF+  +++E+    G+ LR+K+++DI+KPL RG  +  G+     W    YE LPDFCY CG +GHT   C+++  EG+  + +   L
Subjt:  RKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKVKMDYGVEL

Query:  RYTQGSKGFYKG---KKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGS--------------------KEEGVGFYALK
        R+    K +  G   K +G R   QR  G  +  +G    SW +    E  D S+G  +  +  D+  S                     ++G    A K
Subjt:  RYTQGSKGFYKG---KKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGS--------------------KEEGVGFYALK

Query:  LVDS----------------ELVSLGLEQRALPSHW--------PERGFLFDGWTTNRLFIRGALVLKET--EVEECVGGAEKEKPAGMEMIKLGKDGRG
         V +                E    G   + + S          P+ G L DG  +        +V+K+T   +     GA+   P      KL +  +G
Subjt:  LVDS----------------ELVSLGLEQRALPSHW--------PERGFLFDGWTTNRLFIRGALVLKET--EVEECVGGAEKEKPAGMEMIKLGKDGRG

Query:  VSDMGTLTRLLKAQVGGGGSQASTGTRPLEKEGF-LGP--VGEARRYL---------------SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSI
          + G   +  + +  GGG +   G +P +K G  + P   G+A                   + P +  GDFNEIL S EK GG  + Q+ M  FR ++
Subjt:  VSDMGTLTRLLKAQVGGGGSQASTGTRPLEKEGF-LGP--VGEARRYL---------------SAPLVAGGDFNEILTSKEKMGGAERNQNQMSAFRSSI

Query:  DACNLIDLGCPQGTFTWIKRVRG-GSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEGSWLAFGECREIVK
          C L DLG     FTW          I+ERLDR  AN E  + F   ++ +     SDH P+++ L    + V+ + G    +FE +WL   + +E+VK
Subjt:  DACNLIDLGCPQGTFTWIKRVRG-GSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSL--GEREVKRKIGSGPIKFEGSWLAFGECREIVK

Query:  LHWSNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNL-RLKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKW
          W              +      LS+W+   L G ++  +K  K+E++      + R + V+  V    L+KL ++ +I+WK ++   WL  GDRNT +
Subjt:  LHWSNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNL-RLKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKW

Query:  FHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS
        FH+  +  ++ N I      +G WV+ +E+    I ++F  LF+S+
Subjt:  FHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein3.5e-1023.86Show/hide
Query:  LVAGGDFNEILTSKEKMGGAERN--QNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKF-EEIKIFHLSRHGSDHHPIM
        ++  GDF++I  + +     + +     +  F++ +   +L+D+      +TW        +I+ +LDR  AN +  S F   I +F LS   SDH P +
Subjt:  LVAGGDFNEILTSKEKMGGAERN--QNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKF-EEIKIFHLSRHGSDHHPIM

Query:  VSLGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHWSNPQLSSTHNFSL-KITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDVKL--
        + L     + K       F  +   F      + + W       +H FSL +     +K       +  G+IQ   K   + +++I +  L      L  
Subjt:  VSLGEREVKRKIGSGPIKFEGSWLAFGECREIVKLHWSNPQLSSTHNFSL-KITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDVKL--

Query:  --GVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSST
           VA ++ +      E F++ KSR +WL+ GD NT++FH    + +  NLIK     + V V+N  ++   I  Y+++L  S +
Subjt:  --GVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTKKCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSST

AT5G36228.1 nucleic acid binding;zinc ion binding4.4e-0533.77Show/hide
Query:  LGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHEC
        + +++GE VA + +E    +   +RVKV++D  +PLR    ++  S  ER  I   YEKL   C  C ++ H V  C
Subjt:  LGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHEC


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTCACTCGAGGGCTGGAACGCTTGATCTCGAGTCTGAACCTTCAATGGATTCTTGGATCTTGAAGTCTTGGAGTCTTGAAGAGTCTTTGGGAGACAACGGCTATGA
ACAAGGGGGGCAACTAGGAGGTTTGCCAGTAAGGGATCCGTTGGTGATCGAGTGGGGAAGGCATGCAGGCGCGAACAGGATGGAATCCAAGGACGATGCAGACGAAGTTC
AAAGGAAACTGGAAAGGCTCGGTTTGGAAGAAGAAGAAAGGGGTCAGATTGTTGAAATCGAAGACGACGATATTGACAAAACCGACAAGGACTTTCAGAACTTCATGGCT
TGTAAAATCCTATCCCCACGAACCATAAAACGCAGAAGGAAGTATGCGGTTGCTCTCGGAAATTCCATAGGAGAGTTTGTGGCAGCTGAAGTTGATGAGAACGAGAAAAT
GGAAGGGGAAACTCTTCGAGTCAAAGTGAAATTGGATATCCAAAAGCCGTTAAGACGTGGAACAAACATAAAAACTGGATCGATGGCGGAAAGGAAGTGGATTAAAGCGA
CTTACGAGAAATTACCAGACTTTTGCTACTACTGTGGTAAGCTGGGTCACACGGTCCATGAATGTGATGAAGAAAGTTGCGAAGGAAAGGTTAAGATGGACTATGGGGTT
GAATTAAGATACACTCAAGGTAGCAAAGGCTTTTACAAAGGAAAGAAAGCGGGATTCCGAGATATAAGTCAAAGGGGTAGAGGCAGAGGGAATTTTTTCCAAGGAAGAGG
AAGAGAAAGCTGGAACAGAGGGAGAAATTCCGAGGAGGAAGATGACAGTGAGGGGAGCAGTCAGTCTAAGGAAACTCCAGATGATAATGGATCGAAGGAAGAAGGTGTTG
GGTTTTATGCCCTAAAACTCGTAGATAGTGAATTGGTGTCTCTGGGTCTTGAACAAAGGGCCCTACCCTCTCACTGGCCCGAGAGGGGTTTTCTATTTGATGGTTGGACC
ACAAATAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGAAACAGAGGTCGAGGAATGTGTGGGAGGTGCCGAAAAGGAAAAACCGGCTGGGATGGAAATGATAAAATT
AGGCAAAGACGGCAGGGGAGTCTCAGATATGGGCACATTGACGCGACTGTTAAAAGCTCAGGTGGGTGGTGGAGGTTCACAGGCTTCTACGGGAACCCGACCCCTAGAAA
AGGAAGGATTCTTGGGACCTGTTGGTGAAGCTAGGAGATATCTCTCAGCTCCCTTGGTTGCTGGGGGGGATTTCAATGAGATCCTCACGTCCAAGGAGAAGATGGGTGGC
GCGGAGAGAAACCAGAACCAAATGAGCGCCTTTAGATCATCCATAGATGCCTGCAATCTTATTGATTTGGGGTGCCCTCAGGGCACGTTCACTTGGATCAAAAGAGTGAG
GGGCGGTAGCTTGATTAAAGAGAGGTTGGATCGGTTCTTTGCAAATGACGAGCTGATTTCTAAATTCGAGGAGATCAAAATCTTTCATTTGAGCAGGCATGGCTCTGACC
ATCACCCCATCATGGTTAGTTTGGGGGAGCGCGAAGTCAAGAGAAAGATCGGGTCGGGACCGATCAAGTTTGAAGGCAGCTGGCTTGCCTTTGGTGAGTGTCGGGAGATT
GTTAAACTTCATTGGAGCAATCCCCAGCTATCTTCCACGCATAACTTTAGCCTCAAGATAACATCTTGTCTGAGGAAGCTTAGTAATTGGAACAAGACTCGGCTCAACGG
ATCAATCCAATCAGCCATTAAGATGAAAAAGGAAGAAATCCAAGCCATTATGAATAGCAACCTGAGGTTAAAGGATGTCAAGCTTGGAGTGGCCGAAAGAGAGCTTGATA
AGCTTCTAGAGGAAGAAGAGATTTTCTGGAAGTTCAAATCGCGAGAGGAGTGGCTCAAATGGGGTGATCGTAATACCAAGTGGTTCCATTCTAGAGCTACATCCACGAAA
AAGTGTAATCTTATCAAAGGCTTTTACAATAGTAATGGGGTGTGGGTGGACAACGACGAAGAGATGGGTAGAGAGATCTCCAAATATTTCAGCAACCTCTTCTCCTCATC
TACGGTCGAGCATCAAGCCTTAAGAAGAATCATGGAAGGGGGCAGCCGAGAGACTACAACCCACATTCTCTGGAGATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTTCACTCGAGGGCTGGAACGCTTGATCTCGAGTCTGAACCTTCAATGGATTCTTGGATCTTGAAGTCTTGGAGTCTTGAAGAGTCTTTGGGAGACAACGGCTATGA
ACAAGGGGGGCAACTAGGAGGTTTGCCAGTAAGGGATCCGTTGGTGATCGAGTGGGGAAGGCATGCAGGCGCGAACAGGATGGAATCCAAGGACGATGCAGACGAAGTTC
AAAGGAAACTGGAAAGGCTCGGTTTGGAAGAAGAAGAAAGGGGTCAGATTGTTGAAATCGAAGACGACGATATTGACAAAACCGACAAGGACTTTCAGAACTTCATGGCT
TGTAAAATCCTATCCCCACGAACCATAAAACGCAGAAGGAAGTATGCGGTTGCTCTCGGAAATTCCATAGGAGAGTTTGTGGCAGCTGAAGTTGATGAGAACGAGAAAAT
GGAAGGGGAAACTCTTCGAGTCAAAGTGAAATTGGATATCCAAAAGCCGTTAAGACGTGGAACAAACATAAAAACTGGATCGATGGCGGAAAGGAAGTGGATTAAAGCGA
CTTACGAGAAATTACCAGACTTTTGCTACTACTGTGGTAAGCTGGGTCACACGGTCCATGAATGTGATGAAGAAAGTTGCGAAGGAAAGGTTAAGATGGACTATGGGGTT
GAATTAAGATACACTCAAGGTAGCAAAGGCTTTTACAAAGGAAAGAAAGCGGGATTCCGAGATATAAGTCAAAGGGGTAGAGGCAGAGGGAATTTTTTCCAAGGAAGAGG
AAGAGAAAGCTGGAACAGAGGGAGAAATTCCGAGGAGGAAGATGACAGTGAGGGGAGCAGTCAGTCTAAGGAAACTCCAGATGATAATGGATCGAAGGAAGAAGGTGTTG
GGTTTTATGCCCTAAAACTCGTAGATAGTGAATTGGTGTCTCTGGGTCTTGAACAAAGGGCCCTACCCTCTCACTGGCCCGAGAGGGGTTTTCTATTTGATGGTTGGACC
ACAAATAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGAAACAGAGGTCGAGGAATGTGTGGGAGGTGCCGAAAAGGAAAAACCGGCTGGGATGGAAATGATAAAATT
AGGCAAAGACGGCAGGGGAGTCTCAGATATGGGCACATTGACGCGACTGTTAAAAGCTCAGGTGGGTGGTGGAGGTTCACAGGCTTCTACGGGAACCCGACCCCTAGAAA
AGGAAGGATTCTTGGGACCTGTTGGTGAAGCTAGGAGATATCTCTCAGCTCCCTTGGTTGCTGGGGGGGATTTCAATGAGATCCTCACGTCCAAGGAGAAGATGGGTGGC
GCGGAGAGAAACCAGAACCAAATGAGCGCCTTTAGATCATCCATAGATGCCTGCAATCTTATTGATTTGGGGTGCCCTCAGGGCACGTTCACTTGGATCAAAAGAGTGAG
GGGCGGTAGCTTGATTAAAGAGAGGTTGGATCGGTTCTTTGCAAATGACGAGCTGATTTCTAAATTCGAGGAGATCAAAATCTTTCATTTGAGCAGGCATGGCTCTGACC
ATCACCCCATCATGGTTAGTTTGGGGGAGCGCGAAGTCAAGAGAAAGATCGGGTCGGGACCGATCAAGTTTGAAGGCAGCTGGCTTGCCTTTGGTGAGTGTCGGGAGATT
GTTAAACTTCATTGGAGCAATCCCCAGCTATCTTCCACGCATAACTTTAGCCTCAAGATAACATCTTGTCTGAGGAAGCTTAGTAATTGGAACAAGACTCGGCTCAACGG
ATCAATCCAATCAGCCATTAAGATGAAAAAGGAAGAAATCCAAGCCATTATGAATAGCAACCTGAGGTTAAAGGATGTCAAGCTTGGAGTGGCCGAAAGAGAGCTTGATA
AGCTTCTAGAGGAAGAAGAGATTTTCTGGAAGTTCAAATCGCGAGAGGAGTGGCTCAAATGGGGTGATCGTAATACCAAGTGGTTCCATTCTAGAGCTACATCCACGAAA
AAGTGTAATCTTATCAAAGGCTTTTACAATAGTAATGGGGTGTGGGTGGACAACGACGAAGAGATGGGTAGAGAGATCTCCAAATATTTCAGCAACCTCTTCTCCTCATC
TACGGTCGAGCATCAAGCCTTAAGAAGAATCATGGAAGGGGGCAGCCGAGAGACTACAACCCACATTCTCTGGAGATGA
Protein sequenceShow/hide protein sequence
MFHSRAGTLDLESEPSMDSWILKSWSLEESLGDNGYEQGGQLGGLPVRDPLVIEWGRHAGANRMESKDDADEVQRKLERLGLEEEERGQIVEIEDDDIDKTDKDFQNFMA
CKILSPRTIKRRRKYAVALGNSIGEFVAAEVDENEKMEGETLRVKVKLDIQKPLRRGTNIKTGSMAERKWIKATYEKLPDFCYYCGKLGHTVHECDEESCEGKVKMDYGV
ELRYTQGSKGFYKGKKAGFRDISQRGRGRGNFFQGRGRESWNRGRNSEEEDDSEGSSQSKETPDDNGSKEEGVGFYALKLVDSELVSLGLEQRALPSHWPERGFLFDGWT
TNRLFIRGALVLKETEVEECVGGAEKEKPAGMEMIKLGKDGRGVSDMGTLTRLLKAQVGGGGSQASTGTRPLEKEGFLGPVGEARRYLSAPLVAGGDFNEILTSKEKMGG
AERNQNQMSAFRSSIDACNLIDLGCPQGTFTWIKRVRGGSLIKERLDRFFANDELISKFEEIKIFHLSRHGSDHHPIMVSLGEREVKRKIGSGPIKFEGSWLAFGECREI
VKLHWSNPQLSSTHNFSLKITSCLRKLSNWNKTRLNGSIQSAIKMKKEEIQAIMNSNLRLKDVKLGVAERELDKLLEEEEIFWKFKSREEWLKWGDRNTKWFHSRATSTK
KCNLIKGFYNSNGVWVDNDEEMGREISKYFSNLFSSSTVEHQALRRIMEGGSRETTTHILWR