; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035657 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035657
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionCCHC-type domain-containing protein
Genome locationchr3:26440135..26444280
RNA-Seq ExpressionLag0035657
SyntenyLag0035657
Gene Ontology termsGO:0090304 - nucleic acid metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_019170410.1 PREDICTED: uncharacterized protein LOC109165884 [Ipomoea nil]2.8e-3821.85Show/hide
Query:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVK-IGKW
        M  L+WN RGLGNPR  R +   V R +P  VF+ E+K+    A+R++V LG++  F V + G  GG+ + W + S  N++ +S  +ID  + +    KW
Subjt:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVK-IGKW

Query:  RFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG------------------GLVELLV-------------------------
        R  GFYG PK  +R  SW L+  L      PWV+ GDFN++L+  +K+ G                  GL +L +                         
Subjt:  RFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG------------------GLVELLV-------------------------

Query:  ---KWKLSTDL--------------------------------DSLN------------------------------------------VFKTVVS----
            W+   D+                                D LN                                          V  +++S    
Subjt:  ---KWKLSTDL--------------------------------DSLN------------------------------------------VFKTVVS----

Query:  --------------------HLN-----------------------------------------------------------------------LHQSDH
                            +LN                                                                       L Q D 
Subjt:  --------------------HLN-----------------------------------------------------------------------LHQSDH

Query:  -KPLLFEFKMEG----------------------------------------ASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEI
          P LF    EG                                        A+ +E   IKR ++TY   S Q VN+ KS    SKN  + +  +  +I
Subjt:  -KPLLFEFKMEG----------------------------------------ASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEI

Query:  LGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------------------------------------------------------
        LGV   ++ G YLG+ S   RN+   FS I+DK+ + + SW                                                           
Subjt:  LGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------------------------------------------------------

Query:  ----------------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWG-----GNYFSRVIGGESV-------
                        +++  FN AML K  WR +  P SL++RV K RY+    F EA LGNNPS  WRSI        G    R+  G+S        
Subjt:  ----------------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWG-----GNYFSRVIGGESV-------

Query:  ----------------MGERKVNCLID-ENNRWLEGKVRDNFNVQDANSIIPLGDS-NPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLK
                        + + KV  LID +   W    + D F  QD   I+ +  S   +D   W  D KG  +VK  Y +       +  + S      
Subjt:  ----------------MGERKVNCLID-ENNRWLEGKVRDNFNVQDANSIIPLGDS-NPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLK

Query:  ECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFLFGLFMCNREH
          WK LWS+ +  + K+ +W+  NDI+PT  NL  K + ++P C + G+   N  H
Subjt:  ECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFLFGLFMCNREH

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]1.3e-4044.39Show/hide
Query:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGKWR
        MK L WNV GLGNP  FR LR+ V+R +PQ+VF+SE+K       R K  L +DCC  V+S GK GGL+L WNS S++ I S S GHID++I  K G WR
Subjt:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGKWR

Query:  FIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG-----------GLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLF
        F GFYGNP   +R  SW LL RL  +   PW++GGDFNEI+ + +K  G            + E L ++ ++  + +       V+HL L  SDH+P+L 
Subjt:  FIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG-----------GLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLF

Query:  EFKME
         +  E
Subjt:  EFKME

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]4.5e-1247.31Show/hide
Query:  ASKKECSTIKRVISTYGKASD-QLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDR
        AS   C +IK ++ +Y KAS  Q +N DKS F+VSKN  + +     + L V +T+SLG YLG+ S   RNK  +F+ IKD+VWKALQ WK +
Subjt:  ASKKECSTIKRVISTYGKASD-QLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDR

XP_022157437.1 uncharacterized protein LOC111024135 [Momordica charantia]2.5e-3926.52Show/hide
Query:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKG----GGLVLFWNSTSDINIVSFSKGHIDALIN---
        M  + WN RGLG PRA R L+   Q  RP ++F+ E+K      +R+K  LG+DC F V   G+G    GGL LFW +T D+ + SFS  HID +++   
Subjt:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKG----GGLVLFWNSTSDINIVSFSKGHIDALIN---

Query:  VKIGKWRFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKGGLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLFEFKM
        + I KWR  G +G P+ + +  +W LL  L S    PW+  GDFNEI+F  +K+ G           +    S+  F+   +H        K   F +  
Subjt:  VKIGKWRFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKGGLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLFEFKM

Query:  EGASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGK---ALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDRDII
            +++   I+  +       + L+ F  +      +      ALEI               ++  +  SN   +  LF    ++ W + +  KD    
Subjt:  EGASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGK---ALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDRDII

Query:  HFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSIS----------------------WGGNYFSRVIG----------GESV
              +    W+     +  L  V       +  F+EA L  N S TWR I                       W  N+ S + G           + V
Subjt:  HFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSIS----------------------WGGNYFSRVIG----------GESV

Query:  MGERKVNCLIDENNRWLEGKVRDNFNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAK
        M    +N    EN  W    +   F   +A  I  +PL   N +D+ IW  +K    +VKSAYH+   I++ M + S   S  K  W  +W +   P+ +
Subjt:  MGERKVNCLIDENNRWLEGKVRDNFNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAK

Query:  SCVWKIPNDIIPTCYNLYKKGLSVNPVC
           W++ ++ +PTC NL K+G+ +   C
Subjt:  SCVWKIPNDIIPTCYNLYKKGLSVNPVC

XP_030497957.1 uncharacterized protein LOC115713607 [Cannabis sativa]1.7e-4825.85Show/hide
Query:  QRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIG-KWRFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWV-----------
        + I++ LG+  CF V + GK GGL L W+    + I SF++ HIDAL+   +G  WRF GFYG+P    R  SW LL RLK +++  WV           
Subjt:  QRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIG-KWRFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWV-----------

Query:  ----------------------------------VGG---------DFNEILFVEDKKKGGLVELLVKWKLSTDLDSL--NVFKTVVSHLNLHQSDHKPL
                                          +GG          F  +  +   + G   +++VK  +S   D +  +  +T++  L L   D    
Subjt:  ----------------------------------VGG---------DFNEILFVEDKKKGGLVELLVKWKLSTDLDSL--NVFKTVVSHLNLHQSDHKPL

Query:  LFEFKMEGASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDR
                A+ +E  T+K ++ TY   S Q +N +K+   V   +  +L  +    LGV   K+   YLGM S   +NK  +F KI++KV   LQ     
Subjt:  LFEFKMEGASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDR

Query:  DIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGGNYFSRVIGGESVMGE-RKV----------------------
                   KM W++I +P SLL+RVLK  YF ++ F EA LG+  S  W+ + WG +  ++  G    +G+ RK+                      
Subjt:  DIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGGNYFSRVIGGESVMGE-RKV----------------------

Query:  ------NCLIDENNRWLEGKVRDNFNVQDANSIIP-LGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAKS
              N L+  N  W   +V   F+  D   ++  +  ++  D + W  +  G  +V S Y L  F+  E  A+ S+NSKLK  WK +W     P+ K+
Subjt:  ------NCLIDENNRWLEGKVRDNFNVQDANSIIP-LGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAKS

Query:  CVWKIPNDIIPTCYNLYKKGLSVNPVC---------FLFGLFMCNR--EHWSTLDY---WDWLGRNLNAKELEPSSSASEEAAVRFRRGCRKLPQKLQKS
         +WK+ N  IPT   L K+G+S+   C             L+ C +  + W  L Y    D +G N +A         S+E  +RF++     P     S
Subjt:  CVWKIPNDIIPTCYNLYKKGLSVNPVC---------FLFGLFMCNR--EHWSTLDY---WDWLGRNLNAKELEPSSSASEEAAVRFRRGCRKLPQKLQKS

Query:  QPLCFLHTIPEE------------RPSRVRIVGSLRRKVAGNSMSMPLGILRKIEVALAGLFVTLYAGMIPIQEKWPLSRIVVNSDCLELIQ-LLNREDE
        +P+      P++                  I   +R +  G  ++  +     + V LA     L    + I  KW +S + V SD   +I  ++N    
Subjt:  QPLCFLHTIPEE------------RPSRVRIVGSLRRKVAGNSMSMPLGILRKIEVALAGLFVTLYAGMIPIQEKWPLSRIVVNSDCLELIQ-LLNREDE

Query:  DLSKEIFVNSISGLANSVGGVCFRHCPREQNCVAHSIAR
               V +I  L      + F    R  N VA+S+A+
Subjt:  DLSKEIFVNSISGLANSVGGVCFRHCPREQNCVAHSIAR

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]4.3e-3926.6Show/hide
Query:  GLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLFEFKMEGASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYT
        GL  LL   +L+  L+ L + ++  S  +L  +D   L        A+++    I R + TY +AS Q++N +K     S+N  +  +I   ++LG+P  
Subjt:  GLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLFEFKMEGASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYT

Query:  KSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------------------------------------------------------------
             YLG+ S + +NK  LF  I DK+WK L SWK+                                                               
Subjt:  KSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------------------------------------------------------------

Query:  -----------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGGNYFSRVIGGESVMGERKVNC---------
                   R+ IHFN A+LAK  WRI++ P SLL+ +L+ RYF +  +L A LG+NPSLTWRS+ WG     + +      GER +NC         
Subjt:  -----------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGGNYFSRVIGGESVMGERKVNC---------

Query:  -------------------LIDENNRWLEGKVRDNFNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKM
                           LI E+  W    +  NFN  D N +  IPL      D +IW++   G   VKS YH AV +AE+ +++ S++  ++  W  
Subjt:  -------------------LIDENNRWLEGKVRDNFNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKM

Query:  LWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFL---------FGLFMCNREHWSTLDYWDWLGRNLNAKELEPSSSA
         W +   P+ +  VWK+ +  +P    LY++ ++ +P C +           LF C R        W+    +++ + +E SS+A
Subjt:  LWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFL---------FGLFMCNREHWSTLDYWDWLGRNLNAKELEPSSSA

TrEMBL top hitse value%identityAlignment
A0A2N9H5H6 CCHC-type domain-containing protein6.7e-3826.09Show/hide
Query:  PQIVFISESKIGDVRAQRIKVLLGYDCCFCV--SSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIG-KWRFIGFYGNPKVEERPLSWTLLSRLKS
        P IVF+ E+++     + ++V LG   C  V  + +G GGGL L W+ST  INI S+S  HIDA +    G  WR  GFYG+P+   R  SW LL  L +
Subjt:  PQIVFISESKIGDVRAQRIKVLLGYDCCFCV--SSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIG-KWRFIGFYGNPKVEERPLSWTLLSRLKS

Query:  LYSYPWV----VGGDFNEILFVEDKKKGGLVELLVKWKLSTDLDSLNVFKTVV-SHLNLHQSDHKPL-------------------LFEFKMEGASKKEC
        L + PW+    +G DF    +   +  GGLV + +   ++ + D + +F     SHL++  SDH  L                   LF F+     ++ C
Subjt:  LYSYPWV----VGGDFNEILFVEDKKKGGLVELLVKWKLSTDLDSLNVFKTVV-SHLNLHQSDHKPL-------------------LFEFKMEGASKKEC

Query:  ------------------------------------STIKR---VISTYGKASDQLVNFDKSGFMVSKNVGKALEIK-------CGEILGVPY-------
                                            S +K    +I +  +  ++L       + V  N+ +                L  P+       
Subjt:  ------------------------------------STIKR---VISTYGKASDQLVNFDKSGFMVSKNVGKALEIK-------CGEILGVPY-------

Query:  -------TKSLGNYLGMSS------------------------------SNSRNKSHLFSKIKDKVWKALQSWKDRDIIHFNPAMLAKMCWRIIKDPTSL
               +K+ G   G+S+                               +   K H  SK K    K     + RDI  FN  +LAK  WR++++P SL
Subjt:  -------TKSLGNYLGMSS------------------------------SNSRNKSHLFSKIKDKVWKALQSWKDRDIIHFNPAMLAKMCWRIIKDPTSL

Query:  LARVLKGRYFKDKPFLEAPLGNNPSLTWRSIS----------------------WGGNY------FSRVIGGESVMGERKVNCLIDENNRWLEGKVRDN-
        + R+ K +YF   PFLEA + +N S  WRSI                       W   +      +S +   +++     V+ LID +      ++ D  
Subjt:  LARVLKGRYFKDKPFLEAPLGNNPSLTWRSIS----------------------WGGNY------FSRVIGGESVMGERKVNCLIDENNRWLEGKVRDN-

Query:  FNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEAS-SSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLS
        F  +DA  I  IPL    P+D +IW+  KKG  TVKSAY + +  A+  EAS SS  S     W  +WS +  P+  + +W+   DI+PT   L+ KG  
Subjt:  FNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEAS-SSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLS

Query:  VNPVCFLFGLFMCNREH
            C   G     R+H
Subjt:  VNPVCFLFGLFMCNREH

A0A6J1DUG8 uncharacterized protein LOC1110241356.5e-4144.39Show/hide
Query:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGKWR
        MK L WNV GLGNP  FR LR+ V+R +PQ+VF+SE+K       R K  L +DCC  V+S GK GGL+L WNS S++ I S S GHID++I  K G WR
Subjt:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGKWR

Query:  FIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG-----------GLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLF
        F GFYGNP   +R  SW LL RL  +   PW++GGDFNEI+ + +K  G            + E L ++ ++  + +       V+HL L  SDH+P+L 
Subjt:  FIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG-----------GLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLF

Query:  EFKME
         +  E
Subjt:  EFKME

A0A6J1DUG8 uncharacterized protein LOC1110241352.2e-1247.31Show/hide
Query:  ASKKECSTIKRVISTYGKASD-QLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDR
        AS   C +IK ++ +Y KAS  Q +N DKS F+VSKN  + +     + L V +T+SLG YLG+ S   RNK  +F+ IKD+VWKALQ WK +
Subjt:  ASKKECSTIKRVISTYGKASD-QLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDR

A0A6J1DUG8 uncharacterized protein LOC1110241352.3e-3826.06Show/hide
Query:  ASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------
        A+++ C  IKRV+ TY KAS Q +N DKS    S N   A ++   + L +P  +    YLG+ S + R+K  +FS IK+++ K + SW +         
Subjt:  ASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------

Query:  ----------------------------------------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGG
                                                R  IHFN A+LAK  WRI + P SLL R+LK RYF +  FLEA LG++PSLTW+ I W  
Subjt:  ----------------------------------------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGG

Query:  NYFS-----RVIGGESVM---------------------GERKVNCLIDENNRWLEGKVRDNFNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSA
                 +V  G  +                          V+ LI +  +W    ++  F+  D   I  +PL   + +D +IW     G  TVKS 
Subjt:  NYFS-----RVIGGESVM---------------------GERKVNCLIDENNRWLEGKVRDNFNVQDANSI--IPLGDSNPKDEVIWSRDKKGRSTVKSA

Query:  YHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFL---------FGLFMCNRE----HWSTLDYWDW
        YHLA  I  ++   SS +++    WK  WS+   P+ K   W+  +D +P   +L K+ +  +  C +           LF C       H S L  +DW
Subjt:  YHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFL---------FGLFMCNRE----HWSTLDYWDW

Query:  -----LGRNLNAKELEPSSSASEEAAV------------RFRRGCRKLPQKLQKSQPLCFLHTIPEERPSRVRIVGSLRRKVAGN-----SMSMPLGILR
             + +      L    S +E   +            R   G +    K   S  + +L      +  +V  +G++ R   GN     S   P G   
Subjt:  -----LGRNLNAKELEPSSSASEEAAV------------RFRRGCRKLPQKLQKSQPLCFLHTIPEERPSRVRIVGSLRRKVAGN-----SMSMPLGILR

Query:  KIEVALAGLFVTLYAGMIPIQEKWPLSRIVVNSDCLELIQLLNREDEDLSK-EIFVNSISGLANSVGGVCFRHCPREQNCVAHSIAREGVG
          ++    LF +L      IQ++ P+S  +V SD L ++  L      +S     +  +  L + +  V   H  RE N  AH +A+  +G
Subjt:  KIEVALAGLFVTLYAGMIPIQEKWPLSRIVVNSDCLELIQLLNREDEDLSK-EIFVNSISGLANSVGGVCFRHCPREQNCVAHSIAREGVG

A0A803PBM9 Uncharacterized protein3.3e-1327.71Show/hide
Query:  MEGASKKECSTIKRVISTYGKASDQLVNFDKS----GFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD--
        +EG ++ EC T+  ++  Y + S Q +N +KS    G  +S  +G +L  + G  L   +TK    YLG+ S   R K  +F  IKDKVW  L+SWK   
Subjt:  MEGASKKECSTIKRVISTYGKASDQLVNFDKS----GFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD--

Query:  ------------------------------------------------------------------------RDIIHFNPAMLAKMCWRIIKDPTSLLAR
                                                                                R +  FN A+LAK  WR+I  P SLLAR
Subjt:  ------------------------------------------------------------------------RDIIHFNPAMLAKMCWRIIKDPTSLLAR

Query:  VLKGRYFKDKPFLEAPLGNNPSLTWRSISWG
        VLK  Y+ +  FL+A      S  W+ I+WG
Subjt:  VLKGRYFKDKPFLEAPLGNNPSLTWRSISWG

A0A803QH07 Uncharacterized protein1.5e-3727.87Show/hide
Query:  ASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------
        A+++    +KR ++TY KAS QL+N DKS    S N   A +      L +P T     YLG+ S + R+K  LFS IK+KVWK L +W +         
Subjt:  ASKKECSTIKRVISTYGKASDQLVNFDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD---------

Query:  -----------------------------------------------------------------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYF
                                                                         R  +HFN A+LAK  WRI   P SLL+R+LK RYF
Subjt:  -----------------------------------------------------------------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYF

Query:  KDKPFLEAPLGNNPSLTWRSISWGGNYFSR----VIGGESVMGERK----------------------VNCLIDENNRWLEGKVRDNFNVQDANSI--IP
            FL+A +G++PS TW+SI WG     +     +G  S +   K                      V+ LI++N  W    + D F   D   I  IP
Subjt:  KDKPFLEAPLGNNPSLTWRSISWGGNYFSR----VIGGESVMGERK----------------------VNCLIDENNRWLEGKVRDNFNVQDANSI--IP

Query:  LGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFLFGLFMCN
        L     +D +IW     G   VKS +HLA  + +++++SSSD +  ++ WK  W+++  P+ +   WK+   I+P    L+K+ +  +  C L     CN
Subjt:  LGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFLFGLFMCN

Query:  REHWSTLDY
           W ++ +
Subjt:  REHWSTLDY

A0A803QH07 Uncharacterized protein5.7e-2139.22Show/hide
Query:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGK-W
        MKI++WN RGLGNP A R LR  V+   P ++F+ E+K+      R + +L +     V  VG  GGL+L WN  + + + +F+    D  +    G   
Subjt:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGK-W

Query:  RFIGFYGNPKVEERPLSWTLLSRLKSLY-SYPWVVGGDFNEILFVEDKKKGGL
         F  FYG P    R  SWTLL RLK +    PW++ GDFNEIL+  +K+ G L
Subjt:  RFIGFYGNPKVEERPLSWTLLSRLKSLY-SYPWVVGGDFNEILFVEDKKKGGL

A0A803QH07 Uncharacterized protein3.3e-3752Show/hide
Query:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGK-W
        MK+L WNV+GLGNP   R L+  V R+ P++VFISES++   +A+ ++V LGYD CF V + GK GGL+L W++  D NI+SFS  HID+ I  + G+ W
Subjt:  MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGK-W

Query:  RFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG
        RF GFYG+P   +R  SW LL+R+  +YS PWV+GGDFNEIL  ++K  G
Subjt:  RFIGFYGNPKVEERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKG

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657503.8e-0622.27Show/hide
Query:  NPAMLAKMCWRIIKDPTSLLARVLKGRY----FKDKPFLEAPLGNNPSLTWRSISWG-------------GN------YFSRVIGGESVM----GERKVN
        N A+++K+ WR++++  SL   VL+ +Y     +D  +L  P G+  S TWRSI+ G             G+      +  R + G+ ++    GER  +
Subjt:  NPAMLAKMCWRIIKDPTSLLARVLKGRY----FKDKPFLEAPLGNNPSLTWRSISWG-------------GN------YFSRVIGGESVM----GERKVN

Query:  C-LIDENNRWLEGKVRD---------NFNVQDANSIIPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRA
        C  +   + W+ G+  D         N    +  +++    +  +D + W   + G+ +V+SAY       E +         +   +  LW +    R 
Subjt:  C-LIDENNRWLEGKVRD---------NFNVQDANSIIPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLKECWKMLWSINSIPRA

Query:  KSCVWKIPNDIIPTCYNLYKKGLSVNPVC
        K+ +W + N  + T    +++ LS + VC
Subjt:  KSCVWKIPNDIIPTCYNLYKKGLSVNPVC

P93295 Uncharacterized mitochondrial protein AtMg003106.3e-0933.33Show/hide
Query:  KCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD-------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPL
        +  ++L    T ++  +   S  N R       KI    W+ L   K+       RD+  FN A+LAK  +RII  P +LL+R+L+ RYF     +E  +
Subjt:  KCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD-------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPL

Query:  GNNPSLTWRSISWGGNYFSR
        G  PS  WRSI  G    SR
Subjt:  GNNPSLTWRSISWGGNYFSR

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-1527.56Show/hide
Query:  KVWKALQSWKD------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGGNYFSR----VIG-GE--------
        K W  L  +K       +DI  FN A+L K  WR++  P SL+A+V K RYF     L APLG+ PS  W+SI        +    V+G GE        
Subjt:  KVWKALQSWKD------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPLGNNPSLTWRSISWGGNYFSR----VIG-GE--------

Query:  ----------------------SVMGERKVNCLIDENNR-WLEGKVRDNFNVQDANSIIPL--GDSNPKDEVIWSRDKKGRSTVKSAYH-LAVFIAEEME
                              SV    KV+ LIDE+ R W +  +   F   +   I  L  G     D   W     G  TVKS Y  L   I +   
Subjt:  ----------------------SVMGERKVNCLIDENNR-WLEGKVRDNFNVQDANSIIPL--GDSNPKDEVIWSRDKKGRSTVKSAYH-LAVFIAEEME

Query:  ASSSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVC
                L   ++ +W   + P+ +  +WK  ++ +P    L  + LS    C
Subjt:  ASSSDNSKLKECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVC

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.5e-1033.33Show/hide
Query:  KCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD-------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPL
        +  ++L    T ++  +   S  N R       KI    W+ L   K+       RD+  FN A+LAK  +RII  P +LL+R+L+ RYF     +E  +
Subjt:  KCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKD-------RDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAPL

Query:  GNNPSLTWRSISWGGNYFSR
        G  PS  WRSI  G    SR
Subjt:  GNNPSLTWRSISWGGNYFSR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAAATCCTCAACTGGAACGTTCGGGGTTTGGGGAATCCCCGAGCGTTCCGAGCTTTGCGCCATGAGGTGCAAAGGTTCAGACCTCAAATTGTTTTCATTTCTGAGTC
TAAGATTGGAGATGTTAGAGCGCAGAGAATCAAAGTGTTGTTGGGTTATGATTGCTGTTTTTGTGTTAGCAGTGTTGGAAAAGGTGGTGGGCTTGTTTTGTTTTGGAATT
CGACAAGTGATATTAACATAGTTTCTTTCTCCAAAGGGCATATTGATGCTCTTATTAATGTCAAGATAGGCAAGTGGCGGTTCATTGGTTTTTATGGAAACCCAAAGGTG
GAAGAGAGACCATTATCGTGGACTCTCCTTAGCAGACTCAAGTCCCTATACAGCTACCCGTGGGTTGTGGGAGGGGACTTCAATGAGATTCTATTTGTTGAAGATAAAAA
AAAAGGGGGGCTAGTCGAGCTTCTAGTCAAATGGAAGCTTTCAACTGATCTTGATAGCCTGAATGTGTTTAAAACTGTGGTCAGCCACCTTAATTTGCATCAATCTGATC
ACAAGCCTTTGCTTTTTGAATTCAAAATGGAAGGGGCCTCTAAGAAGGAATGTAGCACTATCAAAAGGGTGATCTCTACTTATGGGAAAGCTTCCGATCAGTTGGTGAAT
TTCGACAAATCGGGGTTCATGGTGAGTAAAAATGTGGGGAAAGCTCTTGAGATCAAATGTGGAGAGATCCTTGGAGTTCCCTATACCAAATCTTTAGGGAATTATCTAGG
AATGTCATCATCAAATAGTAGGAACAAAAGCCACTTGTTTTCTAAGATCAAGGATAAGGTGTGGAAAGCTCTTCAATCCTGGAAAGATAGAGACATTATCCATTTCAATC
CAGCGATGCTCGCTAAAATGTGTTGGAGAATCATCAAAGATCCGACAAGCCTCCTCGCTAGGGTCCTTAAAGGGCGTTACTTTAAGGACAAGCCTTTCCTCGAAGCCCCT
TTGGGGAATAACCCCTCTCTTACATGGAGAAGCATTTCATGGGGAGGGAATTATTTCTCAAGGGTTATAGGTGGAGAGTCGGTAATGGGAGAAAGGAAGGTGAATTGCCT
CATTGATGAGAACAACAGGTGGTTAGAAGGCAAAGTGAGGGATAACTTCAATGTCCAAGATGCCAATTCAATCATTCCCCTTGGAGACTCTAATCCTAAAGACGAAGTCA
TATGGAGTCGGGACAAAAAAGGAAGGTCCACGGTGAAAAGTGCCTATCATTTGGCGGTGTTTATAGCAGAGGAAATGGAAGCCTCATCCTCGGATAATAGCAAACTCAAA
GAGTGTTGGAAAATGCTTTGGAGCATCAATTCCATCCCGAGGGCAAAATCTTGTGTGTGGAAGATCCCCAACGACATCATCCCTACTTGTTATAATCTCTACAAGAAAGG
CTTATCTGTTAACCCTGTTTGTTTCTTGTTTGGCTTATTTATGTGTAACAGGGAGCACTGGTCGACACTAGATTATTGGGATTGGCTTGGTCGAAACCTGAATGCTAAGG
AGTTGGAGCCGTCGTCGTCCGCTTCAGAAGAGGCCGCCGTCCGCTTCAGAAGAGGTTGTCGGAAGTTACCTCAAAAACTCCAGAAGTCGCAGCCACTTTGTTTTCTTCAT
ACTATCCCAGAGGAGAGACCCTCCCGAGTCAGAATTGTTGGATCCCTCCGCCGCAAGGTTGCTGGAAACTCAATGTCGATGCCTCTTGGAATTCTTCGGAAGATAGAGGT
GGCCTTGGCTGGATTGTTCGTGACTCTCTACGCTGGGATGATTCCTATCCAAGAGAAATGGCCGCTGTCGAGGATCGTGGTGAATTCCGACTGTCTTGAGCTCATTCAAT
TGTTGAATCGTGAGGATGAGGATCTTTCAAAAGAAATCTTTGTGAATTCAATTTCGGGGCTGGCCAATTCTGTGGGGGGAGTCTGTTTTAGACATTGCCCTAGAGAGCAA
AATTGCGTTGCTCACTCTATTGCGCGCGAAGGTGTTGGCTTTACCCGCCACTCCATTTTGTGTAATTTGGGTCAGAGGCTCCCTTCCACACTGGAAGGAGAACTTGTGAG
GAAGACTCTTGTAGGCCTGCGGGCTCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAAATCCTCAACTGGAACGTTCGGGGTTTGGGGAATCCCCGAGCGTTCCGAGCTTTGCGCCATGAGGTGCAAAGGTTCAGACCTCAAATTGTTTTCATTTCTGAGTC
TAAGATTGGAGATGTTAGAGCGCAGAGAATCAAAGTGTTGTTGGGTTATGATTGCTGTTTTTGTGTTAGCAGTGTTGGAAAAGGTGGTGGGCTTGTTTTGTTTTGGAATT
CGACAAGTGATATTAACATAGTTTCTTTCTCCAAAGGGCATATTGATGCTCTTATTAATGTCAAGATAGGCAAGTGGCGGTTCATTGGTTTTTATGGAAACCCAAAGGTG
GAAGAGAGACCATTATCGTGGACTCTCCTTAGCAGACTCAAGTCCCTATACAGCTACCCGTGGGTTGTGGGAGGGGACTTCAATGAGATTCTATTTGTTGAAGATAAAAA
AAAAGGGGGGCTAGTCGAGCTTCTAGTCAAATGGAAGCTTTCAACTGATCTTGATAGCCTGAATGTGTTTAAAACTGTGGTCAGCCACCTTAATTTGCATCAATCTGATC
ACAAGCCTTTGCTTTTTGAATTCAAAATGGAAGGGGCCTCTAAGAAGGAATGTAGCACTATCAAAAGGGTGATCTCTACTTATGGGAAAGCTTCCGATCAGTTGGTGAAT
TTCGACAAATCGGGGTTCATGGTGAGTAAAAATGTGGGGAAAGCTCTTGAGATCAAATGTGGAGAGATCCTTGGAGTTCCCTATACCAAATCTTTAGGGAATTATCTAGG
AATGTCATCATCAAATAGTAGGAACAAAAGCCACTTGTTTTCTAAGATCAAGGATAAGGTGTGGAAAGCTCTTCAATCCTGGAAAGATAGAGACATTATCCATTTCAATC
CAGCGATGCTCGCTAAAATGTGTTGGAGAATCATCAAAGATCCGACAAGCCTCCTCGCTAGGGTCCTTAAAGGGCGTTACTTTAAGGACAAGCCTTTCCTCGAAGCCCCT
TTGGGGAATAACCCCTCTCTTACATGGAGAAGCATTTCATGGGGAGGGAATTATTTCTCAAGGGTTATAGGTGGAGAGTCGGTAATGGGAGAAAGGAAGGTGAATTGCCT
CATTGATGAGAACAACAGGTGGTTAGAAGGCAAAGTGAGGGATAACTTCAATGTCCAAGATGCCAATTCAATCATTCCCCTTGGAGACTCTAATCCTAAAGACGAAGTCA
TATGGAGTCGGGACAAAAAAGGAAGGTCCACGGTGAAAAGTGCCTATCATTTGGCGGTGTTTATAGCAGAGGAAATGGAAGCCTCATCCTCGGATAATAGCAAACTCAAA
GAGTGTTGGAAAATGCTTTGGAGCATCAATTCCATCCCGAGGGCAAAATCTTGTGTGTGGAAGATCCCCAACGACATCATCCCTACTTGTTATAATCTCTACAAGAAAGG
CTTATCTGTTAACCCTGTTTGTTTCTTGTTTGGCTTATTTATGTGTAACAGGGAGCACTGGTCGACACTAGATTATTGGGATTGGCTTGGTCGAAACCTGAATGCTAAGG
AGTTGGAGCCGTCGTCGTCCGCTTCAGAAGAGGCCGCCGTCCGCTTCAGAAGAGGTTGTCGGAAGTTACCTCAAAAACTCCAGAAGTCGCAGCCACTTTGTTTTCTTCAT
ACTATCCCAGAGGAGAGACCCTCCCGAGTCAGAATTGTTGGATCCCTCCGCCGCAAGGTTGCTGGAAACTCAATGTCGATGCCTCTTGGAATTCTTCGGAAGATAGAGGT
GGCCTTGGCTGGATTGTTCGTGACTCTCTACGCTGGGATGATTCCTATCCAAGAGAAATGGCCGCTGTCGAGGATCGTGGTGAATTCCGACTGTCTTGAGCTCATTCAAT
TGTTGAATCGTGAGGATGAGGATCTTTCAAAAGAAATCTTTGTGAATTCAATTTCGGGGCTGGCCAATTCTGTGGGGGGAGTCTGTTTTAGACATTGCCCTAGAGAGCAA
AATTGCGTTGCTCACTCTATTGCGCGCGAAGGTGTTGGCTTTACCCGCCACTCCATTTTGTGTAATTTGGGTCAGAGGCTCCCTTCCACACTGGAAGGAGAACTTGTGAG
GAAGACTCTTGTAGGCCTGCGGGCTCTTTAG
Protein sequenceShow/hide protein sequence
MKILNWNVRGLGNPRAFRALRHEVQRFRPQIVFISESKIGDVRAQRIKVLLGYDCCFCVSSVGKGGGLVLFWNSTSDINIVSFSKGHIDALINVKIGKWRFIGFYGNPKV
EERPLSWTLLSRLKSLYSYPWVVGGDFNEILFVEDKKKGGLVELLVKWKLSTDLDSLNVFKTVVSHLNLHQSDHKPLLFEFKMEGASKKECSTIKRVISTYGKASDQLVN
FDKSGFMVSKNVGKALEIKCGEILGVPYTKSLGNYLGMSSSNSRNKSHLFSKIKDKVWKALQSWKDRDIIHFNPAMLAKMCWRIIKDPTSLLARVLKGRYFKDKPFLEAP
LGNNPSLTWRSISWGGNYFSRVIGGESVMGERKVNCLIDENNRWLEGKVRDNFNVQDANSIIPLGDSNPKDEVIWSRDKKGRSTVKSAYHLAVFIAEEMEASSSDNSKLK
ECWKMLWSINSIPRAKSCVWKIPNDIIPTCYNLYKKGLSVNPVCFLFGLFMCNREHWSTLDYWDWLGRNLNAKELEPSSSASEEAAVRFRRGCRKLPQKLQKSQPLCFLH
TIPEERPSRVRIVGSLRRKVAGNSMSMPLGILRKIEVALAGLFVTLYAGMIPIQEKWPLSRIVVNSDCLELIQLLNREDEDLSKEIFVNSISGLANSVGGVCFRHCPREQ
NCVAHSIAREGVGFTRHSILCNLGQRLPSTLEGELVRKTLVGLRAL