; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040851 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040851
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr13:8982489..8983965
RNA-Seq ExpressionLag0040851
SyntenyLag0040851
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.3e-6032.55Show/hide
Query:  GDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGKN-------------------------LGNGQ
        G+  ++HW +W  +C PKE GGLNFRDL  FNQA++AK  WR L +PNL VS+VL  KYF   S+L   N                         +GNG 
Subjt:  GDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGKN-------------------------LGNGQ

Query:  SIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHL
        +I  F DPWLPRPTTFK + R +    D TVA FIT   +WD+  ++      D ++I  + IS     D W+WHYD R  YSV+SGYKL M        
Subjt:  SIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHL

Query:  SDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNL----------------------------------WNHHVPVLGNCLS-------------
        +      T W  +WK+ VP+K+K+F+W+S H  IP   NL                                  W    P L  CLS             
Subjt:  SDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNL----------------------------------WNHHVPVLGNCLS-------------

Query:  ------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYL---SKFRMAN------PNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSN
                      +  W IWNDRN+++H + +     +CEW+  +L   S+ +M+N       N   VVQ      ++  K     ++ DA   G  ++
Subjt:  ------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYL---SKFRMAN------PNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSN

Query:  AGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE
           G ++ D    L+A  ++      SPL A+   +LEGL+ A   N   L + SDSLL I+ I  E
Subjt:  AGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE

XP_024950112.1 uncharacterized protein LOC112496847 [Citrus sinensis]1.2e-5430.27Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----
        M  F++P+G    I    AKFWWGS GD   +HW++WE L + K  GGL FR+   FNQA++AKQAWR+L  PN  VSRVL  +YF + S L  K     
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----

Query:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD-VEVIKGLSISGTTPDRWIW
                             +GNG+ I +F D WLPRP TF+ +  +   +  + VAD I     WD  KL +H + +D  E++K    +    D  +W
Subjt:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLD-VEVIKGLSISGTTPDRWIW

Query:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHV---PVLGNC--------------------
        HYD R  YSVKSGY+L++ +      S    +  +W  +W + +P K+K+F+W++ +N +P+  NLW   V   P    C                    
Subjt:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHV---PVLGNC--------------------

Query:  ---------------------------------LSICVGV-WSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKF-RMANPNGGSVVQSMADIVNIISKGE
                                         L + V + WS W  RN  + +    +P I        L+ F R+  P    +       ++I  K +
Subjt:  ---------------------------------LSICVGV-WSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKF-RMANPNGGSVVQSMADIVNIISKGE

Query:  E--------FIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSIN
        E        F +++DA    K  +AG+G V+ D +G ++A      +   S   A+A AVL GL+LAR  +V  L I SD L +++ +N
Subjt:  E--------FIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSIN

XP_030483769.1 uncharacterized protein LOC115700339 [Cannabis sativa]4.2e-5531.03Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKY-----FLSKS-----
        M CF++ K    ++ A+ A+FWWGS  D+ ++HW++W+ LC+ K  GG+ FR  V+FNQA+LAKQAWR+   PN  +SRVL G+Y     F++ S     
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKY-----FLSKS-----

Query:  ------VLSGKNL---------GNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW
              ++ G+ L         G+G SI    D W+P    FK    + P      V+D+ITP   W+I  L     P DV+ I  + +S     DRWIW
Subjt:  ------VLSGKNL---------GNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW

Query:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVW--------------S
        H+D    YSV +GY  +    +R   +     +TWWK  W   +P+KVK+F W+   +SIP   +L++  V     C S+C   W               
Subjt:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVW--------------S

Query:  IWNDRN-----NVVHNRPIPDPRIRCEWINDY--LSKFRMANPNGGSVVQSMADIVNIISKGEEFI-MHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNL
        +W            H     D  IR   I      S    ++PN G    S A +V   +  E  I M++DA +   +S  GIG+++ + +G ++A  + 
Subjt:  IWNDRN-----NVVHNRPIPDPRIRCEWINDY--LSKFRMANPNGGSVVQSMADIVNIISKGEEFI-MHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNL

Query:  STMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEET
            N    E +A A+  GL  A RL++    + +D L+L+ +++  T
Subjt:  STMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEET

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]1.1e-5527.89Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----
        M CF++      +I  + A+FWWGS+ D+ ++HW+ W++LC  K  GGL FR  V+FNQA LAKQAWR+   PN  +SRVL G+Y+     ++ K     
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----

Query:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW
                             +G+G  +    D W+P    FK + R      +  VAD+IT +  WD+  L+    P D++ I  + +S  +T DRW W
Subjt:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW

Query:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG-----------
        HYD    Y+VKSGY L+     + H S       WW+  W + +PSKV++F W+  ++++P   NL++  V     C S+C      +G           
Subjt:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG-----------

Query:  -----------------------------------------VWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKF---RMANPNGGSVVQSMADIVNIIS
                                                 +W IW+DRNN +H + +  P         YL+ F   + A     S V + A  V  + 
Subjt:  -----------------------------------------VWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKF---RMANPNGGSVVQSMADIVNIIS

Query:  KGEEFI-MHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSI
          E  + M++DA +   ++  GIG+++ D  G ++A  +   + N    E +A A+  GL+ A++L ++   + +D L+L+ ++
Subjt:  KGEEFI-MHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSI

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa]2.6e-5728.45Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----
        M CF++      +I ++ A+FWWGS+ D+ ++HW+ W++LC  K  G L FR  V+FNQA LAKQAWRV  NP+  +SRVL G+Y+     LS K     
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----

Query:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW
                             +G G ++    D W+P    FK      P   +  VA +IT +  W+   L++   P DVE I  + +S  +  D WIW
Subjt:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW

Query:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVWS--------------
        HY+    Y+VKS Y L+     R   S  G   TWWK+ W + +PSKV++F WK  ++++P   NL++  V     C S+C  VW               
Subjt:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVWS--------------

Query:  --------------------------------------------IWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIIS---
                                                    IW+DRNN +H + +  P         YL+ F             +A  VN +    
Subjt:  --------------------------------------------IWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIIS---

Query:  -KGEEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSIN
            +  M++DA +   +S  G+G+++ D  G ++A  +   + N    E +A A+  GL+LA +L ++   + +D L+L+ +IN
Subjt:  -KGEEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSIN

TrEMBL top hitse value%identityAlignment
A0A6J1DX30 uncharacterized protein LOC1110248741.6e-6032.55Show/hide
Query:  GDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGKN-------------------------LGNGQ
        G+  ++HW +W  +C PKE GGLNFRDL  FNQA++AK  WR L +PNL VS+VL  KYF   S+L   N                         +GNG 
Subjt:  GDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGKN-------------------------LGNGQ

Query:  SIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHL
        +I  F DPWLPRPTTFK + R +    D TVA FIT   +WD+  ++      D ++I  + IS     D W+WHYD R  YSV+SGYKL M        
Subjt:  SIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHL

Query:  SDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNL----------------------------------WNHHVPVLGNCLS-------------
        +      T W  +WK+ VP+K+K+F+W+S H  IP   NL                                  W    P L  CLS             
Subjt:  SDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNL----------------------------------WNHHVPVLGNCLS-------------

Query:  ------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYL---SKFRMAN------PNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSN
                      +  W IWNDRN+++H + +     +CEW+  +L   S+ +M+N       N   VVQ      ++  K     ++ DA   G  ++
Subjt:  ------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYL---SKFRMAN------PNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQSN

Query:  AGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE
           G ++ D    L+A  ++      SPL A+   +LEGL+ A   N   L + SDSLL I+ I  E
Subjt:  AGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE

A0A803PIB6 Uncharacterized protein5.4e-5627.89Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----
        M CF++      +I  + A+FWWGS+ D+ ++HW+ W++LC  K  GGL FR  V+FNQA LAKQAWR+   PN  +SRVL G+Y+     ++ K     
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----

Query:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW
                             +G+G  +    D W+P    FK + R      +  VAD+IT +  WD+  L+    P D++ I  + +S  +T DRW W
Subjt:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW

Query:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG-----------
        HYD    Y+VKSGY L+     + H S       WW+  W + +PSKV++F W+  ++++P   NL++  V     C S+C      +G           
Subjt:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG-----------

Query:  -----------------------------------------VWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKF---RMANPNGGSVVQSMADIVNIIS
                                                 +W IW+DRNN +H + +  P         YL+ F   + A     S V + A  V  + 
Subjt:  -----------------------------------------VWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKF---RMANPNGGSVVQSMADIVNIIS

Query:  KGEEFI-MHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSI
          E  + M++DA +   ++  GIG+++ D  G ++A  +   + N    E +A A+  GL+ A++L ++   + +D L+L+ ++
Subjt:  KGEEFI-MHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSI

A0A803PKJ2 Uncharacterized protein2.8e-6029.28Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----
        M CF++      +I ++ A+FWWGS+ D+ ++HW+ W++LC  K  GGL FR  V+FNQA LAKQAWRV  NP+  +SRVL G+Y+     LS K     
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----

Query:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW
                             +G G ++    D W+P    FK      P   +  VA +IT +  W+   L++   P+DVE I  + +S  +  D WIW
Subjt:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSIS-GTTPDRWIW

Query:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG-----------
        HYD    Y+VKSGY L+     R   S  G   TWWK+ W + +PSKV++F WK  ++++P   NL++  V     C S+C      +G           
Subjt:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSIC------VG-----------

Query:  -----------------------------------------VWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVN----II
                                                 +W IW+DRNN +H + +  P         YL+ F+    +   V   +A  VN    I 
Subjt:  -----------------------------------------VWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVN----II

Query:  SKGEEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSIN
            +  M++DA +   +S  GIG+++ D  G ++   +  T+ N    E +A A+  GL+ A+ L ++   + +D ++L+ +IN
Subjt:  SKGEEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSIN

A0A803Q185 Uncharacterized protein2.6e-5828.99Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----
        M CF++ KG ++ +  + A+FWWGS+    ++HW +WE+LC+PK+ GG+ FRDL  FNQA+LAKQ WR +  PN   +RVL   YF + SVL  K     
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGK-----

Query:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIW
                             +G+G +I + +DPWLPRP TFK+  +  P  +   V D       WD P ++      D E+I  L  SG    D+ +W
Subjt:  --------------------NLGNGQSIYMFQDPWLPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWIW

Query:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNC--------------------LSI
        HY     YSVKSGY+++         SD      WW+K+W++ +P K+KVFVWK  H  +P    L   HV     C                      +
Subjt:  HYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNC--------------------LSI

Query:  C--------------------------------------VGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKGE
        C                                      V  WS+WN RN   H+  +P P    EW    L  F+        V++    +  +   GE
Subjt:  C--------------------------------------VGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKGE

Query:  EFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE
          I ++DA V      +G+G V+  + G ++   + +     SPL+ + +A+L+G+++  + ++   SI  D L  I+ I+++
Subjt:  EFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE

A0A803QQT2 Uncharacterized protein3.4e-5830.58Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLS-------
        M CF++PK  ++ +  + ++FWWGS     ++HW +W  LCRPK+ GGL FRDL  FNQA+LAKQ WR L +P L  SRVL   YF  K VL        
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLS-------

Query:  ---------GKNL---------GNGQSIYMFQDPWLPRPTTFKVVSRMDPRM-KDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWI
                 GK L         GNG+S+ + +DPWLPRP TFKV  +  P +  +  V D       WD   +     P DV++I G+  S     D+ +
Subjt:  ---------GKNL---------GNGQSIYMFQDPWLPRPTTFKVVSRMDPRM-KDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISG-TTPDRWI

Query:  WHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVP---VLGNCLS-----------------
        WHY     YSVKSGY+++   +   H S+      WWKK+W++ +P KVK FVWK  HN +PA VNL    +    V   C S                 
Subjt:  WHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVP---VLGNCLS-----------------

Query:  --------------------------------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKG
                                                +  W+IWN RN VVH    P P    EW  ++L+ FR  +       +S  D   +    
Subjt:  --------------------------------------ICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKG

Query:  EEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE
        ++  +++DA V      +G+G V+ D  GV+++           PL+ + +A+ +G+++  +  ++R S+ +D L  +  I  +
Subjt:  EEFIMHMDACVMGKQSNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEE

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003106.9e-1640.86Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVL
        M CF++ K +  K+++   +FWW S  +  ++ W  W+ LC+ KE  GGL FRDL  FNQA+LAKQ++R++  P+  +SR+L  +YF   S++
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVL

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein6.8e-2725.95Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKY---------------
        M CF +PK +  +I ++ A FWW +  +   MHW+ W++L   K  GG+ F+D+  FN A+L KQ WR+L+ P   +++V   +Y               
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKY---------------

Query:  -FLSKSVLSGKNL---------GNGQSIYMFQDPWL-PRPTTFKVVSRMDPRMKDATVADFITPS-------LHWDIPKLNKHLVPLDVEVIKGLSISG-
         F+ KS+ + + +         GNG+ I +++  WL  +P +  +  +  P  + A+V+  +  S         W    +      ++ ++I  L   G 
Subjt:  -FLSKSVLSGKNL---------GNGQSIYMFQDPWL-PRPTTFKVVSRMDPRMKDATVADFITPS-------LHWDIPKLNKHLVPLDVEVIKGLSISG-

Query:  TTPDRWIWHYDGRRVYSVKSGY-KLSMLTSQRGHLSDMGRNS--TWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCL
           D + W Y     Y+VKSGY  L+ + ++R    ++   S    ++K+WK     K++ F+WK   NS+P    L   H+     C+
Subjt:  TTPDRWIWHYDGRRVYSVKSGY-KLSMLTSQRGHLSDMGRNS--TWWKKVWKMIVPSKVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.9e-1740.86Show/hide
Query:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVL
        M CF++ K +  K+++   +FWW S  +  ++ W  W+ LC+ KE  GGL FRDL  FNQA+LAKQ++R++  P+  +SR+L  +YF   S++
Subjt:  MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKE-IGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGTGTTTTCAGATTCCCAAAGGTATATTGACTAAAATCTCGGCTCTCTGTGCCAAGTTTTGGTGGGGCTCGAATGGAGATCACTGTCGAATGCATTGGCAACGATG
GGAGAACTTATGTAGGCCAAAGGAGATTGGAGGTTTAAACTTTAGAGATCTTGTCAATTTCAATCAGGCAATGTTGGCGAAGCAAGCATGGCGAGTTTTAACTAATCCAA
ATCTGACGGTTTCAAGAGTTTTATGTGGGAAATATTTCCTGTCGAAATCAGTCCTATCGGGGAAGAACCTAGGGAATGGGCAATCAATTTATATGTTCCAGGACCCGTGG
CTCCCTCGACCTACTACCTTTAAGGTGGTCTCTCGTATGGATCCAAGGATGAAGGACGCGACAGTGGCTGATTTTATTACCCCATCTCTTCATTGGGATATACCCAAACT
TAACAAGCATTTGGTGCCTCTTGATGTGGAGGTTATAAAAGGCTTGTCGATTAGTGGTACGACACCAGATAGATGGATATGGCATTATGATGGTAGAAGGGTGTATTCTG
TTAAGAGTGGGTATAAGCTCTCGATGCTAACCAGCCAGAGGGGACATTTGTCAGATATGGGAAGGAATAGCACTTGGTGGAAGAAGGTGTGGAAGATGATAGTTCCTAGC
AAAGTAAAAGTTTTTGTTTGGAAATCTTTTCATAACTCAATTCCCGCTATGGTTAACCTCTGGAATCATCATGTGCCTGTCTTGGGAAATTGTCTGAGTATCTGTGTGGG
TGTCTGGTCGATATGGAATGATAGGAATAACGTGGTTCATAATCGTCCAATTCCGGATCCAAGGATTAGATGTGAATGGATCAATGATTATCTTTCGAAGTTCCGGATGG
CTAATCCAAATGGCGGTTCGGTTGTTCAGTCAATGGCAGATATTGTTAATATTATATCAAAGGGGGAAGAGTTTATAATGCATATGGACGCTTGTGTAATGGGTAAGCAG
AGTAACGCTGGCATTGGTATTGTTCTGCATGATAAAGACGGTGTGCTAATGGCGGTGCAGAACTTATCGACTATGGCGAACAATTCTCCTTTGGAAGCAAAAGCAGTGGC
GGTCCTTGAAGGGCTACGTTTGGCTAGGAGATTGAATGTGGAGAGACTATCTATTTTGTCAGATTCACTATTGTTGATAAAATCCATTAATGAGGAAACGCAAGTGGAGA
CCTGTATAGCTGTGACTATCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGGTGTTTTCAGATTCCCAAAGGTATATTGACTAAAATCTCGGCTCTCTGTGCCAAGTTTTGGTGGGGCTCGAATGGAGATCACTGTCGAATGCATTGGCAACGATG
GGAGAACTTATGTAGGCCAAAGGAGATTGGAGGTTTAAACTTTAGAGATCTTGTCAATTTCAATCAGGCAATGTTGGCGAAGCAAGCATGGCGAGTTTTAACTAATCCAA
ATCTGACGGTTTCAAGAGTTTTATGTGGGAAATATTTCCTGTCGAAATCAGTCCTATCGGGGAAGAACCTAGGGAATGGGCAATCAATTTATATGTTCCAGGACCCGTGG
CTCCCTCGACCTACTACCTTTAAGGTGGTCTCTCGTATGGATCCAAGGATGAAGGACGCGACAGTGGCTGATTTTATTACCCCATCTCTTCATTGGGATATACCCAAACT
TAACAAGCATTTGGTGCCTCTTGATGTGGAGGTTATAAAAGGCTTGTCGATTAGTGGTACGACACCAGATAGATGGATATGGCATTATGATGGTAGAAGGGTGTATTCTG
TTAAGAGTGGGTATAAGCTCTCGATGCTAACCAGCCAGAGGGGACATTTGTCAGATATGGGAAGGAATAGCACTTGGTGGAAGAAGGTGTGGAAGATGATAGTTCCTAGC
AAAGTAAAAGTTTTTGTTTGGAAATCTTTTCATAACTCAATTCCCGCTATGGTTAACCTCTGGAATCATCATGTGCCTGTCTTGGGAAATTGTCTGAGTATCTGTGTGGG
TGTCTGGTCGATATGGAATGATAGGAATAACGTGGTTCATAATCGTCCAATTCCGGATCCAAGGATTAGATGTGAATGGATCAATGATTATCTTTCGAAGTTCCGGATGG
CTAATCCAAATGGCGGTTCGGTTGTTCAGTCAATGGCAGATATTGTTAATATTATATCAAAGGGGGAAGAGTTTATAATGCATATGGACGCTTGTGTAATGGGTAAGCAG
AGTAACGCTGGCATTGGTATTGTTCTGCATGATAAAGACGGTGTGCTAATGGCGGTGCAGAACTTATCGACTATGGCGAACAATTCTCCTTTGGAAGCAAAAGCAGTGGC
GGTCCTTGAAGGGCTACGTTTGGCTAGGAGATTGAATGTGGAGAGACTATCTATTTTGTCAGATTCACTATTGTTGATAAAATCCATTAATGAGGAAACGCAAGTGGAGA
CCTGTATAGCTGTGACTATCTAG
Protein sequenceShow/hide protein sequence
MGCFQIPKGILTKISALCAKFWWGSNGDHCRMHWQRWENLCRPKEIGGLNFRDLVNFNQAMLAKQAWRVLTNPNLTVSRVLCGKYFLSKSVLSGKNLGNGQSIYMFQDPW
LPRPTTFKVVSRMDPRMKDATVADFITPSLHWDIPKLNKHLVPLDVEVIKGLSISGTTPDRWIWHYDGRRVYSVKSGYKLSMLTSQRGHLSDMGRNSTWWKKVWKMIVPS
KVKVFVWKSFHNSIPAMVNLWNHHVPVLGNCLSICVGVWSIWNDRNNVVHNRPIPDPRIRCEWINDYLSKFRMANPNGGSVVQSMADIVNIISKGEEFIMHMDACVMGKQ
SNAGIGIVLHDKDGVLMAVQNLSTMANNSPLEAKAVAVLEGLRLARRLNVERLSILSDSLLLIKSINEETQVETCIAVTI