CuGenDBv2

Lag0039253 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0039253
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr2:39920727..39924352
RNA-Seq Expression: Lag0039253
Synteny: Lag0039253
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
  IPR001878 - Zinc finger, CCHC-type
  IPR025836 - Zinc knuckle CX2CX4HX4C
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
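The InterPro entry IPR025836 spells out its residue spacing in its name (CX2CX4HX4C), so the spacing can be checked against the protein with a regular expression. The following is a minimal sketch under that assumption, not the method InterPro itself uses (its member databases rely on signature models rather than a plain regex). The function name `find_zinc_knuckles` is hypothetical, and the test fragment is copied from the query sequence shown in the alignments on this page.

```python
import re

# CX2CX4HX4C: Cys, any 2 residues, Cys, any 4 residues, His, any 4 residues, Cys.
ZINC_KNUCKLE = re.compile(r"C.{2}C.{4}H.{4}C")

def find_zinc_knuckles(protein):
    """Return (0-based start, matched text) for each non-overlapping match."""
    return [(m.start(), m.group()) for m in ZINC_KNUCKLE.finditer(protein)]

# Fragment of the Lag0039253 protein as it appears on this page.
fragment = "YERLPDFCYVCGCLGHSRRECDVVGNGTGVWE"
hits = find_zinc_knuckles(fragment)
print(hits)  # [(7, 'CYVCGCLGHSRREC')]
```

A regex match only shows that the spacing is present; it says nothing about whether the residues actually coordinate zinc.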


Homology
GenBank top hits (columns: e value, % identity, alignment)
CCA66044.1 hypothetical protein [Beta vulgaris subsp. vulgaris] (e value: 2.7e-43; % identity: 29.48)
Query:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGESPWR
        M+I+CWN +G+G+ R  R+L K    Y P ++FLS+T +     + LK +LG+ N F V SRGR+GGL + W   +SFSL+S+S + I G +D G   WR
Subjt:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGESPWR

Query:  FTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV----------GAVSANWDHGASMG------------
        F GIYG  + E+K  TW+LM+ L  + + P L+ GDFN I+   E EGG  +    +  FRE +            V   W+ G S+             
Subjt:  FTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV----------GAVSANWDHGASMG------------

Query:  SPLALTLATGRCMS------------CLDKWGRPRMGNYRQR-----------------------------IRAATDQVQQAMKVW--------------
        SP   T+     +             CL +  R R    +QR                             +    D +   +K W              
Subjt:  SPLALTLATGRCMS------------CLDKWGRPRMGNYRQR-----------------------------IRAATDQVQQAMKVW--------------

Query:  VIQDLALICRVQRND-SSLTSRGRDVLETEIQ--------GW--------------------------RRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFE
        V  DL   CR+Q+   SS     R  LE ++          W                          +++N V+G  D+ G W ++ D +  +   YF 
Subjt:  VIQDLALICRVQRND-SSLTSRGRDVLETEIQ--------GW--------------------------RRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFE

Query:  HIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG
         I+T++NPS+ +++  +  V P V +E N+ LL+PF +EE+ +AL Q HP KAPGPDG
Subjt:  HIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG

CCA66054.1 hypothetical protein [Beta vulgaris subsp. vulgaris] (e value: 2.9e-45; % identity: 28.01)
Query:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGESPWR
        M+I+CWN +GLG+  + R+L     Q+ P ++F+S+T +  +  + LK  LG+ N F V S GR+GGL L W   V FSL+S+S + I G V+ G   WR
Subjt:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGESPWR

Query:  FTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSA----------NWDHGAS--------------
        F G+YG  + E+K  TW+L++HL  +++ P L+ GDFN IL   E EGG ++   E++ FR+ +  ++            W+ G S              
Subjt:  FTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSA----------NWDHGAS--------------

Query:  --------------------------------MGSPLALT-----------------------------LATGRCMS---CLDKWGRPRMGNYRQRIRAA
                                         G P   T                             + TGR  S   CL +W   +  N  ++I  A
Subjt:  --------------------------------MGSPLALT-----------------------------LATGRCMS---CLDKWGRPRMGNYRQRIRAA

Query:  TDQVQQAMKVWVIQDLALICRVQRNDSS-----------LTSRGRDVLETE---------IQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYT
           +  A    + +     C +                 L SR  +V + +             +++N V+G  D  G W+++ D +  +   YF  I+T
Subjt:  TDQVQQAMKVWVIQDLALICRVQRNDSS-----------LTSRGRDVLETE---------IQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYT

Query:  TSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPI
        +SNPS+  ++  M  + P V +E N KLL PF ++E+  AL+Q HP KAPGPDG  +
Subjt:  TSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPI

XP_006487889.1 uncharacterized protein LOC102617714 [Citrus sinensis] (e value: 2.3e-47; % identity: 31.84)
Query:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWV-DVGESPW
        M I+ WNVQGLG +R FR   KL+Q+ RPQ++FLS+TK+     +  +  L + N F VD  G  GGL LLWT  VS  + SYS + ID  + +   S W
Subjt:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWV-DVGESPW

Query:  RFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV----------GAVSANWDHGASMGSPLALTLATGR
        R T +YGHP++EQK  TW+L++ L G S+ PWL  GDFN I   +E  GG  +    +  FR+AV                W +  +  + +        
Subjt:  RFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV----------GAVSANWDHGASMGSPLALTLATGR

Query:  CMSCLDKWGRPRMGNYRQRIRAATDQVQQAMKVW------------------VIQDLALICRVQRNDSSLTSRG---RDVLETEIQGWRRKNEVRGFEDS
         +  L  W +   G  ++++    ++++     +                  ++QD  +  + QR+ +     G         +    R+KN + G  D 
Subjt:  CMSCLDKWGRPRMGNYRQRIRAATDQVQQAMKVW------------------VIQDLALICRVQRNDSSLTSRG---RDVLETEIQGWRRKNEVRGFEDS

Query:  DGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFP
         G W +D D V  +   +F  +++T+ P+ E++D A       V +EMN +L  PF +EE+  AL Q  P KAPGPDG P
Subjt:  DGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFP

XP_023913015.1 uncharacterized protein LOC112024634 [Quercus suber] (e value: 8.6e-42; % identity: 31.78)
Query:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVG-ESPW
        MS + WN QGLG  +    L  LV    P++VFL +TK++    + +  ++ + N F V    R GGL LLW   +S  + +YS + ID +++ G +  W
Subjt:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVG-ESPW

Query:  RFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSANWDHGASMGSPLA----------------L
        RF G YG P    +  +W+L+K L   S+ PW+V GDFN IL   E +GG   P  ++L FREA+   S   D G + G P                  L
Subjt:  RFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSANWDHGASMGSPLA----------------L

Query:  TLATGRCM-SCLDKWGRPRMGNYRQRIRAATDQVQQAMK----------VWVIQDLALICRVQRNDSSLTSRGRDVLETEIQGWR-------------RK
         L   R +   L +W R   G+    ++ +  +++QA +          ++ +Q+   I +++R +  +  R R  L+   +G +             ++
Subjt:  TLATGRCM-SCLDKWGRPRMGNYRQRIRAATDQVQQAMK----------VWVIQDLALICRVQRNDSSLTSRGRDVLETEIQGWR-------------RK

Query:  NEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG
        N + G ED  G WQ +  R+  +IE YF  ++TT +P+    D  +  + PS+ DEMN+ L R F  EEV  ALKQ  P  APGPDG
Subjt:  NEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG

XP_031095115.1 uncharacterized protein LOC115999403 [Ipomoea triloba] (e value: 1.0e-42; % identity: 32.23)
Query:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDG--WVDVGESP
        MS++ WN +GLG+  A R L  L++  RP +VFL +T +     + ++ +LG+PN   VD++G  GGL LLWT ++   +  +S N ID    VDVG   
Subjt:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDG--WVDVGESP

Query:  WRFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV---GAV-------SANWDHGASMGSPLA------
        WRFTG YG P+  ++  +W  +++L G S+ PW+V GDFN +L  HE  G        L GF+EAV   G V          W+      + +       
Subjt:  WRFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV---GAV-------SANWDHGASMGSPLA------

Query:  ------LTL---ATGRCMSCLDKWGRPRM------GNYRQRIRAATDQVQQAMKVWVIQDLALICR--VQRNDS---SLTSRGR-DVLETE-----IQGW
              LTL   AT   +SC      P +       +  +R R   D       +W+ ++   +CR  VQ + S    L+   R ++LET+     ++GW
Subjt:  ------LTL---ATGRCMSCLDKWGRPRM------GNYRQRIRAATDQVQQAMKVWVIQDLALICR--VQRNDS---SLTSRGR-DVLETE-----IQGW

Query:  RRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGF
        RR+N++   ++  GVW +D D + G++  YF  ++   +    ++D+ +  +   V    N  L+RP   EEV+ A+ Q HP+K+PGPDGF
Subjt:  RRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGF
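In the BLAST-style blocks above, the unlabeled middle line repeats the residue where query and hit are identical, shows `+` for a conservative substitution, and is blank otherwise. A minimal sketch of how identities could be counted from such a line follows. The function name `midline_stats` is hypothetical, and dividing by the number of alignment columns is an assumption about how the % identity column is defined. Applied to one block, it gives a figure for that block only, not the whole-alignment value reported in the table.

```python
def midline_stats(midline, aligned_length):
    """Count identities and positives in a BLAST-style midline.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps. aligned_length is the number of
    alignment columns; it is passed explicitly because trailing spaces in
    a midline are easily lost when text is copied.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, 100.0 * identities / aligned_length

# First seven columns of the CCA66044.1 alignment shown above.
query = "MSIMCWN"
midline = "M+I+CWN"
ident, pos, pct = midline_stats(midline, len(query))
print(ident, pos, round(pct, 2))  # 5 7 71.43
```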

TrEMBL top hits (columns: e value, % identity, alignment)
A0A2N9FB16 CCHC-type domain-containing protein (e value: 1.5e-44; % identity: 27.76)
Query:  YERLPDFCYVCGCLGHSRRECDVVGNGTGVWE---GEQYGDWLRAGVSLGEGSRRFQEG----KSEGGGNQSVGDVGAGDRLEVVSQVGDVGEGGTVGGS
        YE LP FCY CG LGH   EC VVG G  + E   GE++G WLRA  +     RR +EG      EG  N       A +          V   GT  GS
Subjt:  YERLPDFCYVCGCLGHSRRECDVVGNGTGVWE---GEQYGDWLRAGVSLGEGSRRFQEG----KSEGGGNQSVGDVGAGDRLEVVSQVGDVGEGGTVGGS

Query:  THKGKGKQKVGNSLVESRATYVGKEAMVVDEVVGGESAGVEKGVKATGSRSWKRKARDALTDISNKEISMELAGGPSVAMSIMCWNVQGLGSTRAFRRLY
        T                   +    A VV+ V+    +  +   K +G         D + ++  KE+       P V M  +  N +GLG+ +    L+
Subjt:  THKGKGKQKVGNSLVESRATYVGKEAMVVDEVVGGESAGVEKGVKATGSRSWKRKARDALTDISNKEISMELAGGPSVAMSIMCWNVQGLGSTRAFRRLY

Query:  KLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGES-PWRFTGIYGHPQAEQKARTWALM
         LV++  P +VFL +T++     + L+V+LG      V+  G+ GGL LLW  +V  ++ SYS + IDG V   +   WR TG YG+P+A  + R+W+L+
Subjt:  KLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGES-PWRFTGIYGHPQAEQKARTWALM

Query:  KHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSANWDHGASMGSPLALTLATGRCMSCLDKWGRPRMGNYRQRIRAATDQVQQAM
        +HLR  S  PW++ GDFN I    E  G   + A ++  FREA+   S   D G +    L L   T +  + + +  R RM  + +     +   +   
Subjt:  KHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSANWDHGASMGSPLALTLATGRCMSCLDKWGRPRMGNYRQRIRAATDQVQQAM

Query:  KVWVIQDLALI----------CRVQ----------------------------RNDSSLTSRGRDVLETEIQG--------WRRK---------------
          W +Q +             CR++                            +      SR  ++++ ++ G        WR++               
Subjt:  KVWVIQDLALI----------CRVQ----------------------------RNDSSLTSRGRDVLETEIQG--------WRRK---------------

Query:  -----------NEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG
                   N + G  D    W+ +P  V  +   YF  ++ +SNP    ID+ +  V   V   MN+ L+RPF QEE++ AL Q HP+K+PGPDG
Subjt:  -----------NEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG

A0A2N9FNQ8 Uncharacterized protein (e value: 1.3e-43; % identity: 28.4)
Query:  GTVGGSTHKGKGKQKVGNSLVESRATYVGKEAMVVDEVVGG-----ESAGVEKGVKATGSRSWKR---KARDALTDI----------SNKEISMELAGGP
        GT   +  + + K   G  L E R   +      V+EV+ G      S  V K  K T + +WKR   + +D L D+           N +I ++     
Subjt:  GTVGGSTHKGKGKQKVGNSLVESRATYVGKEAMVVDEVVGG-----ESAGVEKGVKATGSRSWKR---KARDALTDI----------SNKEISMELAGGP

Query:  SVAMSIMCWNVQ--GLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVG
        +V       +VQ  GLG+ +A R L+ +V+   P+++FL +TK+ +   + +++KLGY N F V S GRSGGL LLW       + +YS + ID  VD  
Subjt:  SVAMSIMCWNVQ--GLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVG

Query:  ESP-WRFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGA----------------------------
        ++  WR TG YG P+  ++  +WAL+KHL    + PWL  GDFN +L  HE  GG  +   ++L F+EAV A                            
Subjt:  ESP-WRFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGA----------------------------

Query:  --------------VSANWDHGASMGSPL-ALTLATGRCMSCLDKWGRPRMGNYRQRIRAATDQVQQAMKVWVIQDLALICRVQRNDSSLTSRGRDVLET
                      + ++W+  A +GSP+  L     RC   L  W R   G     ++      Q++ + W+                   R       
Subjt:  --------------VSANWDHGASMGSPL-ALTLATGRCMSCLDKWGRPRMGNYRQRIRAATDQVQQAMKVWVIQDLALICRVQRNDSSLTSRGRDVLET

Query:  EIQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPISS
        + +  R KN +RG  DS+G W +D D +  +   YF  I+++S  +  E+++ +  +   V  +MN +LL PF   E++ A  Q HP+K+PGPDG+ +  
Subjt:  EIQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPISS

Query:  KRDDDHL
        K +  H+
Subjt:  KRDDDHL

A0A2N9IPS8 Reverse transcriptase domain-containing protein (e value: 8.1e-46; % identity: 27.12)
Query:  KGKGKQKVGNSLVESRATYVGKEAMVVDEVVGGESAGVEKGVKATGSRSWKRKARD--ALTDI---SNKEISMELAGGPSVAMSIMCWNVQGLGSTRAFR
        K   K    NS VE+  T       +  +  G  ++GV       G  +WKR AR+    T++   ++K +       P   M ++ WN QGLG+    R
Subjt:  KGKGKQKVGNSLVESRATYVGKEAMVVDEVVGGESAGVEKGVKATGSRSWKRKARD--ALTDI---SNKEISMELAGGPSVAMSIMCWNVQGLGSTRAFR

Query:  RLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGE--SPWRFTGIYGHPQAEQKART
         L  L+++  P ++FLS+T++  +G + L+V + +   FCV  RG  GGL +LW   +   L +YS N ID  +   E    +R TG YG+P+  ++  +
Subjt:  RLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGE--SPWRFTGIYGHPQAEQKART

Query:  WALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV---------------------------------------------GAVSAN-
        WAL+KHL   S++PWL  GDFN IL ++E  G   +P  ++  FREAV                                             G+V ++ 
Subjt:  WALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV---------------------------------------------GAVSAN-

Query:  ----------------------------------------------WDHGASMGSPLALTLATGR-CMSCLDKWGRPRMGNYRQRIRAATDQVQQAMK--
                                                      W  G + GSP+ + +   + C + L  W R R G+    I+   +Q+Q  +   
Subjt:  ----------------------------------------------WDHGASMGSPLALTLATGR-CMSCLDKWGRPRMGNYRQRIRAATDQVQQAMK--

Query:  -------VWVIQD-------LALICRVQRNDSSLTSRG---RDVLETEIQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQA
               +  +QD          I   QR+  +  S G         +    RR N + G  D DGVWQ +  ++  +   YF+ I+T+SNPS E I   
Subjt:  -------VWVIQD-------LALICRVQRNDSSLTSRG---RDVLETEIQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQA

Query:  MLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG
        +  +   V + MN +L   F ++EV LALKQ +P KAPGPDG
Subjt:  MLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDG

A0A7N2M6Y1 CCHC-type domain-containing protein (e value: 5.1e-48; % identity: 25.54)
Query:  YERLPDFCYVCGCLGHSRRECDVVGNGTGVWEGEQYGDWLRAGVSLGEGSRRF---QEGKSEGGGNQSVG--------DVGAGDRLEVVSQVGDVGEGG-
        YERLP FCY CG LGH  + C V  N      G QYG+WL+AG +L +G   F   Q+  +E  G  S+G        + G+G     ++  G  G    
Subjt:  YERLPDFCYVCGCLGHSRRECDVVGNGTGVWEGEQYGDWLRAGVSLGEGSRRF---QEGKSEGGGNQSVG--------DVGAGDRLEVVSQVGDVGEGG-

Query:  --TVGGSTHKGKGKQKVGNS----LVESRATYVGKEAMVVDEVVGGESAG----------------------------VEKGVKATGSRSWKRKARD---
           VGG T   KG   + ++     V+S     G++    D+V     +G                             E+G+  T  R +KR ARD   
Subjt:  --TVGGSTHKGKGKQKVGNS----LVESRATYVGKEAMVVDEVVGGESAG----------------------------VEKGVKATGSRSWKRKARD---

Query:  --------ALTDISNKEISME--LAGGPSVAMSIMC--WNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGG
                   ++SNK  ++   L      A   +C  WN +GLG+ R+ R L +LVQ+++P +VFLS+TK++    + +K K+G  N   V S GRSGG
Subjt:  --------ALTDISNKEISME--LAGGPSVAMSIMC--WNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGG

Query:  LVLLWTRAVSFSLISYSLNPIDGWVDVGES--PWRFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV-
        L +LW R +   + SYS   ID  V   ES   WR TG YG+P+  ++  +W  ++ L      PWL  GDFN ++   E  GG  +   ++  FRE + 
Subjt:  LVLLWTRAVSFSLISYSLNPIDGWVDVGES--PWRFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAV-

Query:  -------------------------------------------------------------------------------------------GAVSANWDH
                                                                                                     +   W  
Subjt:  -------------------------------------------------------------------------------------------GAVSANWDH

Query:  GASMGSPLALTLATGRCMSCLDKWGRPRMGNYRQRIRAATDQVQQAM-----------------KVWVIQDLALICRVQRND---SSLTSRGRDVLETEI
           + SP  +      C   L KW +   G   ++I+   + +   +                 ++  + D   I   QR+      L  R      ++ 
Subjt:  GASMGSPLALTLATGRCMSCLDKWGRPRMGNYRQRIRAATDQVQQAM-----------------KVWVIQDLALICRVQRND---SSLTSRGRDVLETEI

Query:  QGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPIS
           RRKN +    D  GVW   PD +  +   YF+++Y+T+ P+   I + +  +P  V ++MN  L++ F +EE+E+AL Q HP KAPGPD   +S
Subjt:  QGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPIS

F4NCJ6 Reverse transcriptase domain-containing protein (e value: 1.4e-45; % identity: 28.01)
Query:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGESPWR
        M+I+CWN +GLG+  + R+L     Q+ P ++F+S+T +  +  + LK  LG+ N F V S GR+GGL L W   V FSL+S+S + I G V+ G   WR
Subjt:  MSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTLGFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGESPWR

Query:  FTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSA----------NWDHGAS--------------
        F G+YG  + E+K  TW+L++HL  +++ P L+ GDFN IL   E EGG ++   E++ FR+ +  ++            W+ G S              
Subjt:  FTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSKPAGELLGFREAVGAVSA----------NWDHGAS--------------

Query:  --------------------------------MGSPLALT-----------------------------LATGRCMS---CLDKWGRPRMGNYRQRIRAA
                                         G P   T                             + TGR  S   CL +W   +  N  ++I  A
Subjt:  --------------------------------MGSPLALT-----------------------------LATGRCMS---CLDKWGRPRMGNYRQRIRAA

Query:  TDQVQQAMKVWVIQDLALICRVQRNDSS-----------LTSRGRDVLETE---------IQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYT
           +  A    + +     C +                 L SR  +V + +             +++N V+G  D  G W+++ D +  +   YF  I+T
Subjt:  TDQVQQAMKVWVIQDLALICRVQRNDSS-----------LTSRGRDVLETE---------IQGWRRKNEVRGFEDSDGVWQQDPDRVLGLIEGYFEHIYT

Query:  TSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPI
        +SNPS+  ++  M  + P V +E N KLL PF ++E+  AL+Q HP KAPGPDG  +
Subjt:  TSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPI

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCTGGGTCGCAGGGGAATGTCGGGGCCTTAAGTGTCGAATCCGGATTTCGAATCCTGGGCCTGGGGCATTACAGATGGTATCAGAACGGAACCTCTCCCAGTAGGAT
GTGGTTCGGGGACGAACCAAGGCGGAAGCTGGTGGGCATAGATCTCTCCCAAAAGACTCCCACAAGTCTCCTGCTTCAAGAAGTCTTAGAGATATACCGGTGTAGCCGAT
TGGTGGTGTTCGCACAAGAGGGTTGCTGCATTTTCAATCTTATTGGCAAGAAAAGGCGAATTGATCAAGGCTTTCTACAAAAGTTCGGTAATACTTTCCACTGCGTGGGG
TTTTTATCCCTTCAGTTGTATGAACGACTACCAGATTTTTGTTACGTGTGTGGTTGTCTGGGGCACTCGAGGAGGGAGTGCGATGTAGTGGGTAATGGTACTGGGGTGTG
GGAAGGTGAGCAGTATGGTGATTGGTTAAGGGCAGGTGTAAGTTTAGGGGAGGGTAGTAGGAGGTTTCAGGAAGGGAAGTCTGAGGGTGGTGGAAACCAAAGTGTGGGAG
ATGTGGGAGCTGGGGATAGGTTGGAGGTAGTTTCACAGGTGGGTGATGTCGGGGAAGGGGGGACTGTGGGAGGGTCGACACACAAGGGGAAAGGGAAACAGAAAGTGGGG
AATTCTTTGGTGGAGTCTAGGGCCACCTATGTGGGGAAGGAAGCGATGGTTGTCGATGAGGTGGTTGGTGGGGAGAGTGCGGGGGTAGAGAAGGGGGTTAAGGCGACTGG
TAGCAGAAGTTGGAAGAGAAAGGCGAGGGATGCATTGACGGATATCTCTAACAAGGAAATCTCTATGGAACTGGCAGGGGGCCCGTCGGTTGCCATGAGTATCATGTGCT
GGAATGTGCAAGGGTTGGGGTCGACTCGAGCATTCCGTAGGTTGTACAAGCTGGTGCAACAATACCGACCTCAGATGGTGTTCTTGTCTGACACGAAGGTGCAGACACTG
GGGTTTGATGTGCTGAAAGTTAAGCTAGGGTACCCAAACTATTTTTGTGTGGACAGTAGAGGGAGGAGTGGTGGTTTGGTGCTATTGTGGACCAGAGCGGTAAGTTTCAG
TCTTATCTCGTACTCCCTTAACCCCATAGATGGATGGGTAGATGTAGGAGAGAGCCCGTGGAGGTTCACTGGGATTTACGGTCATCCCCAAGCTGAGCAGAAGGCAAGAA
CATGGGCACTGATGAAGCACCTTCGAGGGAATAGTACGACCCCGTGGCTTGTCGAGGGAGATTTTAATGCAATTCTGTTTGACCATGAGAATGAGGGTGGAAGGTCGAAG
CCGGCTGGGGAATTACTTGGATTTCGGGAGGCAGTGGGAGCGGTGTCAGCTAACTGGGATCACGGTGCTTCGATGGGATCCCCGCTGGCATTGACATTGGCAACGGGAAG
GTGTATGTCTTGTCTTGACAAATGGGGGAGGCCAAGGATGGGAAATTATAGACAGCGTATTCGGGCGGCGACAGACCAGGTTCAACAGGCGATGAAAGTTTGGGTAATAC
AAGATCTCGCTCTGATCTGCAGAGTGCAGAGGAACGACTCGAGTCTTACTAGTAGAGGAAGAGATGTACTGGAAACAGAGATCCAGGGATGGAGGAGGAAGAATGAGGTA
AGGGGTTTCGAGGATAGTGATGGGGTTTGGCAGCAGGATCCGGATAGAGTCTTGGGGCTAATAGAGGGGTATTTTGAACACATTTATACTACGTCGAACCCCTCGGAGGA
GGAAATAGATCAAGCCATGTTGCGTGTTCCCCCCTCAGTTATAGACGAGATGAATAGTAAACTTCTTAGGCCATTCCAGCAGGAAGAAGTGGAACTTGCCCTAAAGCAAA
ACCACCCTAATAAAGCTCCAGGCCCAGACGGGTTCCCCATCTCCTCTAAACGAGACGATGATCATCTTGATACCGAAAAAGAGGAGTCCCAGACGGGTATCTGA
mRNA sequence
ATGGCTGGGTCGCAGGGGAATGTCGGGGCCTTAAGTGTCGAATCCGGATTTCGAATCCTGGGCCTGGGGCATTACAGATGGTATCAGAACGGAACCTCTCCCAGTAGGAT
GTGGTTCGGGGACGAACCAAGGCGGAAGCTGGTGGGCATAGATCTCTCCCAAAAGACTCCCACAAGTCTCCTGCTTCAAGAAGTCTTAGAGATATACCGGTGTAGCCGAT
TGGTGGTGTTCGCACAAGAGGGTTGCTGCATTTTCAATCTTATTGGCAAGAAAAGGCGAATTGATCAAGGCTTTCTACAAAAGTTCGGTAATACTTTCCACTGCGTGGGG
TTTTTATCCCTTCAGTTGTATGAACGACTACCAGATTTTTGTTACGTGTGTGGTTGTCTGGGGCACTCGAGGAGGGAGTGCGATGTAGTGGGTAATGGTACTGGGGTGTG
GGAAGGTGAGCAGTATGGTGATTGGTTAAGGGCAGGTGTAAGTTTAGGGGAGGGTAGTAGGAGGTTTCAGGAAGGGAAGTCTGAGGGTGGTGGAAACCAAAGTGTGGGAG
ATGTGGGAGCTGGGGATAGGTTGGAGGTAGTTTCACAGGTGGGTGATGTCGGGGAAGGGGGGACTGTGGGAGGGTCGACACACAAGGGGAAAGGGAAACAGAAAGTGGGG
AATTCTTTGGTGGAGTCTAGGGCCACCTATGTGGGGAAGGAAGCGATGGTTGTCGATGAGGTGGTTGGTGGGGAGAGTGCGGGGGTAGAGAAGGGGGTTAAGGCGACTGG
TAGCAGAAGTTGGAAGAGAAAGGCGAGGGATGCATTGACGGATATCTCTAACAAGGAAATCTCTATGGAACTGGCAGGGGGCCCGTCGGTTGCCATGAGTATCATGTGCT
GGAATGTGCAAGGGTTGGGGTCGACTCGAGCATTCCGTAGGTTGTACAAGCTGGTGCAACAATACCGACCTCAGATGGTGTTCTTGTCTGACACGAAGGTGCAGACACTG
GGGTTTGATGTGCTGAAAGTTAAGCTAGGGTACCCAAACTATTTTTGTGTGGACAGTAGAGGGAGGAGTGGTGGTTTGGTGCTATTGTGGACCAGAGCGGTAAGTTTCAG
TCTTATCTCGTACTCCCTTAACCCCATAGATGGATGGGTAGATGTAGGAGAGAGCCCGTGGAGGTTCACTGGGATTTACGGTCATCCCCAAGCTGAGCAGAAGGCAAGAA
CATGGGCACTGATGAAGCACCTTCGAGGGAATAGTACGACCCCGTGGCTTGTCGAGGGAGATTTTAATGCAATTCTGTTTGACCATGAGAATGAGGGTGGAAGGTCGAAG
CCGGCTGGGGAATTACTTGGATTTCGGGAGGCAGTGGGAGCGGTGTCAGCTAACTGGGATCACGGTGCTTCGATGGGATCCCCGCTGGCATTGACATTGGCAACGGGAAG
GTGTATGTCTTGTCTTGACAAATGGGGGAGGCCAAGGATGGGAAATTATAGACAGCGTATTCGGGCGGCGACAGACCAGGTTCAACAGGCGATGAAAGTTTGGGTAATAC
AAGATCTCGCTCTGATCTGCAGAGTGCAGAGGAACGACTCGAGTCTTACTAGTAGAGGAAGAGATGTACTGGAAACAGAGATCCAGGGATGGAGGAGGAAGAATGAGGTA
AGGGGTTTCGAGGATAGTGATGGGGTTTGGCAGCAGGATCCGGATAGAGTCTTGGGGCTAATAGAGGGGTATTTTGAACACATTTATACTACGTCGAACCCCTCGGAGGA
GGAAATAGATCAAGCCATGTTGCGTGTTCCCCCCTCAGTTATAGACGAGATGAATAGTAAACTTCTTAGGCCATTCCAGCAGGAAGAAGTGGAACTTGCCCTAAAGCAAA
ACCACCCTAATAAAGCTCCAGGCCCAGACGGGTTCCCCATCTCCTCTAAACGAGACGATGATCATCTTGATACCGAAAAAGAGGAGTCCCAGACGGGTATCTGA
Protein sequence
MAGSQGNVGALSVESGFRILGLGHYRWYQNGTSPSRMWFGDEPRRKLVGIDLSQKTPTSLLLQEVLEIYRCSRLVVFAQEGCCIFNLIGKKRRIDQGFLQKFGNTFHCVG
FLSLQLYERLPDFCYVCGCLGHSRRECDVVGNGTGVWEGEQYGDWLRAGVSLGEGSRRFQEGKSEGGGNQSVGDVGAGDRLEVVSQVGDVGEGGTVGGSTHKGKGKQKVG
NSLVESRATYVGKEAMVVDEVVGGESAGVEKGVKATGSRSWKRKARDALTDISNKEISMELAGGPSVAMSIMCWNVQGLGSTRAFRRLYKLVQQYRPQMVFLSDTKVQTL
GFDVLKVKLGYPNYFCVDSRGRSGGLVLLWTRAVSFSLISYSLNPIDGWVDVGESPWRFTGIYGHPQAEQKARTWALMKHLRGNSTTPWLVEGDFNAILFDHENEGGRSK
PAGELLGFREAVGAVSANWDHGASMGSPLALTLATGRCMSCLDKWGRPRMGNYRQRIRAATDQVQQAMKVWVIQDLALICRVQRNDSSLTSRGRDVLETEIQGWRRKNEV
RGFEDSDGVWQQDPDRVLGLIEGYFEHIYTTSNPSEEEIDQAMLRVPPSVIDEMNSKLLRPFQQEEVELALKQNHPNKAPGPDGFPISSKRDDDHLDTEKEESQTGI
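The protein sequence above should correspond to a codon-by-codon translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code applies to this nuclear gene. The helper name `translate` is hypothetical, and only the first 30 and last 15 bases of the CDS listed on this page are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence codon by codon; '*' marks a stop.

    Raises KeyError on characters other than A, C, G, T.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore an incomplete trailing codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

cds_start = "ATGGCTGGGTCGCAGGGGAATGTCGGGGCC"  # first 30 bases of the CDS
cds_end = "CAGACGGGTATCTGA"                  # last 15 bases of the CDS
print(translate(cds_start))  # MAGSQGNVGA
print(translate(cds_end))    # QTGI*
```

Both results agree with the start (MAGSQGNVGA...) and end (...QTGI) of the protein sequence listed above, with the final TGA read as the stop codon.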