; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0038384 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0038384
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon protein
Genome locationchr2:16471809..16476428
RNA-Seq ExpressionLag0038384
SyntenyLag0038384
Gene Ontology termsNA
InterPro domainsIPR024752 - Myb/SANT-like domain
IPR027806 - Harbinger transposase-derived nuclease domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]3.0e-9350.69Show/hide
Query:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR
        MVL +V+ LHE LLK+P+PV + C D++W+WF+NCLGALDGTYIKVNV   DR RYRTRK E+ATNVLGVC     F+YVL G EGSA+DSR+LRDA+SR
Subjt:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR

Query:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK---------------QGYQPRT----------------------PVKVQCRMTTACCLIHN
         N LK+PKG YYL D GY N EGFLAPYRGQRYHL +W+               + Y  R                       PV+VQCR   ACCL+HN
Subjt:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK---------------QGYQPRT----------------------PVKVQCRMTTACCLIHN

Query:  LIRREMPVDPLE---QEVGDTH-----------------SNMDDN--EPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEK
        LI REM    +E    EV  TH                 S   DN  E     SSR  K  W+K E+  LVECL+EL N G W++DNGTF+PG+L Q+ +
Subjt:  LIRREMPVDPLE---QEVGDTH-----------------SNMDDN--EPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEK

Query:  WIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW
         +  KIP  ++ A   I+SR+K++K+ ++A+AEM GPNCSGFGWND  KCI AEK +FD+W
Subjt:  WIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW

GFS42850.1 hypothetical protein Acr_00g0082040 [Actinidia rufa]7.3e-9238.87Show/hide
Query:  VLNSVLHLHELLLKQPEPVHSNCLDE--KWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVS
        VL+SVL L  +LLK PEP+ +NC DE  +W+WF+NCLGALDGTY+KV V  +D+PRYRTRK +IATNVL VCSQDMQFIYVLPG EGSASDSRVLRDA++
Subjt:  VLNSVLHLHELLLKQPEPVHSNCLDE--KWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVS

Query:  RRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHN
        R+NGL++P G YYL DAGYTNGEGFLAPYRGQRYHL+ W+ G  P T                                    P+K Q R+  ACCL+HN
Subjt:  RRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHN

Query:  LIRREMPVDPLEQEVGDTHSN--------MDDNEPS---------------------------KAGSSRKRMWSKAEDEKLVECLLELSNIGT-WKADNG
        LI+REMPVDP E  + +   +        +D  EPS                            A SS +R W+K E+E L+ C+ +L +  T WK D G
Subjt:  LIRREMPVDPLEQEVGDTHSN--------MDDNEPS---------------------------KAGSSRKRMWSKAEDEKLVECLLELSNIGT-WKADNG

Query:  TFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCI------------EAEKHI----------FDEWV----
         FK GF  + EK I    P  DL+A PHIES++K+ ++QY+ + +ML    SGFGW+D +K I              EK +          +++W+    
Subjt:  TFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCI------------EAEKHI----------FDEWV----

Query:  --KGQWTWCRRPTDMFEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENI
          +      + PTD   A+E + T  E         ++V +    +  SMS A   T S   +S  +SKKR ++ + +   +  + +       N N  +
Subjt:  --KGQWTWCRRPTDMFEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENI

Query:  QEIALFYRQVAERESTREERR-NSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNLPADESI
         EIA  YR     + +++ R+ N+ +S++     L   QR+RA  +I +D  ++D FF+L  +E +
Subjt:  QEIALFYRQVAERESTREERR-NSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNLPADESI

XP_026662506.2 uncharacterized protein LOC113463064 [Phoenix dactylifera]1.1e-9542.09Show/hide
Query:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR
        +VLN VL LH +LL++PE V  N  DE+WK FKNCLGALDGTYIKVNV  +++PRYR RK EIATNVLGVC++DMQFIY+LP  EGSA+D R+LRDA+ R
Subjt:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR

Query:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRTPVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSNMDDNEPSKAGSSRKRM
        RNGLK+P+  YYL +AGY N EGFLAPYRGQRYHLN+W+Q  QP    +             N+I R      +E+++     N   N  +   +  KR+
Subjt:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRTPVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSNMDDNEPSKAGSSRKRM

Query:  WSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW
        W+K ED KLVECL+++ N G WK DNG F+PGF   +E+ + +K+P C L+  PH+E+ VK+LKKQYNAIAEMLGPNC  FGWNDRDKC+ A+K ++D W
Subjt:  WSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW

Query:  VK---------------------------GQWTWCRRPTDMFEAVER--EMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSS---K
        +K                                   P D  E +E+  E   +    GD           +D   S+ +      STA    +      
Subjt:  VK---------------------------GQWTWCRRPTDMFEAVER--EMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSS---K

Query:  KRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYF
        K+ K  D ++  +      +SS      E+ ++IA F+    E++   +ERR +L  EI K++ LS    + AG  + K   ++  F
Subjt:  KRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYF

XP_028060687.1 uncharacterized protein LOC114264281 [Camellia sinensis]2.5e-9237.72Show/hide
Query:  VLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRR
        VL +VL  H LLLK+PEP+ +NC D++W  F+NCLGALDGTY+KV   ++D+PRYRTRK EIATNVLGVCSQDMQFIYVLPG EGSASDSRVLRDAVSR 
Subjt:  VLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRR

Query:  NGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHNLI
        NGLK+P G YYL DAGYTNGEGFLAPYRGQ YHL+ W++G  P T                                    P+K Q R+ TACCL+HNLI
Subjt:  NGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHNLI

Query:  RREMPVD------------PLEQEVGD----------------------------THSNMDD--NEPSKAGSSRKRMWSKAEDEKLVECLLE-LSNIGTW
        +REMP+D            PL  E+GD                                MDD     S+  +  +R W+  E+  L+  + + +++   W
Subjt:  RREMPVD------------PLEQEVGD----------------------------THSNMDD--NEPSKAGSSRKRMWSKAEDEKLVECLLE-LSNIGTW

Query:  KADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKH----------------------IFDEW
        K DN  F+ GF  + EK I+   P  DL+A PHI+S++K  +KQYNA+ +ML  N SGFGWND  K +  +                         +++W
Subjt:  KADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKH----------------------IFDEW

Query:  V------KGQWTWCRRPTDMFEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSN
        +      +        P D   A+E+E  +     G+ S V        D   SMS A  N  + A  S+++ KKRA+  + +   ++ +   + S + N
Subjt:  V------KGQWTWCRRPTDMFEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSN

Query:  ANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNLPADESI
         N  + E+A    ++       ++RR  + +E+ K+  +S  QR+ A  +I KD+ ++D FF+L  ++ +
Subjt:  ANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNLPADESI

XP_042426186.1 uncharacterized protein LOC122014061 [Zingiber officinale]3.9e-10152.79Show/hide
Query:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR
        +VLNSVL LH +LLK+ EP+  NC +E+WKWFK C GALDGTYI VN  I D+PRYRTR  EIATNVLGVC+ +MQF Y+LPG EGSA+D RVLRDA+SR
Subjt:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR

Query:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHNL
        RNGLKIP+GCYYL DAGYTNGEGFLAPYRGQRYHL +W+QGYQP T                                     VK QCR+ +ACC++ N 
Subjt:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHNL

Query:  IRREMPVDPLEQEV--------------GDTHSNMDDNEPSKA------GSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEK
        IR EM +DP+E E+              G     ++  E S A        + K +W+K ED  LV+CL+ELS    WK++NG F+ G+L+ +EK +  K
Subjt:  IRREMPVDPLEQEV--------------GDTHSNMDDNEPSKA------GSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEK

Query:  IPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEWVK
        +P   LKA PHIESR K+LK+Q+ AI EML  + SGFGWND +KCI   K +FDEWVK
Subjt:  IPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEWVK

TrEMBL top hitse value%identityAlignment
A0A5A7SWD8 Retrotransposon protein2.5e-9051.33Show/hide
Query:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR
        MVL +V+ LH+ LLK+P+PV + C D++W+WF+NCLGALDGTYIKVNV   DR RYRTRK E+ATNVLGV      F+YVL G EGSA+DSR+LRDA+SR
Subjt:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR

Query:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK-QGYQPRT------------------------------------PVKVQCRMTTACCLIHN
         N LK+PKG YYL DAGY N EGFLAPYRGQRYHL +W+     P T                                    PV+VQC    ACCL+HN
Subjt:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK-QGYQPRT------------------------------------PVKVQCRMTTACCLIHN

Query:  LIRREMPVDPLEQEVGDTHSNMDDNEPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVK
        LI REM           T+ +++DN  S   SSR  K  W+K E+  LV    EL N G W++DNGTF+PG+L Q+ + +  KIP C++ A   I+SR+K
Subjt:  LIRREMPVDPLEQEVGDTHSNMDDNEPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVK

Query:  ILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW
        ++K+ ++A+AEM GPNCSGFGWND  KCI AEK +FD+W
Subjt:  ILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW

A0A7J0DWA5 Uncharacterized protein3.5e-9238.87Show/hide
Query:  VLNSVLHLHELLLKQPEPVHSNCLDE--KWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVS
        VL+SVL L  +LLK PEP+ +NC DE  +W+WF+NCLGALDGTY+KV V  +D+PRYRTRK +IATNVL VCSQDMQFIYVLPG EGSASDSRVLRDA++
Subjt:  VLNSVLHLHELLLKQPEPVHSNCLDE--KWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVS

Query:  RRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHN
        R+NGL++P G YYL DAGYTNGEGFLAPYRGQRYHL+ W+ G  P T                                    P+K Q R+  ACCL+HN
Subjt:  RRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRT------------------------------------PVKVQCRMTTACCLIHN

Query:  LIRREMPVDPLEQEVGDTHSN--------MDDNEPS---------------------------KAGSSRKRMWSKAEDEKLVECLLELSNIGT-WKADNG
        LI+REMPVDP E  + +   +        +D  EPS                            A SS +R W+K E+E L+ C+ +L +  T WK D G
Subjt:  LIRREMPVDPLEQEVGDTHSN--------MDDNEPS---------------------------KAGSSRKRMWSKAEDEKLVECLLELSNIGT-WKADNG

Query:  TFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCI------------EAEKHI----------FDEWV----
         FK GF  + EK I    P  DL+A PHIES++K+ ++QY+ + +ML    SGFGW+D +K I              EK +          +++W+    
Subjt:  TFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCI------------EAEKHI----------FDEWV----

Query:  --KGQWTWCRRPTDMFEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENI
          +      + PTD   A+E + T  E         ++V +    +  SMS A   T S   +S  +SKKR ++ + +   +  + +       N N  +
Subjt:  --KGQWTWCRRPTDMFEAVEREMTDNEFWRGDSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENI

Query:  QEIALFYRQVAERESTREERR-NSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNLPADESI
         EIA  YR     + +++ R+ N+ +S++     L   QR+RA  +I +D  ++D FF+L  +E +
Subjt:  QEIALFYRQVAERESTREERR-NSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNLPADESI

A0A803PDI8 Uncharacterized protein1.5e-9038.92Show/hide
Query:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR
        MVLN++LHLH +LLK+P  +  +C DE+WKWFKNCLGALDGTYIKVN   LDRPRYRT KN+IATNVLGV SQDM+FIYVLPG +GSA+D RVLRDA++ 
Subjt:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR

Query:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDW-------KQGYQPR---------------------------TPVKVQCRMTTACCLIHNLIR
        RN  K+P+G YYL DAGY NGE FL PYRG RYHLNDW       ++ +  R                            PVK+QCR+  ACC +HNLIR
Subjt:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDW-------KQGYQPR---------------------------TPVKVQCRMTTACCLIHNLIR

Query:  REMPVDPLEQEVGDTHSNMDDN------------EPSKAGSS--------------------------------RKRMWSKAEDEKLVECLLELSNIGTW
         EM +DPLE    D  ++ D++            EPS A ++                                +K  W+  ED KLVECL+++ NIG W
Subjt:  REMPVDPLEQEVGDTHSNMDDN------------EPSKAGSS--------------------------------RKRMWSKAEDEKLVECLLELSNIGTW

Query:  KADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEWVKGQWTWCRRPTDMFEAV----
        KA+N                         AQPHI SR+KILK+QY  I+ MLGP+ SGFGW++  KC+ A+K +FD+WVK   T       +F       
Subjt:  KADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEWVKGQWTWCRRPTDMFEAV----

Query:  ----EREMTDNEFWRGDSSYVAIV------GREEVDERTSMSE----APMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIAL
            +   T +   R   +   I         ++ D   ++ E    A MN+   +  ++R +K+++ S DP V  ++      S+  ++A+++I+++A 
Subjt:  ----EREMTDNEFWRGDSSYVAIV------GREEVDERTSMSE----APMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIAL

Query:  FYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRA
              + E+    RR +L  EI+KVDGL++R  +++
Subjt:  FYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRA

A0A803QNC5 Uncharacterized protein1.5e-11146.33Show/hide
Query:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR
        MVLN++LHLH+LLLK+P  +  +C+DE+WKWFKNCLGALDGTYIKVNV   +RPRYRTRKNEIATNVLGV SQDMQFIYVLPG EGSA+DSRVLRDA+  
Subjt:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR

Query:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRTPVK-VQCRMTTACCLIHN---------LIRREMPVDPLEQE----VGDTHSNMD
        RNG K+P+G YYL DAGY NGEGFL PYRGQRYHLNDW   + P +P +    R ++A  ++            I R     P++ +    +GD    M+
Subjt:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRTPVK-VQCRMTTACCLIHN---------LIRREMPVDPLEQE----VGDTHSNMD

Query:  DNEPSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDR
            S     RK  W+  +D KLVECL+++ N G WKADNGTFKPG+L Q+EK + ++IP   +KAQPHI+SR+KILK+QY AI++MLGP+ SGFGWN++
Subjt:  DNEPSKAGSSRKRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDR

Query:  DKCIEAEKHIFDEWVKGQWT---WCRRPTDMFEAVE----REMTDNEFWRGDSSYVAIVGRE-------EVDERTSMSE----APMNTQSTAHTSSRSSK
         KC+ A+K +FDEWVK   T      +P   ++ +     ++    +   G S  +  +  E       + D    + E    A MN+   +  ++R +K
Subjt:  DKCIEAEKHIFDEWVKGQWT---WCRRPTDMFEAVE----REMTDNEFWRGDSSYVAIVGRE-------EVDERTSMSE----APMNTQSTAHTSSRSSK

Query:  KRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNL
        +++ + DPLV  ++      S+  ++A+++I+++A       + E+    RR  L  EI+KVDGL+  QR++ G+L+  +Q  IDYFF L
Subjt:  KRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVRAGRLITKDQSQIDYFFNL

E5GCB5 Retrotransposon protein1.4e-9350.69Show/hide
Query:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR
        MVL +V+ LHE LLK+P+PV + C D++W+WF+NCLGALDGTYIKVNV   DR RYRTRK E+ATNVLGVC     F+YVL G EGSA+DSR+LRDA+SR
Subjt:  MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSR

Query:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK---------------QGYQPRT----------------------PVKVQCRMTTACCLIHN
         N LK+PKG YYL D GY N EGFLAPYRGQRYHL +W+               + Y  R                       PV+VQCR   ACCL+HN
Subjt:  RNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK---------------QGYQPRT----------------------PVKVQCRMTTACCLIHN

Query:  LIRREMPVDPLE---QEVGDTH-----------------SNMDDN--EPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEK
        LI REM    +E    EV  TH                 S   DN  E     SSR  K  W+K E+  LVECL+EL N G W++DNGTF+PG+L Q+ +
Subjt:  LIRREMPVDPLE---QEVGDTH-----------------SNMDDN--EPSKAGSSR--KRMWSKAEDEKLVECLLELSNIGTWKADNGTFKPGFLIQIEK

Query:  WIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW
         +  KIP  ++ A   I+SR+K++K+ ++A+AEM GPNCSGFGWND  KCI AEK +FD+W
Subjt:  WIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein5.3e-1634.96Show/hide
Query:  WKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKG-CYYLYDAGYTNGEGFLAP
        W +F   +GA+DGT++ V V    +  Y  R +  + N++ +C   M F Y+  G  GS  D+ VL+ A    +   +P    YYL D+GY N +G LAP
Subjt:  WKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKG-CYYLYDAGYTNGEGFLAP

Query:  YRGQ-----RYHLNDWKQGYQPR
        YR       RYH++ +  G +PR
Subjt:  YRGQ-----RYHLNDWKQGYQPR

AT5G28730.1 unknown protein7.4e-1043.84Show/hide
Query:  NVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGC-YYLYDAGYTNGEGFLAPYRGQRYHLND
        NVL +C  DM F Y   G  GS  D+RVL  A+S      +P    YYL D+GY N  G+LAPYR +     D
Subjt:  NVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGC-YYLYDAGYTNGEGFLAPYRGQRYHLND

AT5G28950.1 unknown protein1.8e-1637.07Show/hide
Query:  WKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRR-NGLKIP---KGCYYLYDAGYTNGEGF
        + +FK+C+GA+D T+I   V     P +R RK +I+ N+L  C+ D++F+YVL G EGSA DS+VL DA++R  N L +P   +    + +    N +  
Subjt:  WKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRR-NGLKIP---KGCYYLYDAGYTNGEGF

Query:  LAPYRGQRYHLNDWKQ
        L     QR + N W++
Subjt:  LAPYRGQRYHLNDWKQ

AT5G35695.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)8.8e-1129.41Show/hide
Query:  FIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK-QGYQPRTP----------------------------
        FIYVL G EGSA DSRVL DA+ +          +YL D G+ N   FLAP+RG RYHL ++  Q   P TP                            
Subjt:  FIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGCYYLYDAGYTNGEGFLAPYRGQRYHLNDWK-QGYQPRTP----------------------------

Query:  --------VKVQCRMTTACCLIHNLIRREMPVDPLE--QEVGD------------THSNMDDNEP---SKAGSSRKRMWSKAEDEKL
                 K Q  +   C  +HN +R+E   D  +   EVG+              + +D+ EP    K       MW K+  E +
Subjt:  --------VKVQCRMTTACCLIHNLIRREMPVDPLE--QEVGD------------THSNMDDNEP---SKAGSSRKRMWSKAEDEKL

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)2.4e-2434.17Show/hide
Query:  VLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRR
        VLN+V+ + +    QP   +S+ L+    +FK+C+G +D  +I V VG+ ++  +R     +  NVL   S D++F YVL G EGSASD +VL  A++RR
Subjt:  VLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRR

Query:  NGLKIPKGCYYLYDAGYTNGEGFLAPYRG--------------QRYHL---------NDWKQGY-----QPRTPVKVQCRMTTACCLIHNLIRREMPVD
        N L++P+G YY+ D  Y N  GF+APY G              +R+ L            K+ +      P  P++ Q ++  A C +HN +R E P D
Subjt:  NGLKIPKGCYYLYDAGYTNGEGFLAPYRG--------------QRYHL---------NDWKQGY-----QPRTPVKVQCRMTTACCLIHNLIRREMPVD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCTTAACTCTGTGTTACATTTACATGAGTTATTACTTAAACAACCTGAGCCAGTTCATTCCAACTGCTTGGACGAAAAGTGGAAGTGGTTTAAGAATTGTCTAGG
TGCATTAGATGGAACCTACATTAAAGTCAACGTAGGTATTCTTGATAGACCTAGGTACCGAACAAGGAAGAATGAAATTGCTACCAATGTGTTAGGAGTTTGCTCCCAAG
ATATGCAATTCATCTATGTTTTACCTGGATGTGAAGGTTCGGCTTCTGACTCGAGAGTTTTGCGAGATGCTGTATCTAGGAGGAATGGATTAAAAATTCCAAAAGGTTGT
TACTATCTATATGATGCTGGCTATACAAATGGTGAAGGATTTTTGGCACCTTACCGAGGACAACGATATCATTTAAATGATTGGAAGCAAGGATATCAACCAAGAACTCC
AGTAAAAGTTCAATGTCGAATGACCACCGCTTGTTGCCTCATTCATAATCTTATAAGAAGAGAAATGCCTGTAGATCCTTTAGAACAAGAAGTTGGAGACACTCATTCAA
ATATGGATGACAACGAACCATCTAAAGCTGGTTCAAGTAGGAAACGCATGTGGAGTAAGGCCGAAGATGAGAAATTAGTGGAATGTTTACTGGAGCTATCTAATATTGGC
ACTTGGAAAGCTGACAATGGTACTTTTAAACCAGGATTTCTCATTCAAATAGAAAAATGGATAGTTGAAAAAATTCCTATGTGCGATCTTAAGGCTCAACCACACATAGA
GTCTAGAGTTAAAATATTGAAGAAGCAATACAATGCAATAGCTGAAATGTTAGGCCCAAATTGTAGTGGCTTTGGATGGAATGATAGAGACAAGTGCATAGAAGCAGAAA
AACATATATTTGATGAATGGGTGAAGGGCCAATGGACTTGGTGCAGAAGGCCAACTGATATGTTTGAAGCAGTGGAACGAGAAATGACCGACAATGAATTTTGGAGAGGA
GACAGCTCTTATGTGGCAATAGTTGGAAGGGAAGAAGTGGATGAACGAACCTCAATGAGTGAAGCACCAATGAATACACAATCTACTGCACATACATCAAGTAGGTCTAG
TAAGAAAAGAGCAAAGAGTGTGGACCCATTGGTAGCGGCAGTGAATGGACTTGAGAATGTTATGAGCAGTCATCTTTCAAATGCTAATGAAAATATTCAAGAGATTGCTT
TGTTTTATCGACAAGTGGCTGAACGAGAATCTACAAGAGAGGAACGTCGAAACTCATTAGTTAGTGAAATTAGAAAGGTGGATGGATTGAGTGTACGACAAAGAGTTCGA
GCTGGTAGGCTTATCACCAAAGATCAATCCCAGATTGATTACTTTTTTAATCTTCCAGCTGATGAAAGCATAGCCACAACCCCTCGTTCTTCTCCCTCGCCGTCTCTCTC
GAATCCCTCTGTTCTCGAATCTCGCCCCTCGTTCGACTCCCTCACCGTTACTTTTAACTCGCCGTCGCGCCTCCGTTCTCCACCATCGTCTGTTCTCCATCGCCGTCTGT
TCTCCACCGCCGTCTCTAACTCGCCGCCGTCGAGATTCCGTTATTACTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCTTAACTCTGTGTTACATTTACATGAGTTATTACTTAAACAACCTGAGCCAGTTCATTCCAACTGCTTGGACGAAAAGTGGAAGTGGTTTAAGAATTGTCTAGG
TGCATTAGATGGAACCTACATTAAAGTCAACGTAGGTATTCTTGATAGACCTAGGTACCGAACAAGGAAGAATGAAATTGCTACCAATGTGTTAGGAGTTTGCTCCCAAG
ATATGCAATTCATCTATGTTTTACCTGGATGTGAAGGTTCGGCTTCTGACTCGAGAGTTTTGCGAGATGCTGTATCTAGGAGGAATGGATTAAAAATTCCAAAAGGTTGT
TACTATCTATATGATGCTGGCTATACAAATGGTGAAGGATTTTTGGCACCTTACCGAGGACAACGATATCATTTAAATGATTGGAAGCAAGGATATCAACCAAGAACTCC
AGTAAAAGTTCAATGTCGAATGACCACCGCTTGTTGCCTCATTCATAATCTTATAAGAAGAGAAATGCCTGTAGATCCTTTAGAACAAGAAGTTGGAGACACTCATTCAA
ATATGGATGACAACGAACCATCTAAAGCTGGTTCAAGTAGGAAACGCATGTGGAGTAAGGCCGAAGATGAGAAATTAGTGGAATGTTTACTGGAGCTATCTAATATTGGC
ACTTGGAAAGCTGACAATGGTACTTTTAAACCAGGATTTCTCATTCAAATAGAAAAATGGATAGTTGAAAAAATTCCTATGTGCGATCTTAAGGCTCAACCACACATAGA
GTCTAGAGTTAAAATATTGAAGAAGCAATACAATGCAATAGCTGAAATGTTAGGCCCAAATTGTAGTGGCTTTGGATGGAATGATAGAGACAAGTGCATAGAAGCAGAAA
AACATATATTTGATGAATGGGTGAAGGGCCAATGGACTTGGTGCAGAAGGCCAACTGATATGTTTGAAGCAGTGGAACGAGAAATGACCGACAATGAATTTTGGAGAGGA
GACAGCTCTTATGTGGCAATAGTTGGAAGGGAAGAAGTGGATGAACGAACCTCAATGAGTGAAGCACCAATGAATACACAATCTACTGCACATACATCAAGTAGGTCTAG
TAAGAAAAGAGCAAAGAGTGTGGACCCATTGGTAGCGGCAGTGAATGGACTTGAGAATGTTATGAGCAGTCATCTTTCAAATGCTAATGAAAATATTCAAGAGATTGCTT
TGTTTTATCGACAAGTGGCTGAACGAGAATCTACAAGAGAGGAACGTCGAAACTCATTAGTTAGTGAAATTAGAAAGGTGGATGGATTGAGTGTACGACAAAGAGTTCGA
GCTGGTAGGCTTATCACCAAAGATCAATCCCAGATTGATTACTTTTTTAATCTTCCAGCTGATGAAAGCATAGCCACAACCCCTCGTTCTTCTCCCTCGCCGTCTCTCTC
GAATCCCTCTGTTCTCGAATCTCGCCCCTCGTTCGACTCCCTCACCGTTACTTTTAACTCGCCGTCGCGCCTCCGTTCTCCACCATCGTCTGTTCTCCATCGCCGTCTGT
TCTCCACCGCCGTCTCTAACTCGCCGCCGTCGAGATTCCGTTATTACTAA
Protein sequenceShow/hide protein sequence
MVLNSVLHLHELLLKQPEPVHSNCLDEKWKWFKNCLGALDGTYIKVNVGILDRPRYRTRKNEIATNVLGVCSQDMQFIYVLPGCEGSASDSRVLRDAVSRRNGLKIPKGC
YYLYDAGYTNGEGFLAPYRGQRYHLNDWKQGYQPRTPVKVQCRMTTACCLIHNLIRREMPVDPLEQEVGDTHSNMDDNEPSKAGSSRKRMWSKAEDEKLVECLLELSNIG
TWKADNGTFKPGFLIQIEKWIVEKIPMCDLKAQPHIESRVKILKKQYNAIAEMLGPNCSGFGWNDRDKCIEAEKHIFDEWVKGQWTWCRRPTDMFEAVEREMTDNEFWRG
DSSYVAIVGREEVDERTSMSEAPMNTQSTAHTSSRSSKKRAKSVDPLVAAVNGLENVMSSHLSNANENIQEIALFYRQVAERESTREERRNSLVSEIRKVDGLSVRQRVR
AGRLITKDQSQIDYFFNLPADESIATTPRSSPSPSLSNPSVLESRPSFDSLTVTFNSPSRLRSPPSSVLHRRLFSTAVSNSPPSRFRYY