CuGenDBv2

Spg030819 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030819
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon protein
Genome location: scaffold11:30331732..30338752
RNA-Seq Expression: Spg030819
Synteny: Spg030819
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain


Homology
GenBank top hits | e value | % identity | Alignment
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo] | e value: 6.7e-64 | % identity: 30.27
Query:  LIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITAS
        +IHESDL CR+STRMDRR FAILC LLR   GL   EI+DVEEMVAMFLH++AHDVKNRVI+++F RSGET+SRHFN  L  V+RLY+ L+K+P P+T++
Subjt:  LIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITAS

Query:  CQDGR-----------------------------------------------------------------------------------------------
        C D R                                                                                               
Subjt:  CQDGR-----------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------LMAGADKHPKHI------------------------------------
                                                            ++ G   +P  +                                    
Subjt:  ----------------------------------------------------LMAGADKHPKHI------------------------------------

Query:  -WTRQEEARLVESL---------------------------VELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISE
          T  E+ + +E+                            +ELV  GGW+ DNGTFRPGYLA+L RM+ EK+S C + +T++IDC++++LKR +  I+E
Subjt:  -WTRQEEARLVESL---------------------------VELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISE

Query:  ILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGD-----DEGDQNVQADQDCYVPAPP
        +LG  CSGFGWNDE KCI AEKE++D WV+S  +AKGLLN PF  Y++L +VFG+D+A+G      A+   +  G      D GD N           PP
Subjt:  ILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGD-----DEGDQNVQADQDCYVPAPP

Query:  DINLAADMEFDDVPITPTSRPS---TVGSSQSRKR-SRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQQELSNDERVALM
          +   D+  DDV  +  SR S   T  S   RKR S+  ++VEA+ +   ++     Q  +IA+WP    A +   R     +L    EL++ +R  L 
Subjt:  DINLAADMEFDDVPITPTSRPS---TVGSSQSRKR-SRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQQELSNDERVALM

Query:  HILFSDPKMTNMMLSVPPTLRLRFLRGLLNE
          L S        + +P   R  F R LL +
Subjt:  HILFSDPKMTNMMLSVPPTLRLRFLRGLLNE

KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa] | e value: 5.7e-71 | % identity: 45.59
Query:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR
        +LRT  GL   + +DVEEMV +FLHIVAHDVKNRV RR FARSGET+SRHFN  L+VV+RL+++LLK+P  +T SC   +     + +   K  KH WT 
Subjt:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS
         E+  LVE L++LV +G WR DNGTF+PGYL ++++++KEK+    I  T  ++  V+ LK+QY+TI+E++G  CSGF WN E KCI+AEK V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS

Query:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS
        H +A+ LLNKPF  + DL  VFG+D+A+GG C  P E    T  D E D  +   +D  +P P  +   +    +D+P TPTS     GS +  K+ R S
Subjt:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS

Query:  YEVEALDIMR--QSVAMQETQFTKIADWP
        Y  + +D  R  +S+    T      D+P
Subjt:  YEVEALDIMR--QSVAMQETQFTKIADWP

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa] | e value: 1.5e-79 | % identity: 33.79
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHRRIENQTPYLRHQMRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHI
        HEL S+++    SQRQL  ++    N+ +RI +     RH++RQLA FR+IH SDL CR+STRMDRRCFAILC LLRT  GL   E++DVEEMVAMFLHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHRRIENQTPYLRHQMRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHI

Query:  VAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR----------------------------------------------
        +AHDVKNRVI+R+F RSGETISRHFN  L  VIRL+D LLKKP+P+   C D R                                              
Subjt:  VAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------LMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMS
                                                   M  + + PKH WT++EEA     LVELV+ GGWR DNGTFRPGYL +L RM+  K+ 
Subjt:  ------------------------------------------LMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMS

Query:  SCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTH
         C I + S ID +++ +KR +  ++E+ G  CSGFGWNDE KCI AEKEV+D W  SH +AKGLLNK F  Y++L++VFGKD+A+GG        AD   
Subjt:  SCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTH

Query:  GDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGS-SQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIG
         +  G     AD        P  +L  +M  DD+  T T+R S   + S   KR R  +  ++ DI+R ++     Q  +IA+WP  Q     + R  I 
Subjt:  GDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGS-SQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIG

Query:  EMLLAQQELSNDERVALMHILFSDPKMTNMMLSVPPTLRLRFLRGLLNERR
        + L A  EL+  +R  LM IL  +       L VP  ++  +   +L E R
Subjt:  EMLLAQQELSNDERVALMHILFSDPKMTNMMLSVPPTLRLRFLRGLLNERR

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa] | e value: 5.9e-68 | % identity: 38.23
Query:  MDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQ-DG---------
        MDRRCF ILC++LRT  GL   + +DV+EMV +FLHIVAHDVKNRV RR  ARSGET+SRHFNA L+ V+RL+++LLK+P P+T SC  DG         
Subjt:  MDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQ-DG---------

Query:  ------------------------------------RLMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTID
                                             + +   K  KH WT  E+  LVE L++LV EGGWR DNGTF+ GYL                 
Subjt:  ------------------------------------RLMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTID

Query:  STSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEG
                     +QY+ I+E++G  CSGFGWN+  KCI+ EK V+D WVK H +A+GLLNKPF  + DL  VFG+D+A+GG C  P E +  T  D E 
Subjt:  STSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEG

Query:  DQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQ
        D      +D  +P P  +   +    +D+P TPTS     GSS+  K+ R SY  + +D  R S+     +  KIA W   +   E      +   L   
Subjt:  DQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQ

Query:  QELSNDERVALMHILFSDPKMTNMMLSVP
          +  D+ + +   L  DP M +  L  P
Subjt:  QELSNDERVALMHILFSDPKMTNMMLSVP

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa] | e value: 1.1e-71 | % identity: 45.9
Query:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR
        +LRT  GL   + +DVEEMV +FLHIVAHDVKNRV RR FARSGET+SRHFN  L+VV+RL+++LLK+P  +T SC   +     + +   K  KH WT 
Subjt:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS
         E+  LVE L++LV +G WR DNGTF+PGYL ++++++KEK+    I  T  ++  V+ LK+QY+TI+E++G  CSGF WN E KCI+AEK V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS

Query:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS
        H +A+ LLNKPF  + DL  VFG+D+A+GG C  P E    T  D E D  +   +D  +P P  +   +    +D+P TPTS     GSS+  K+ R S
Subjt:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS

Query:  YEVEALDIMR--QSVAMQETQFTKIADWP
        Y  + +D  R  +S+    T      D+P
Subjt:  YEVEALDIMR--QSVAMQETQFTKIADWP

TrEMBL top hits | e value | % identity | Alignment
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein | e value: 2.8e-71 | % identity: 45.59
Query:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR
        +LRT  GL   + +DVEEMV +FLHIVAHDVKNRV RR FARSGET+SRHFN  L+VV+RL+++LLK+P  +T SC   +     + +   K  KH WT 
Subjt:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS
         E+  LVE L++LV +G WR DNGTF+PGYL ++++++KEK+    I  T  ++  V+ LK+QY+TI+E++G  CSGF WN E KCI+AEK V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS

Query:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS
        H +A+ LLNKPF  + DL  VFG+D+A+GG C  P E    T  D E D  +   +D  +P P  +   +    +D+P TPTS     GS +  K+ R S
Subjt:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS

Query:  YEVEALDIMR--QSVAMQETQFTKIADWP
        Y  + +D  R  +S+    T      D+P
Subjt:  YEVEALDIMR--QSVAMQETQFTKIADWP

A0A5A7SWD8 Retrotransposon protein | e value: 7.2e-80 | % identity: 33.79
Query:  HELVSVLSIMADSQRQLFNLINSFMNNHRRIENQTPYLRHQMRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHI
        HEL S+++    SQRQL  ++    N+ +RI +     RH++RQLA FR+IH SDL CR+STRMDRRCFAILC LLRT  GL   E++DVEEMVAMFLHI
Subjt:  HELVSVLSIMADSQRQLFNLINSFMNNHRRIENQTPYLRHQMRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHI

Query:  VAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR----------------------------------------------
        +AHDVKNRVI+R+F RSGETISRHFN  L  VIRL+D LLKKP+P+   C D R                                              
Subjt:  VAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------LMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMS
                                                   M  + + PKH WT++EEA     LVELV+ GGWR DNGTFRPGYL +L RM+  K+ 
Subjt:  ------------------------------------------LMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMS

Query:  SCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTH
         C I + S ID +++ +KR +  ++E+ G  CSGFGWNDE KCI AEKEV+D W  SH +AKGLLNK F  Y++L++VFGKD+A+GG        AD   
Subjt:  SCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTH

Query:  GDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGS-SQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIG
         +  G     AD        P  +L  +M  DD+  T T+R S   + S   KR R  +  ++ DI+R ++     Q  +IA+WP  Q     + R  I 
Subjt:  GDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGS-SQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIG

Query:  EMLLAQQELSNDERVALMHILFSDPKMTNMMLSVPPTLRLRFLRGLLNERR
        + L A  EL+  +R  LM IL  +       L VP  ++  +   +L E R
Subjt:  EMLLAQQELSNDERVALMHILFSDPKMTNMMLSVPPTLRLRFLRGLLNERR

A0A5D3C7T4 Uncharacterized protein | e value: 2.8e-68 | % identity: 38.23
Query:  MDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQ-DG---------
        MDRRCF ILC++LRT  GL   + +DV+EMV +FLHIVAHDVKNRV RR  ARSGET+SRHFNA L+ V+RL+++LLK+P P+T SC  DG         
Subjt:  MDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQ-DG---------

Query:  ------------------------------------RLMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTID
                                             + +   K  KH WT  E+  LVE L++LV EGGWR DNGTF+ GYL                 
Subjt:  ------------------------------------RLMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTID

Query:  STSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEG
                     +QY+ I+E++G  CSGFGWN+  KCI+ EK V+D WVK H +A+GLLNKPF  + DL  VFG+D+A+GG C  P E +  T  D E 
Subjt:  STSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEG

Query:  DQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQ
        D      +D  +P P  +   +    +D+P TPTS     GSS+  K+ R SY  + +D  R S+     +  KIA W   +   E      +   L   
Subjt:  DQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQ

Query:  QELSNDERVALMHILFSDPKMTNMMLSVP
          +  D+ + +   L  DP M +  L  P
Subjt:  QELSNDERVALMHILFSDPKMTNMMLSVP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein | e value: 5.5e-72 | % identity: 45.9
Query:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR
        +LRT  GL   + +DVEEMV +FLHIVAHDVKNRV RR FARSGET+SRHFN  L+VV+RL+++LLK+P  +T SC   +     + +   K  KH WT 
Subjt:  LLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITASCQDGR-----LMAGADKHPKHIWTR

Query:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS
         E+  LVE L++LV +G WR DNGTF+PGYL ++++++KEK+    I  T  ++  V+ LK+QY+TI+E++G  CSGF WN E KCI+AEK V + WVK 
Subjt:  QEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKS

Query:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS
        H +A+ LLNKPF  + DL  VFG+D+A+GG C  P E    T  D E D  +   +D  +P P  +   +    +D+P TPTS     GSS+  K+ R S
Subjt:  HSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPITPTSRPSTVGSSQSRKRSRAS

Query:  YEVEALDIMR--QSVAMQETQFTKIADWP
        Y  + +D  R  +S+    T      D+P
Subjt:  YEVEALDIMR--QSVAMQETQFTKIADWP

E5GBB2 Retrotransposon protein | e value: 3.3e-64 | % identity: 30.27
Query:  LIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITAS
        +IHESDL CR+STRMDRR FAILC LLR   GL   EI+DVEEMVAMFLH++AHDVKNRVI+++F RSGET+SRHFN  L  V+RLY+ L+K+P P+T++
Subjt:  LIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRLYDVLLKKPKPITAS

Query:  CQDGR-----------------------------------------------------------------------------------------------
        C D R                                                                                               
Subjt:  CQDGR-----------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------LMAGADKHPKHI------------------------------------
                                                            ++ G   +P  +                                    
Subjt:  ----------------------------------------------------LMAGADKHPKHI------------------------------------

Query:  -WTRQEEARLVESL---------------------------VELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISE
          T  E+ + +E+                            +ELV  GGW+ DNGTFRPGYLA+L RM+ EK+S C + +T++IDC++++LKR +  I+E
Subjt:  -WTRQEEARLVESL---------------------------VELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISE

Query:  ILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGD-----DEGDQNVQADQDCYVPAPP
        +LG  CSGFGWNDE KCI AEKE++D WV+S  +AKGLLN PF  Y++L +VFG+D+A+G      A+   +  G      D GD N           PP
Subjt:  ILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGD-----DEGDQNVQADQDCYVPAPP

Query:  DINLAADMEFDDVPITPTSRPS---TVGSSQSRKR-SRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQQELSNDERVALM
          +   D+  DDV  +  SR S   T  S   RKR S+  ++VEA+ +   ++     Q  +IA+WP    A +   R     +L    EL++ +R  L 
Subjt:  DINLAADMEFDDVPITPTSRPS---TVGSSQSRKR-SRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQQELSNDERVALM

Query:  HILFSDPKMTNMMLSVPPTLRLRFLRGLLNE
          L S        + +P   R  F R LL +
Subjt:  HILFSDPKMTNMMLSVPPTLRLRFLRGLLNE

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT4G02210.1 unknown protein | e value: 1.6e-07 | % identity: 24.24
Query:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL
        ++L+ +   RG+   G FR      +  +   K  S       ++  + +SL+RQ++ I  I  L   GF W++E + + A+  V+  ++K+H  A+  +
Subjt:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL

Query:  NKPFHQYEDLAFVFGKDKASGGACNVPAEQAD
         +P   Y+DL  + G        C V  +  D
Subjt:  NKPFHQYEDLAFVFGKDKASGGACNVPAEQAD

AT4G02210.2 unknown protein | e value: 1.6e-07 | % identity: 24.24
Query:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL
        ++L+ +   RG+   G FR      +  +   K  S       ++  + +SL+RQ++ I  I  L   GF W++E + + A+  V+  ++K+H  A+  +
Subjt:  VELVHEGGWRGD--NGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISEILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLL

Query:  NKPFHQYEDLAFVFGKDKASGGACNVPAEQAD
         +P   Y+DL  + G        C V  +  D
Subjt:  NKPFHQYEDLAFVFGKDKASGGACNVPAEQAD

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | e value: 2.7e-10 | % identity: 38.64
Query:  FRLIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRL
        +++++  +  C E+ RMD+  F  LC LL+T   L     I +E  +A+FL I+ H+++ R ++  F  SGETISRHFN  L+ VI +
Subjt:  FRLIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNATLSVVIRL


Sequences
CDS sequence
ATGCCGCTTCAACTTGCCGTGGGTGTCGATGATAGGAATCGGTATCCGCCTGGTAAATTGGGGCACGGTGAGGGGTGGGACCAAGGTAAGGGTACGTTCTTCGTTACACT
AACTACACTCTCTTCTTATCTTCTGCCCCTGAAGACAAACCTGAACCCGACTCCACCTTCTTGTGTTTCGTTTCACGCATCTCAATCTCATATAACGGTTTACTCCAAAA
ACACCGTTATATTCGAGGCTGAAGTAGTACCGTTCCTAATGGATTCCATAACCCCACATGAACTTGTGTCCGTACTTTCAATAATGGCTGACTCTCAGCGCCAACTATTC
AACCTGATTAACTCCTTCATGAACAACCACCGTAGGATAGAAAACCAAACTCCCTACCTCAGACACCAGATGAGGCAGTTAGCCTGCTTCCGGTTGATTCATGAAAGTGA
CCTATGCTGTCGAGAAAGCACCAGGATGGATAGGAGATGTTTTGCCATTCTATGTAGTCTGTTGAGAACGACTTTCGGGTTGGTAGGAATGGAAATCATAGACGTCGAAG
AGATGGTCGCGATGTTCTTGCACATCGTTGCTCACGATGTTAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCGGGTGAAACCATTTCTCGGCACTTCAACGCGACT
TTGAGTGTAGTAATACGATTGTACGACGTCCTACTTAAGAAACCGAAACCGATCACGGCTTCTTGCCAAGATGGGAGACTAATGGCAGGTGCAGATAAACACCCTAAACA
CATCTGGACGAGGCAAGAGGAGGCAAGATTGGTGGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACGGGACCTTTAGGCCCGGATACCTCGCCC
GACTGAAGCGGATGATAAAAGAGAAAATGTCGTCCTGCACCATAGATTCAACGTCCATAATAGACTGCAAGGTGCGGTCCTTGAAACGGCAATACAGTACCATCTCGGAG
ATATTGGGTCTGGGCTGTAGTGGATTCGGTTGGAATGACGAGTTTAAATGCATCCAGGCTGAGAAGGAGGTCTACGATGCATGGGTGAAGTCACACTCCAGCGCAAAAGG
ACTGCTGAACAAGCCTTTTCATCAATATGAGGATCTTGCTTTCGTGTTCGGCAAAGACAAGGCGAGTGGCGGCGCGTGCAATGTTCCAGCGGAACAGGCCGACAGCACCC
ACGGGGACGATGAGGGTGATCAGAATGTCCAAGCTGACCAGGATTGTTATGTCCCCGCTCCTCCCGACATTAATCTGGCCGCAGACATGGAGTTCGATGACGTCCCAATC
ACACCGACAAGTCGACCAAGCACTGTAGGGTCCTCCCAGAGTCGGAAGCGGAGCAGAGCATCATATGAAGTGGAAGCCCTTGATATTATGAGGCAGTCAGTGGCTATGCA
GGAGACACAGTTCACTAAGATCGCTGACTGGCCGGACGCCCAAGACGCACGAGAGTTCAAGAGGCGGGACACGATCGGAGAGATGCTCCTGGCGCAGCAGGAGCTATCGA
ACGATGAGAGAGTTGCTCTTATGCACATCCTCTTCTCCGACCCGAAGATGACAAATATGATGCTGTCCGTGCCACCGACCCTCAGACTTCGCTTTCTACGAGGACTACTC
AACGAACGTCGGTGA
mRNA sequence
ATGCCGCTTCAACTTGCCGTGGGTGTCGATGATAGGAATCGGTATCCGCCTGGTAAATTGGGGCACGGTGAGGGGTGGGACCAAGGTAAGGGTACGTTCTTCGTTACACT
AACTACACTCTCTTCTTATCTTCTGCCCCTGAAGACAAACCTGAACCCGACTCCACCTTCTTGTGTTTCGTTTCACGCATCTCAATCTCATATAACGGTTTACTCCAAAA
ACACCGTTATATTCGAGGCTGAAGTAGTACCGTTCCTAATGGATTCCATAACCCCACATGAACTTGTGTCCGTACTTTCAATAATGGCTGACTCTCAGCGCCAACTATTC
AACCTGATTAACTCCTTCATGAACAACCACCGTAGGATAGAAAACCAAACTCCCTACCTCAGACACCAGATGAGGCAGTTAGCCTGCTTCCGGTTGATTCATGAAAGTGA
CCTATGCTGTCGAGAAAGCACCAGGATGGATAGGAGATGTTTTGCCATTCTATGTAGTCTGTTGAGAACGACTTTCGGGTTGGTAGGAATGGAAATCATAGACGTCGAAG
AGATGGTCGCGATGTTCTTGCACATCGTTGCTCACGATGTTAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCGGGTGAAACCATTTCTCGGCACTTCAACGCGACT
TTGAGTGTAGTAATACGATTGTACGACGTCCTACTTAAGAAACCGAAACCGATCACGGCTTCTTGCCAAGATGGGAGACTAATGGCAGGTGCAGATAAACACCCTAAACA
CATCTGGACGAGGCAAGAGGAGGCAAGATTGGTGGAATCCCTCGTGGAGCTCGTCCACGAAGGTGGATGGAGAGGGGACAACGGGACCTTTAGGCCCGGATACCTCGCCC
GACTGAAGCGGATGATAAAAGAGAAAATGTCGTCCTGCACCATAGATTCAACGTCCATAATAGACTGCAAGGTGCGGTCCTTGAAACGGCAATACAGTACCATCTCGGAG
ATATTGGGTCTGGGCTGTAGTGGATTCGGTTGGAATGACGAGTTTAAATGCATCCAGGCTGAGAAGGAGGTCTACGATGCATGGGTGAAGTCACACTCCAGCGCAAAAGG
ACTGCTGAACAAGCCTTTTCATCAATATGAGGATCTTGCTTTCGTGTTCGGCAAAGACAAGGCGAGTGGCGGCGCGTGCAATGTTCCAGCGGAACAGGCCGACAGCACCC
ACGGGGACGATGAGGGTGATCAGAATGTCCAAGCTGACCAGGATTGTTATGTCCCCGCTCCTCCCGACATTAATCTGGCCGCAGACATGGAGTTCGATGACGTCCCAATC
ACACCGACAAGTCGACCAAGCACTGTAGGGTCCTCCCAGAGTCGGAAGCGGAGCAGAGCATCATATGAAGTGGAAGCCCTTGATATTATGAGGCAGTCAGTGGCTATGCA
GGAGACACAGTTCACTAAGATCGCTGACTGGCCGGACGCCCAAGACGCACGAGAGTTCAAGAGGCGGGACACGATCGGAGAGATGCTCCTGGCGCAGCAGGAGCTATCGA
ACGATGAGAGAGTTGCTCTTATGCACATCCTCTTCTCCGACCCGAAGATGACAAATATGATGCTGTCCGTGCCACCGACCCTCAGACTTCGCTTTCTACGAGGACTACTC
AACGAACGTCGGTGA
Protein sequence
MPLQLAVGVDDRNRYPPGKLGHGEGWDQGKGTFFVTLTTLSSYLLPLKTNLNPTPPSCVSFHASQSHITVYSKNTVIFEAEVVPFLMDSITPHELVSVLSIMADSQRQLF
NLINSFMNNHRRIENQTPYLRHQMRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTFGLVGMEIIDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETISRHFNAT
LSVVIRLYDVLLKKPKPITASCQDGRLMAGADKHPKHIWTRQEEARLVESLVELVHEGGWRGDNGTFRPGYLARLKRMIKEKMSSCTIDSTSIIDCKVRSLKRQYSTISE
ILGLGCSGFGWNDEFKCIQAEKEVYDAWVKSHSSAKGLLNKPFHQYEDLAFVFGKDKASGGACNVPAEQADSTHGDDEGDQNVQADQDCYVPAPPDINLAADMEFDDVPI
TPTSRPSTVGSSQSRKRSRASYEVEALDIMRQSVAMQETQFTKIADWPDAQDAREFKRRDTIGEMLLAQQELSNDERVALMHILFSDPKMTNMMLSVPPTLRLRFLRGLL
NERR