; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032314 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032314
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon protein
Genome locationscaffold2:33061381..33068940
RNA-Seq ExpressionSpg032314
SyntenySpg032314
Gene Ontology termsNA
InterPro domainsIPR011320 - Ribonuclease H1, N-terminal
IPR024752 - Myb/SANT-like domain
IPR037056 - Ribonuclease H1, N-terminal domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ADN34114.1 retrotransposon protein [Cucumis melo subsp. melo]3.0e-6828.63Show/hide
Query:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM
        ++  +L +I+    A+Q Q ++ ++ +L  D +     P   RH+IRQL +FR+IH                         T   L ST  VDVEEMVAM
Subjt:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM

Query:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------
        FLHI+AHD+K+RVI+R+F+RSGET+SRHFN VL A++RLH  LLK P+PV +  TD RWRWFE                                     
Subjt:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------

Query:  ---------------------------MATP--------------------------------------GP-----------------------------
                                   ++ P                                      GP                             
Subjt:  ---------------------------MATP--------------------------------------GP-----------------------------

Query:  -----------------------------------------------------------------------------------SSNNSKHVWTPEEDAVL
                                                                                           SS   KH WT EE+A L
Subjt:  -----------------------------------------------------------------------------------SSNNSKHVWTPEEDAVL

Query:  VQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKG
        V+CLVELV  GGWR DNGTFRPG+ NQ+ +MM  ++PG NI  S  IDSR++L+KR + A+AEM GP CSGFGWN+E+KCI AE+E+FD W   HP AKG
Subjt:  VQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKG

Query:  LRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDP---TSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGN
        L N+ F  ++EL+ VFGKD A G RA +  +      P  D      +   DF    PP+  P    S ++   T TAR        S + ++R     +
Subjt:  LRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDP---TSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGN

Query:  VAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC
          +++    +   +Q+ +IA WP ++ +   + R+++   L+ IP +++ D   + R L+ +   +  F++ P   K+ YC
Subjt:  VAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC

KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa]2.0e-8854Show/hide
Query:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW
        +LRT G LE+T YVDVEEMV +FLHI+AHD+KNRV RR F RSGETVSRHFN VL+ +LRLH +LLK P+ VT + +  +WRWF+MA+   +S  +KH W
Subjt:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW

Query:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV
        T  ED  LV+CL++LV+ G WR DNGTF+PG+  Q+ K+MKE++   NI V+PN++S V++LK+QY  IAEMMGP CSGF WN+ERKCIEAE+ + + WV
Subjt:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV

Query:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR
        +GH  A+ L N+PFP+F +L +VFG+D A G + +TPVE   ++    + E+DM +  EDF +P+P  L+P SGE+  +TPT+   A  AG  R  ++RR
Subjt:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa]4.3e-8332.4Show/hide
Query:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM
        ++  +L +I+    A+Q Q ++ ++ +L  D +     P   RH+IRQL +FR+IH  DL CR++TRMDRR F +LC LLRT   L ST  VDVEEMVAM
Subjt:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM

Query:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------
        FLHI+AHD+KNRVI+R+F+RSGET+SRHFN VL A++RLH  LLK P+PV +  TD RWRWFE                                     
Subjt:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------

Query:  ---------------------------MATP--------------------------------------GP-----------------------------
                                   ++ P                                      GP                             
Subjt:  ---------------------------MATP--------------------------------------GP-----------------------------

Query:  --------------------------------------------------SSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMK
                                                          SS   KH WT EE+A     LVELV  GGWR DNGTFRPG+ NQ+ +MM 
Subjt:  --------------------------------------------------SSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMK

Query:  ERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFA
         ++PGCNI  S  IDSR++L+KR + A+AEM GP CSGFGWN+E+KCI AE+E+FD W   HP AKGL N+ F  ++EL+ VFGKD A G RA +  +  
Subjt:  ERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFA

Query:  PESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKD
          + P  D      V   DF  P   +    S ++   T TAR        S + ++R     +  +++    +   +Q+ +IA WP ++ +   + R++
Subjt:  PESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKD

Query:  LYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC
        +  +L+ IP +++ D   + R L+ +   +  F++ P   K+ YC
Subjt:  LYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa]5.8e-9645.33Show/hide
Query:  MDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVT-STSTDG---------
        MDRR FT+LC++LRT G LE+T YVDV+EMV +FLHI+AHD+KNRV RR   RSGETVSRHFNAVL+A+LRLH +LLK P+PVT S + DG         
Subjt:  MDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVT-STSTDG---------

Query:  ------RWRWFEMAT-----------------------PGPSSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIV
              R+R  ++ T                          +S  +KH WT  ED VLV+CL++LV+ GGWR DNGTF+ G+                  
Subjt:  ------RWRWFEMAT-----------------------PGPSSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIV

Query:  VSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADL
                     +QY AIAEMMGPACSGFGWNE +KCIE E+ +FD WV+GHP A+GL N+PFP+F +L +VFG+D A G R +TPVE + ++    + 
Subjt:  VSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADL

Query:  ENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIP
        E+DM++  EDF +P+P  L+P SGE+  +TPT+      AG SR  ++RR   G++ +      + T+++I KIA W   + E+E    K LY ELQ IP
Subjt:  ENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIP

Query:  GVSIQDGLVIARALLADPSMLTHFMDFP
        G+ + D L++A +LL DP+ML  F+D+P
Subjt:  GVSIQDGLVIARALLADPSMLTHFMDFP

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa]4.0e-8954.33Show/hide
Query:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW
        +LRT G LE+T YVDVEEMV +FLHI+AHD+KNRV RR F RSGETVSRHFN VL+ +LRLH +LLK P+ VT + +  +WRWF+MA+   +S  +KH W
Subjt:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW

Query:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV
        T  ED  LV+CL++LV+ G WR DNGTF+PG+  Q+ K+MKE++   NI V+PN++S V++LK+QY  IAEMMGP CSGF WN+ERKCIEAE+ + + WV
Subjt:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV

Query:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR
        +GH  A+ L N+PFP+F +L +VFG+D A G + +TPVE   ++    + E+DM +  EDF +P+P  L+P SGE+  +TPT+   A  AG SR  ++RR
Subjt:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR

TrEMBL top hitse value%identityAlignment
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein9.6e-8954Show/hide
Query:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW
        +LRT G LE+T YVDVEEMV +FLHI+AHD+KNRV RR F RSGETVSRHFN VL+ +LRLH +LLK P+ VT + +  +WRWF+MA+   +S  +KH W
Subjt:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW

Query:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV
        T  ED  LV+CL++LV+ G WR DNGTF+PG+  Q+ K+MKE++   NI V+PN++S V++LK+QY  IAEMMGP CSGF WN+ERKCIEAE+ + + WV
Subjt:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV

Query:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR
        +GH  A+ L N+PFP+F +L +VFG+D A G + +TPVE   ++    + E+DM +  EDF +P+P  L+P SGE+  +TPT+   A  AG  R  ++RR
Subjt:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR

A0A5A7SWD8 Retrotransposon protein2.1e-8332.4Show/hide
Query:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM
        ++  +L +I+    A+Q Q ++ ++ +L  D +     P   RH+IRQL +FR+IH  DL CR++TRMDRR F +LC LLRT   L ST  VDVEEMVAM
Subjt:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM

Query:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------
        FLHI+AHD+KNRVI+R+F+RSGET+SRHFN VL A++RLH  LLK P+PV +  TD RWRWFE                                     
Subjt:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------

Query:  ---------------------------MATP--------------------------------------GP-----------------------------
                                   ++ P                                      GP                             
Subjt:  ---------------------------MATP--------------------------------------GP-----------------------------

Query:  --------------------------------------------------SSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMK
                                                          SS   KH WT EE+A     LVELV  GGWR DNGTFRPG+ NQ+ +MM 
Subjt:  --------------------------------------------------SSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMK

Query:  ERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFA
         ++PGCNI  S  IDSR++L+KR + A+AEM GP CSGFGWN+E+KCI AE+E+FD W   HP AKGL N+ F  ++EL+ VFGKD A G RA +  +  
Subjt:  ERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFA

Query:  PESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKD
          + P  D      V   DF  P   +    S ++   T TAR        S + ++R     +  +++    +   +Q+ +IA WP ++ +   + R++
Subjt:  PESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKD

Query:  LYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC
        +  +L+ IP +++ D   + R L+ +   +  F++ P   K+ YC
Subjt:  LYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC

A0A5D3C7T4 Uncharacterized protein2.8e-9645.33Show/hide
Query:  MDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVT-STSTDG---------
        MDRR FT+LC++LRT G LE+T YVDV+EMV +FLHI+AHD+KNRV RR   RSGETVSRHFNAVL+A+LRLH +LLK P+PVT S + DG         
Subjt:  MDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVT-STSTDG---------

Query:  ------RWRWFEMAT-----------------------PGPSSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIV
              R+R  ++ T                          +S  +KH WT  ED VLV+CL++LV+ GGWR DNGTF+ G+                  
Subjt:  ------RWRWFEMAT-----------------------PGPSSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIV

Query:  VSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADL
                     +QY AIAEMMGPACSGFGWNE +KCIE E+ +FD WV+GHP A+GL N+PFP+F +L +VFG+D A G R +TPVE + ++    + 
Subjt:  VSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADL

Query:  ENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIP
        E+DM++  EDF +P+P  L+P SGE+  +TPT+      AG SR  ++RR   G++ +      + T+++I KIA W   + E+E    K LY ELQ IP
Subjt:  ENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIP

Query:  GVSIQDGLVIARALLADPSMLTHFMDFP
        G+ + D L++A +LL DP+ML  F+D+P
Subjt:  GVSIQDGLVIARALLADPSMLTHFMDFP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein1.9e-8954.33Show/hide
Query:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW
        +LRT G LE+T YVDVEEMV +FLHI+AHD+KNRV RR F RSGETVSRHFN VL+ +LRLH +LLK P+ VT + +  +WRWF+MA+   +S  +KH W
Subjt:  LLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVW

Query:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV
        T  ED  LV+CL++LV+ G WR DNGTF+PG+  Q+ K+MKE++   NI V+PN++S V++LK+QY  IAEMMGP CSGF WN+ERKCIEAE+ + + WV
Subjt:  TPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWV

Query:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR
        +GH  A+ L N+PFP+F +L +VFG+D A G + +TPVE   ++    + E+DM +  EDF +P+P  L+P SGE+  +TPT+   A  AG SR  ++RR
Subjt:  EGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRR

E5GCB5 Retrotransposon protein1.4e-6828.63Show/hide
Query:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM
        ++  +L +I+    A+Q Q ++ ++ +L  D +     P   RH+IRQL +FR+IH                         T   L ST  VDVEEMVAM
Subjt:  IEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAM

Query:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------
        FLHI+AHD+K+RVI+R+F+RSGET+SRHFN VL A++RLH  LLK P+PV +  TD RWRWFE                                     
Subjt:  FLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFE-------------------------------------

Query:  ---------------------------MATP--------------------------------------GP-----------------------------
                                   ++ P                                      GP                             
Subjt:  ---------------------------MATP--------------------------------------GP-----------------------------

Query:  -----------------------------------------------------------------------------------SSNNSKHVWTPEEDAVL
                                                                                           SS   KH WT EE+A L
Subjt:  -----------------------------------------------------------------------------------SSNNSKHVWTPEEDAVL

Query:  VQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKG
        V+CLVELV  GGWR DNGTFRPG+ NQ+ +MM  ++PG NI  S  IDSR++L+KR + A+AEM GP CSGFGWN+E+KCI AE+E+FD W   HP AKG
Subjt:  VQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKG

Query:  LRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDP---TSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGN
        L N+ F  ++EL+ VFGKD A G RA +  +      P  D      +   DF    PP+  P    S ++   T TAR        S + ++R     +
Subjt:  LRNRPFPWFNELALVFGKDSARGVRARTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDP---TSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGN

Query:  VAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC
          +++    +   +Q+ +IA WP ++ +   + R+++   L+ IP +++ D   + R L+ +   +  F++ P   K+ YC
Subjt:  VAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQNIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYC

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43722.1 unknown protein6.1e-1133.62Show/hide
Query:  NRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNA
        +R   ++P  +   +   N +R + +D  AC +  RM    FT LC++L+T   L+ T  + +EE VAMFL I  H+   R +  +F R+ ETV R F  
Subjt:  NRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNA

Query:  VLDAILRLHSVLLKSP
        VL A   L    +++P
Subjt:  VLDAILRLHSVLLKSP

AT2G24960.2 unknown protein1.0e-1025.64Show/hide
Query:  ATPGPSSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEER
        A+   +S+ ++  WTP  D  L+  LVE V  G   G   TF     N++      +    +      + +R + L+R Y  I  ++    +GF W+  R
Subjt:  ATPGPSSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEER

Query:  KCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAP
          + A+ +I++ +++ HP+A+  R +  P +  L  +FGK+++ G   R    F P
Subjt:  KCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTPVEFAP

AT5G27260.1 unknown protein8.5e-1327.24Show/hide
Query:  WTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKER--LPGCN--IVVSPNID---SRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAE
        W+PEE  +LVQ LVE +    WR  NGT        I K+  E   +P  N     S N +   SR++ LK QY +  ++     SGFGW+   K   A 
Subjt:  WTPEEDAVLVQCLVELVQVGGWRGDNGTFRPGFHNQIGKMMKER--LPGCN--IVVSPNID---SRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAE

Query:  REIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRA------RTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFST--TPTAR
         E++  +++ HP  K LR   F +F+EL ++FG+  A G  A         + +     P  +  +D +  YE          D T+  E S    P   
Subjt:  REIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRA------RTPVEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFST--TPTAR

Query:  PGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKD
         G +   P   P++R  S  + ++  E+   + + +I  I      R++ E  ++K+
Subjt:  PGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKD

AT5G28730.1 unknown protein1.2e-0635.29Show/hide
Query:  IHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRL
        I+ ++++C+   RM    FT LC +L     L+S+  + ++E VA+FL I A +   R I  +F  + ET+ R F+ VL A+ RL
Subjt:  IHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRL

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912)1.8e-1542.05Show/hide
Query:  FRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRL
        +++++  +  C EN RMD+  F  LC LL+T G L  TN + +E  +A+FL II H+++ R ++  F  SGET+SRHFN VL+A++ +
Subjt:  FRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVEEMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGTCTGTTCGCTTACCCCTTTCGTTTCTTCAGGCGATGGCCACTGTCTACCATCCAATCAGGCGGCGTGGTTGATTTTCGCCGGATTTACGCTTCGATCTCGACC
ACGTTCGTCTGCCTACAATCTTCATGTTCATTATGTGGATGGTTCTGTGATGCCTCTGTTCCTCTGTTCGTGTGCTCTTTTCTTTGCATATTGTGTGTACCTGACATGGT
ACGAATGTTCTAGTCAAGTGAACGGATTTAGTGGGTCAATTTACTGCTCGTACGACACGATCGAGGACGCGAAAGCAGCATATATGACATACATGGAGAACGTCGAGGGT
CGACATTCCACTACGGGAATGATTGAACGATTCCAGTACGAACACATAGAAGCTGAAGACTTGATCGCCATCCTTACTGTGATCTGTGCCACACAGTACCAATTCGTGGT
TTCTTTGATCGGGATATTACATTGTGACAATAGAATTGATAATCAGTCTCCTCCTCACGTACGACATCAAATTAGACAACTAAACTTCTTCAGACTAATACATGAGGATG
ACTTAGCGTGTCGAGAGAACACTCGCATGGATAGGAGAACATTCACTGTCCTATGCTCCCTGTTACGAACGACAGGCAGATTGGAATCAACTAATTATGTCGACGTGGAG
GAAATGGTTGCAATGTTTCTACATATCATTGCCCACGACATGAAGAATAGAGTCATACGACGACAGTTCGTGCGGTCGGGTGAGACCGTATCTAGGCACTTTAACGCTGT
CCTGGATGCGATCCTCCGATTGCACTCGGTACTGCTAAAATCTCCTGAGCCAGTGACTAGCACAAGCACGGACGGGAGGTGGCGGTGGTTTGAGATGGCAACACCAGGAC
CATCTTCCAATAATTCGAAGCATGTCTGGACCCCAGAGGAGGATGCGGTGCTTGTGCAATGTCTGGTAGAATTGGTGCAGGTTGGTGGTTGGCGTGGTGACAATGGCACA
TTCCGACCAGGGTTCCATAACCAAATCGGTAAGATGATGAAAGAACGATTACCGGGATGCAACATCGTCGTCAGTCCGAACATTGATTCGAGGGTGAGGCTGTTGAAAAG
GCAGTACATGGCTATCGCTGAAATGATGGGCCCAGCGTGCAGTGGATTCGGCTGGAACGAAGAACGAAAATGCATCGAGGCGGAAAGGGAAATCTTTGACAAATGGGTCG
AGGGTCACCCACAAGCTAAGGGCCTGCGCAACCGACCATTCCCATGGTTCAATGAACTTGCACTGGTGTTCGGGAAGGATAGTGCACGGGGGGTAAGAGCACGCACACCA
GTTGAATTTGCGCCAGAATCTGAACCTGTTGCGGACTTGGAAAATGACATGAACGTCGAGTACGAAGACTTTTACGTCCCTAGTCCACCTGTTCTTGATCCCACCTCAGG
AGAGGAATTTTCCACTACACCGACAGCAAGACCTGGAGCTGCTGGTGCTGGGCCATCACGAACACCACAGAGGAGAAGACTATCCATGGGAAATGTGGCCGAGGTCTTAG
AAAATGGATTCCAGATGACTGCACAACAGATTGAGAAGATTGCCATGTGGCCCACCATCAGGGAGGAGATGGAACGTAAGCGTCGGAAGGACCTGTATACGGAGTTGCAG
AACATACCAGGAGTTTCGATACAGGACGGGTTAGTCATCGCACGGGCTTTACTGGCTGATCCGAGTATGCTTACACATTTCATGGACTTCCCACCTGAGTGGAAGTTCGA
TTACTGCATGCAGGGTAGCCTCGTGGTGGTGTCCGTTGAAGTATTCAGAGTCGTCAACGGAGGATATGAGGACTGCGTGGACCATGGTGATGCAAAGGAGGCATATGAGG
AAGGACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGTCTGTTCGCTTACCCCTTTCGTTTCTTCAGGCGATGGCCACTGTCTACCATCCAATCAGGCGGCGTGGTTGATTTTCGCCGGATTTACGCTTCGATCTCGACC
ACGTTCGTCTGCCTACAATCTTCATGTTCATTATGTGGATGGTTCTGTGATGCCTCTGTTCCTCTGTTCGTGTGCTCTTTTCTTTGCATATTGTGTGTACCTGACATGGT
ACGAATGTTCTAGTCAAGTGAACGGATTTAGTGGGTCAATTTACTGCTCGTACGACACGATCGAGGACGCGAAAGCAGCATATATGACATACATGGAGAACGTCGAGGGT
CGACATTCCACTACGGGAATGATTGAACGATTCCAGTACGAACACATAGAAGCTGAAGACTTGATCGCCATCCTTACTGTGATCTGTGCCACACAGTACCAATTCGTGGT
TTCTTTGATCGGGATATTACATTGTGACAATAGAATTGATAATCAGTCTCCTCCTCACGTACGACATCAAATTAGACAACTAAACTTCTTCAGACTAATACATGAGGATG
ACTTAGCGTGTCGAGAGAACACTCGCATGGATAGGAGAACATTCACTGTCCTATGCTCCCTGTTACGAACGACAGGCAGATTGGAATCAACTAATTATGTCGACGTGGAG
GAAATGGTTGCAATGTTTCTACATATCATTGCCCACGACATGAAGAATAGAGTCATACGACGACAGTTCGTGCGGTCGGGTGAGACCGTATCTAGGCACTTTAACGCTGT
CCTGGATGCGATCCTCCGATTGCACTCGGTACTGCTAAAATCTCCTGAGCCAGTGACTAGCACAAGCACGGACGGGAGGTGGCGGTGGTTTGAGATGGCAACACCAGGAC
CATCTTCCAATAATTCGAAGCATGTCTGGACCCCAGAGGAGGATGCGGTGCTTGTGCAATGTCTGGTAGAATTGGTGCAGGTTGGTGGTTGGCGTGGTGACAATGGCACA
TTCCGACCAGGGTTCCATAACCAAATCGGTAAGATGATGAAAGAACGATTACCGGGATGCAACATCGTCGTCAGTCCGAACATTGATTCGAGGGTGAGGCTGTTGAAAAG
GCAGTACATGGCTATCGCTGAAATGATGGGCCCAGCGTGCAGTGGATTCGGCTGGAACGAAGAACGAAAATGCATCGAGGCGGAAAGGGAAATCTTTGACAAATGGGTCG
AGGGTCACCCACAAGCTAAGGGCCTGCGCAACCGACCATTCCCATGGTTCAATGAACTTGCACTGGTGTTCGGGAAGGATAGTGCACGGGGGGTAAGAGCACGCACACCA
GTTGAATTTGCGCCAGAATCTGAACCTGTTGCGGACTTGGAAAATGACATGAACGTCGAGTACGAAGACTTTTACGTCCCTAGTCCACCTGTTCTTGATCCCACCTCAGG
AGAGGAATTTTCCACTACACCGACAGCAAGACCTGGAGCTGCTGGTGCTGGGCCATCACGAACACCACAGAGGAGAAGACTATCCATGGGAAATGTGGCCGAGGTCTTAG
AAAATGGATTCCAGATGACTGCACAACAGATTGAGAAGATTGCCATGTGGCCCACCATCAGGGAGGAGATGGAACGTAAGCGTCGGAAGGACCTGTATACGGAGTTGCAG
AACATACCAGGAGTTTCGATACAGGACGGGTTAGTCATCGCACGGGCTTTACTGGCTGATCCGAGTATGCTTACACATTTCATGGACTTCCCACCTGAGTGGAAGTTCGA
TTACTGCATGCAGGGTAGCCTCGTGGTGGTGTCCGTTGAAGTATTCAGAGTCGTCAACGGAGGATATGAGGACTGCGTGGACCATGGTGATGCAAAGGAGGCATATGAGG
AAGGACCTTAG
Protein sequenceShow/hide protein sequence
MAVCSLTPFVSSGDGHCLPSNQAAWLIFAGFTLRSRPRSSAYNLHVHYVDGSVMPLFLCSCALFFAYCVYLTWYECSSQVNGFSGSIYCSYDTIEDAKAAYMTYMENVEG
RHSTTGMIERFQYEHIEAEDLIAILTVICATQYQFVVSLIGILHCDNRIDNQSPPHVRHQIRQLNFFRLIHEDDLACRENTRMDRRTFTVLCSLLRTTGRLESTNYVDVE
EMVAMFLHIIAHDMKNRVIRRQFVRSGETVSRHFNAVLDAILRLHSVLLKSPEPVTSTSTDGRWRWFEMATPGPSSNNSKHVWTPEEDAVLVQCLVELVQVGGWRGDNGT
FRPGFHNQIGKMMKERLPGCNIVVSPNIDSRVRLLKRQYMAIAEMMGPACSGFGWNEERKCIEAEREIFDKWVEGHPQAKGLRNRPFPWFNELALVFGKDSARGVRARTP
VEFAPESEPVADLENDMNVEYEDFYVPSPPVLDPTSGEEFSTTPTARPGAAGAGPSRTPQRRRLSMGNVAEVLENGFQMTAQQIEKIAMWPTIREEMERKRRKDLYTELQ
NIPGVSIQDGLVIARALLADPSMLTHFMDFPPEWKFDYCMQGSLVVVSVEVFRVVNGGYEDCVDHGDAKEAYEEGP