CuGenDBv2

Spg024086 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg024086
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Retrotransposon protein
Genome location: scaffold13:7791626..7794205
RNA-Seq Expression: Spg024086
Synteny: Spg024086
Gene Ontology terms: NA
InterPro domains: IPR024752 - Myb/SANT-like domain
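The genome location field above can be split into a scaffold name and coordinates with a few lines of Python. This is only a sketch: the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re

# Hypothetical helper pattern for "seqid:start..end" strings such as the
# genome location shown on this page.
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split 'scaffold13:7791626..7794205' into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return match.group("seqid"), int(match.group("start")), int(match.group("end"))


def span_length(start, end):
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("scaffold13:7791626..7794205")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 2580 bases on scaffold13.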


Homology
GenBank top hits | e value | % identity | Alignment (shown below each hit)
ADN33754.1 retrotransposon protein [Cucumis melo subsp. melo] | 2.9e-68 | 31.78
Query:  LIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSS
        +IHESDL CR+STRMDRR FAILC LLR  +G+  TEIVDVEEMVAMFLH++AHDVKNRVI+++F RSGETVSRHFN  L AVLRLY+ L+K+P P+TS+
Subjt:  LIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSS

Query:  CQDGR---------------------------------------------------KMAG-----------RDK--------------------------
        C D R                                                    +AG           RD                           
Subjt:  CQDGR---------------------------------------------------KMAG-----------RDK--------------------------

Query:  -------QPKHI--------------------------------------WTM-----------------------------------------------
               Q  H+                                      WT+                                               
Subjt:  -------QPKHI--------------------------------------WTM-----------------------------------------------

Query:  ----QEEAKLVE---------------------------CLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISE
             E+ + +E                           C ++LV  GGW+ DNGTFRPGY A+L+RM+ +K+  C +  T+ ID +++ LKR + AI+E
Subjt:  ----QEEAKLVE---------------------------CLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISE

Query:  MLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLG
        MLGP CSGFGWNDE KCIVAEKE+++ WV+S   A+GLLN PFP+Y+ L +VFG+DRA+G  +E  A+   +  G      ++    E +   P   + G
Subjt:  MLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLG

Query:  ADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQPELSDDERVCLMRILFADPK
         D+  D+V  +  SR S   + S G KR R S     +E +  ++     Q  +IA+WP    A +   R     IL   PEL+  +R  L R L +   
Subjt:  ADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQPELSDDERVCLMRILFADPK

Query:  MTNMMLSVPPTMRLRFLRGLLNE
             + +P   R  F R LL +
Subjt:  MTNMMLSVPPTMRLRFLRGLLNE

KAA0033487.1 uncharacterized protein E6C27_scaffold261G00210 [Cucumis melo var. makuwa] | 5.0e-76 | 47.11
Query:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM
        +LRT  G+  T+ VDVEEMV +FLHIVAHDVKNRV RR FARSGETVSRHFN  L  VLRL+++LLK+P+ +T SC   +    +MA    K  KH WT 
Subjt:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM

Query:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS
         E+  LVECL+ LV +G WR DNGTF+PGY  ++ +++K+K+    I +T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI AEK V N WVK 
Subjt:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS

Query:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS
        H  AR LLNKPFP++  L  VFG+DRA+GG  + P E    T  D E D+ +   +++ +P P  L   +    +++P TPTS    AGS +  K+ R S
Subjt:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS

Query:  YEAEALEIMR--QSVVMQETQFTKIADWP
        Y  + ++  R  +S++   T      D+P
Subjt:  YEAEALEIMR--QSVVMQETQFTKIADWP

KAA0034843.1 retrotransposon protein [Cucumis melo var. makuwa] | 5.6e-88 | 34.55
Query:  HELVSILSIMVDSQRQLFNLISFFMNNHRRLENQSPYIRHQIRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHI
        HEL SI++  + SQRQL  ++    N+ +R+ +     RH+IRQLA FR+IH SDL CR+STRMDRRCFAILC LLRT +G+  TE+VDVEEMVAMFLHI
Subjt:  HELVSILSIMVDSQRQLFNLISFFMNNHRRLENQSPYIRHQIRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHI

Query:  VAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----------------------------------------------
        +AHDVKNRVI+R+F RSGET+SRHFN  L AV+RL+D LLKKP+P+ + C D R                                              
Subjt:  VAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------KMAGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMP
                                                   M    + PKH WT +EEA LVE    LV+ GGWR DNGTFRPGY  +L RM+  K+P
Subjt:  ------------------------------------------KMAGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMP

Query:  SCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQ
         C I   STID +++ +KR + A++EM GP CSGFGWNDE KCIVAEKEV++ W  SH  A+GLLNK F HY+ L++VFGKDRA+GG +E  A       
Subjt:  SCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQ

Query:  GDEEGDNNLQGSQEYYVPGPSD------LNLGADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFK
          + G NN  G   +      D       +LG ++  D++  T T+R S   + S G KR R  +  ++ +I+R ++     Q  +IA+WP  Q     +
Subjt:  GDEEGDNNLQGSQEYYVPGPSD------LNLGADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFK

Query:  RRDTVGEILLGQPELSDDERVCLMRILFADPKMTNMMLSVPPTMRLRFLRGLLNERR
         R  + + L   PEL+  +R  LMRIL  +       L VP  M+  +   +L E R
Subjt:  RRDTVGEILLGQPELSDDERVCLMRILFADPKMTNMMLSVPPTMRLRFLRGLLNERR

TYK07921.1 hypothetical protein E5676_scaffold265G00330 [Cucumis melo var. makuwa] | 1.8e-73 | 39.63
Query:  MDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQ-DGRKM------
        MDRRCF ILC++LRT  G+  T+ VDV+EMV +FLHIVAHDVKNRV RR  ARSGETVSRHFNA L AVLRL+++LLK+P+P+T SC  DG  +      
Subjt:  MDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQ-DGRKM------

Query:  ---------------------------------------AGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTID
                                               +   K  KH WT  E+  LVECL+ LV EGGWR DNGTF+ GY                  
Subjt:  ---------------------------------------AGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTID

Query:  LTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEG
                     +QY+AI+EM+GP CSGFGWN+  KCI  EK V++ WVK H  A+GLLNKPFP++  L  VFG+DRA+GG  + P E +  T  D E 
Subjt:  LTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEG

Query:  DNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQ
        D+     +++ +P P  L   +    +++P TPTS    AGSS+  K+ R SY  + ++  R S+     +  KIA W   +   E      +   L   
Subjt:  DNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQ

Query:  PELSDDERVCLMRILFADPKMTNMMLSVP
        P +  D+ + +   L  DP M +  L  P
Subjt:  PELSDDERVCLMRILFADPKMTNMMLSVP

TYK26842.1 uncharacterized protein E5676_scaffold260G00340 [Cucumis melo var. makuwa] | 1.0e-76 | 47.42
Query:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM
        +LRT  G+  T+ VDVEEMV +FLHIVAHDVKNRV RR FARSGETVSRHFN  L  VLRL+++LLK+P+ +T SC   +    +MA    K  KH WT 
Subjt:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM

Query:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS
         E+  LVECL+ LV +G WR DNGTF+PGY  ++ +++K+K+    I +T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI AEK V N WVK 
Subjt:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS

Query:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS
        H  AR LLNKPFP++  L  VFG+DRA+GG  + P E    T  D E D+ +   +++ +P P  L   +    +++P TPTS    AGSS+  K+ R S
Subjt:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS

Query:  YEAEALEIMR--QSVVMQETQFTKIADWP
        Y  + ++  R  +S++   T      D+P
Subjt:  YEAEALEIMR--QSVVMQETQFTKIADWP

TrEMBL top hits | e value | % identity | Alignment (shown below each hit)
A0A5A7SW62 Myb_DNA-bind_3 domain-containing protein | 2.4e-76 | 47.11
Query:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM
        +LRT  G+  T+ VDVEEMV +FLHIVAHDVKNRV RR FARSGETVSRHFN  L  VLRL+++LLK+P+ +T SC   +    +MA    K  KH WT 
Subjt:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM

Query:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS
         E+  LVECL+ LV +G WR DNGTF+PGY  ++ +++K+K+    I +T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI AEK V N WVK 
Subjt:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS

Query:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS
        H  AR LLNKPFP++  L  VFG+DRA+GG  + P E    T  D E D+ +   +++ +P P  L   +    +++P TPTS    AGS +  K+ R S
Subjt:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS

Query:  YEAEALEIMR--QSVVMQETQFTKIADWP
        Y  + ++  R  +S++   T      D+P
Subjt:  YEAEALEIMR--QSVVMQETQFTKIADWP

A0A5A7SWD8 Retrotransposon protein | 2.7e-88 | 34.55
Query:  HELVSILSIMVDSQRQLFNLISFFMNNHRRLENQSPYIRHQIRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHI
        HEL SI++  + SQRQL  ++    N+ +R+ +     RH+IRQLA FR+IH SDL CR+STRMDRRCFAILC LLRT +G+  TE+VDVEEMVAMFLHI
Subjt:  HELVSILSIMVDSQRQLFNLISFFMNNHRRLENQSPYIRHQIRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHI

Query:  VAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----------------------------------------------
        +AHDVKNRVI+R+F RSGET+SRHFN  L AV+RL+D LLKKP+P+ + C D R                                              
Subjt:  VAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------KMAGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMP
                                                   M    + PKH WT +EEA LVE    LV+ GGWR DNGTFRPGY  +L RM+  K+P
Subjt:  ------------------------------------------KMAGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMP

Query:  SCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQ
         C I   STID +++ +KR + A++EM GP CSGFGWNDE KCIVAEKEV++ W  SH  A+GLLNK F HY+ L++VFGKDRA+GG +E  A       
Subjt:  SCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQ

Query:  GDEEGDNNLQGSQEYYVPGPSD------LNLGADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFK
          + G NN  G   +      D       +LG ++  D++  T T+R S   + S G KR R  +  ++ +I+R ++     Q  +IA+WP  Q     +
Subjt:  GDEEGDNNLQGSQEYYVPGPSD------LNLGADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFK

Query:  RRDTVGEILLGQPELSDDERVCLMRILFADPKMTNMMLSVPPTMRLRFLRGLLNERR
         R  + + L   PEL+  +R  LMRIL  +       L VP  M+  +   +L E R
Subjt:  RRDTVGEILLGQPELSDDERVCLMRILFADPKMTNMMLSVPPTMRLRFLRGLLNERR

A0A5D3C7T4 Uncharacterized protein | 8.5e-74 | 39.63
Query:  MDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQ-DGRKM------
        MDRRCF ILC++LRT  G+  T+ VDV+EMV +FLHIVAHDVKNRV RR  ARSGETVSRHFNA L AVLRL+++LLK+P+P+T SC  DG  +      
Subjt:  MDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQ-DGRKM------

Query:  ---------------------------------------AGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTID
                                               +   K  KH WT  E+  LVECL+ LV EGGWR DNGTF+ GY                  
Subjt:  ---------------------------------------AGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTID

Query:  LTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEG
                     +QY+AI+EM+GP CSGFGWN+  KCI  EK V++ WVK H  A+GLLNKPFP++  L  VFG+DRA+GG  + P E +  T  D E 
Subjt:  LTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEG

Query:  DNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQ
        D+     +++ +P P  L   +    +++P TPTS    AGSS+  K+ R SY  + ++  R S+     +  KIA W   +   E      +   L   
Subjt:  DNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQ

Query:  PELSDDERVCLMRILFADPKMTNMMLSVP
        P +  D+ + +   L  DP M +  L  P
Subjt:  PELSDDERVCLMRILFADPKMTNMMLSVP

A0A5D3DTL0 Myb_DNA-bind_3 domain-containing protein | 4.8e-77 | 47.42
Query:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM
        +LRT  G+  T+ VDVEEMV +FLHIVAHDVKNRV RR FARSGETVSRHFN  L  VLRL+++LLK+P+ +T SC   +    +MA    K  KH WT 
Subjt:  LLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGR----KMAG-RDKQPKHIWTM

Query:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS
         E+  LVECL+ LV +G WR DNGTF+PGY  ++ +++K+K+    I +T  ++  V+ LK+QY+ I+EM+GP CSGF WN E KCI AEK V N WVK 
Subjt:  QEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKS

Query:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS
        H  AR LLNKPFP++  L  VFG+DRA+GG  + P E    T  D E D+ +   +++ +P P  L   +    +++P TPTS    AGSS+  K+ R S
Subjt:  HSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRAS

Query:  YEAEALEIMR--QSVVMQETQFTKIADWP
        Y  + ++  R  +S++   T      D+P
Subjt:  YEAEALEIMR--QSVVMQETQFTKIADWP

E5GBB2 Retrotransposon protein | 1.4e-68 | 31.78
Query:  LIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSS
        +IHESDL CR+STRMDRR FAILC LLR  +G+  TEIVDVEEMVAMFLH++AHDVKNRVI+++F RSGETVSRHFN  L AVLRLY+ L+K+P P+TS+
Subjt:  LIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSS

Query:  CQDGR---------------------------------------------------KMAG-----------RDK--------------------------
        C D R                                                    +AG           RD                           
Subjt:  CQDGR---------------------------------------------------KMAG-----------RDK--------------------------

Query:  -------QPKHI--------------------------------------WTM-----------------------------------------------
               Q  H+                                      WT+                                               
Subjt:  -------QPKHI--------------------------------------WTM-----------------------------------------------

Query:  ----QEEAKLVE---------------------------CLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISE
             E+ + +E                           C ++LV  GGW+ DNGTFRPGY A+L+RM+ +K+  C +  T+ ID +++ LKR + AI+E
Subjt:  ----QEEAKLVE---------------------------CLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISE

Query:  MLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLG
        MLGP CSGFGWNDE KCIVAEKE+++ WV+S   A+GLLN PFP+Y+ L +VFG+DRA+G  +E  A+   +  G      ++    E +   P   + G
Subjt:  MLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLG

Query:  ADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQPELSDDERVCLMRILFADPK
         D+  D+V  +  SR S   + S G KR R S     +E +  ++     Q  +IA+WP    A +   R     IL   PEL+  +R  L R L +   
Subjt:  ADLEFDEVPITPTSRPSTAGS-SQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQPELSDDERVCLMRILFADPK

Query:  MTNMMLSVPPTMRLRFLRGLLNE
             + +P   R  F R LL +
Subjt:  MTNMMLSVPPTMRLRFLRGLLNE

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment (shown below each hit)
AT2G24960.2 unknown protein | 1.1e-09 | 25.76
Query:  WTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAW
        WT   +  L++ LV+ V+ G   G   TF       ++     K  S        +  + + L+R Y+ I  +L    +GF W+     ++A+ +++N +
Subjt:  WTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAW

Query:  VKSHSGARGLLNKPFPHYETLAFVFGKDRASG
        +++H  AR    K  P Y  L F+FGK+ + G
Subjt:  VKSHSGARGLLNKPFPHYETLAFVFGKDRASG

AT4G02210.1 unknown protein | 2.5e-09 | 24.24
Query:  VDLVHEGGWRGD--NGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLL
        +DL+ +   RG+   G FR      ++ +   K  S   D+   +  + ++L+RQ++AI  +L     GF W++E + + A+  V+  ++K+H  AR  +
Subjt:  VDLVHEGGWRGD--NGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLL

Query:  NKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLE
         +P P+Y+ L  + G          V  +  D               QE+   G +DL++ A+ E
Subjt:  NKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLE

AT4G02210.2 unknown protein | 2.5e-09 | 24.24
Query:  VDLVHEGGWRGD--NGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLL
        +DL+ +   RG+   G FR      ++ +   K  S   D+   +  + ++L+RQ++AI  +L     GF W++E + + A+  V+  ++K+H  AR  +
Subjt:  VDLVHEGGWRGD--NGTFRPGYHARLLRMLKDKMPSCTIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLL

Query:  NKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLE
         +P P+Y+ L  + G          V  +  D               QE+   G +DL++ A+ E
Subjt:  NKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGSQEYYVPGPSDLNLGADLE

AT5G28730.1 unknown protein | 2.2e-05 | 31.76
Query:  IHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRL
        I+ +++ C+   RM    F  LC +L    G+  +  + ++E VA+FL I A +   R I  +F  + ET+ R F+  L+A+ RL
Subjt:  IHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRL

AT5G41980.1 CONTAINS InterPro DOMAIN/s: Putative harbinger transposase-derived nuclease (InterPro:IPR006912) | 3.5e-11 | 36.36
Query:  FRLIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRL
        +++++  +  C E+ RMD+  F  LC LL+T   +  T  + +E  +A+FL I+ H+++ R ++  F  SGET+SRHFN  L AV+ +
Subjt:  FRLIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHDVKNRVIRRQFARSGETVSRHFNATLQAVLRL


Sequences
CDS sequence
ATGGATTCGATAAGCCCGCATGAACTTGTCTCCATACTTTCTATAATGGTTGACTCTCAGCGCCAACTATTCAACCTGATTAGCTTCTTCATGAACAACCACCGTAGGTT
AGAAAACCAATCTCCCTACATCCGCCACCAGATAAGGCAATTAGCCTGCTTCCGGTTGATACATGAAAGTGACTTGTGCTGTCGAGAAAGCACCAGGATGGATAGGAGAT
GTTTTGCAATTTTGTGTTCTTTGTTGAGAACGACTTCTGGGATTGTGGGAACGGAAATCGTAGACGTGGAGGAGATGGTCGCGATGTTCTTGCACATTGTAGCTCACGAT
GTCAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCAGGCGAGACCGTTTCTCGACACTTCAACGCGACTTTGCAGGCCGTACTACGATTGTATGACGTTCTACTTAA
GAAACCAGAACCAATCACGTCTTCTTGCCAAGATGGGAGGAAAATGGCAGGTAGAGATAAACAACCGAAGCACATATGGACGATGCAGGAGGAGGCGAAATTGGTGGAAT
GCCTCGTTGACCTTGTCCACGAAGGCGGTTGGAGGGGGGACAACGGAACGTTCAGGCCTGGATATCACGCACGACTGTTGCGTATGTTGAAGGATAAAATGCCGTCATGC
ACAATAGACTTAACTTCAACAATAGACGGCAAGGTGCGGGCATTAAAACGGCAGTATAGTGCGATATCTGAGATGCTGGGTCCGGGTTGCAGTGGGTTCGGGTGGAATGA
CGAATTCAAATGCATCGTGGCTGAGAAAGAAGTGTACAATGCATGGGTGAAGTCACACTCCGGTGCCAGGGGATTGCTCAATAAGCCATTTCCTCACTACGAGACGCTCG
CTTTCGTGTTCGGCAAAGATCGGGCAAGTGGCGGCGGTTCCGAAGTTCCAGCGGAACAGACAGACAGCACCCAGGGAGACGAAGAGGGCGATAACAATTTACAGGGGTCA
CAGGAGTACTATGTCCCCGGACCGTCGGACCTTAATCTTGGTGCAGACCTGGAGTTCGACGAAGTCCCAATCACACCCACGAGTCGACCGAGCACAGCGGGGTCATCACA
GGGACGGAAAAGGAGCAGAGCATCGTATGAAGCTGAAGCCCTGGAAATTATGCGACAGTCAGTGGTGATGCAGGAGACACAATTCACCAAGATCGCTGACTGGCCGGACA
CTCAGGATGCCCGGGAATTCAAACGGAGGGACACGGTCGGGGAGATTCTCCTGGGACAGCCGGAGTTATCAGACGACGAGAGAGTATGTCTGATGCGCATCCTCTTCGCT
GACCCCAAGATGACGAATATGATGCTATCCGTGCCACCGACCATGAGGCTTCGTTTCCTCCGCGGATTGCTCAATGAACGCCGGTGA
mRNA sequence
ATGGATTCGATAAGCCCGCATGAACTTGTCTCCATACTTTCTATAATGGTTGACTCTCAGCGCCAACTATTCAACCTGATTAGCTTCTTCATGAACAACCACCGTAGGTT
AGAAAACCAATCTCCCTACATCCGCCACCAGATAAGGCAATTAGCCTGCTTCCGGTTGATACATGAAAGTGACTTGTGCTGTCGAGAAAGCACCAGGATGGATAGGAGAT
GTTTTGCAATTTTGTGTTCTTTGTTGAGAACGACTTCTGGGATTGTGGGAACGGAAATCGTAGACGTGGAGGAGATGGTCGCGATGTTCTTGCACATTGTAGCTCACGAT
GTCAAGAATCGAGTCATTAGAAGACAGTTTGCACGGTCAGGCGAGACCGTTTCTCGACACTTCAACGCGACTTTGCAGGCCGTACTACGATTGTATGACGTTCTACTTAA
GAAACCAGAACCAATCACGTCTTCTTGCCAAGATGGGAGGAAAATGGCAGGTAGAGATAAACAACCGAAGCACATATGGACGATGCAGGAGGAGGCGAAATTGGTGGAAT
GCCTCGTTGACCTTGTCCACGAAGGCGGTTGGAGGGGGGACAACGGAACGTTCAGGCCTGGATATCACGCACGACTGTTGCGTATGTTGAAGGATAAAATGCCGTCATGC
ACAATAGACTTAACTTCAACAATAGACGGCAAGGTGCGGGCATTAAAACGGCAGTATAGTGCGATATCTGAGATGCTGGGTCCGGGTTGCAGTGGGTTCGGGTGGAATGA
CGAATTCAAATGCATCGTGGCTGAGAAAGAAGTGTACAATGCATGGGTGAAGTCACACTCCGGTGCCAGGGGATTGCTCAATAAGCCATTTCCTCACTACGAGACGCTCG
CTTTCGTGTTCGGCAAAGATCGGGCAAGTGGCGGCGGTTCCGAAGTTCCAGCGGAACAGACAGACAGCACCCAGGGAGACGAAGAGGGCGATAACAATTTACAGGGGTCA
CAGGAGTACTATGTCCCCGGACCGTCGGACCTTAATCTTGGTGCAGACCTGGAGTTCGACGAAGTCCCAATCACACCCACGAGTCGACCGAGCACAGCGGGGTCATCACA
GGGACGGAAAAGGAGCAGAGCATCGTATGAAGCTGAAGCCCTGGAAATTATGCGACAGTCAGTGGTGATGCAGGAGACACAATTCACCAAGATCGCTGACTGGCCGGACA
CTCAGGATGCCCGGGAATTCAAACGGAGGGACACGGTCGGGGAGATTCTCCTGGGACAGCCGGAGTTATCAGACGACGAGAGAGTATGTCTGATGCGCATCCTCTTCGCT
GACCCCAAGATGACGAATATGATGCTATCCGTGCCACCGACCATGAGGCTTCGTTTCCTCCGCGGATTGCTCAATGAACGCCGGTGA
Protein sequence
MDSISPHELVSILSIMVDSQRQLFNLISFFMNNHRRLENQSPYIRHQIRQLACFRLIHESDLCCRESTRMDRRCFAILCSLLRTTSGIVGTEIVDVEEMVAMFLHIVAHD
VKNRVIRRQFARSGETVSRHFNATLQAVLRLYDVLLKKPEPITSSCQDGRKMAGRDKQPKHIWTMQEEAKLVECLVDLVHEGGWRGDNGTFRPGYHARLLRMLKDKMPSC
TIDLTSTIDGKVRALKRQYSAISEMLGPGCSGFGWNDEFKCIVAEKEVYNAWVKSHSGARGLLNKPFPHYETLAFVFGKDRASGGGSEVPAEQTDSTQGDEEGDNNLQGS
QEYYVPGPSDLNLGADLEFDEVPITPTSRPSTAGSSQGRKRSRASYEAEALEIMRQSVVMQETQFTKIADWPDTQDAREFKRRDTVGEILLGQPELSDDERVCLMRILFA
DPKMTNMMLSVPPTMRLRFLRGLLNERR
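The CDS and protein sequences above can be checked against each other by translating the CDS. The sketch below assumes the standard genetic code (the page does not say which table was used) and defines a hypothetical `translate` helper. It translates only the first 30 bases and the final line of the CDS as listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (an assumption;
# the page does not name the translation table).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and the final line of the CDS shown on this page.
CDS_START = "ATGGATTCGATAAGCCCGCATGAACTTGTC"
CDS_END = ("GACCCCAAGATGACGAATATGATGCTATCCGTGCCACCGACCATGAGGCT"
           "TCGTTTCCTCCGCGGATTGCTCAATGAACGCCGGTGA")

print(translate(CDS_START))
print(translate(CDS_END))
```

Both fragments agree with the start (`MDSISPHELV`) and end (`DPKMTNMMLSVPPTMRLRFLRGLLNERR`) of the listed protein sequence.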