; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg036413 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg036413
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTy3/gypsy retrotransposon protein
Genome locationscaffold5:42807956..42809110
RNA-Seq ExpressionSpg036413
SyntenySpg036413
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037097.1 Transposon Ty3-G Gag-Pol polyprotein [Cucumis melo var. makuwa]1.1e-4430.02Show/hide
Query:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL
        + ++E R++ +   +  +  E+ K+PT+E  +  + + M  +R + +  E ++  + E                           +KE  SS++  SG  
Subjt:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL

Query:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE
                    ++   D    + +E+P+F G D   W+ + ERYF+ H++ D+ K   M V  +   G AL+W+R  +  E    SW   +  L  RF 
Subjt:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE

Query:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------
        +   + + +RF+ ++QE +V DY + F+ L    L D+PD +++  FMNGL   +RAEVR+ +PKG+ E+M  A+ VE    E+N V L NFA       
Subjt:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------

Query:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV
                         +T+F                TMK KG +++R V++ I  G  +N IS+ L   L+LP   TG Y V LG+G  ++G  I   V
Subjt:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV

Query:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL
         +Q  N  V E+FLPLE+G  +D VLGM WL +LG   VDWK LT+      + + ++GDP+L
Subjt:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL

KAA0056890.1 aminoacyl-tRNA ligase [Cucumis melo var. makuwa]2.5e-9452.63Show/hide
Query:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI
        MEN +EAL   LKK+ D EV  IPT  +NN+T L+ +MA           L+    ++  PQ +           R+P+      LELPMFDGTD+ MWI
Subjt:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI

Query:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC
        LKMERYF+ H IDD A  ++MD I LCMSGQAL WFRC Q+   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV +YCS+FE LG  LLP++   
Subjt:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC

Query:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV
        +LEAKFMNGLK E+R +VRML PK I +IM +AR  E+KNNVALN         + K KGT+++RSV+VK+ S   YNLIS+NLAT+LKL  D  GDY V
Subjt:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV

Query:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL
         LG+GK ++GD I RGV+LQ  N    EDF PL+MGE+ + +LG  WLV LG+MEVDWK L MK+K+G+ETVTL+ DP L
Subjt:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL

KAE8652678.1 hypothetical protein Csa_013756 [Cucumis sativus]1.6e-9653.56Show/hide
Query:  MENRIEALQTNLKKLTD-EVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWIL
        MEN +EAL+  LKK+ D EV  IP  +NN T LL +MA L    +  + L+Q     + P                        LELPMFDGTD+ MWIL
Subjt:  MENRIEALQTNLKKLTD-EVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWIL

Query:  KMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCI
        KMERYF+ H IDD A  ++M+ I LCMSGQAL WFRC Q+  NPP SW EFR +L+KRF +G  +  RFI LQQEGSV +YCS+FE LG  LLP++  C+
Subjt:  KMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCI

Query:  LEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVD
        +EAKFMNGLK E+R EVRML  +GI +IM +AR  E KNNVA         F + K KGT+++RSVVVK+ S   YNLIS+NLAT+LKL  D  GDY V 
Subjt:  LEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVD

Query:  LGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL
        LG+GKT++GD I RGV+LQ  N    EDF PL+MGE+ + +LG  WLV LG+MEVDWK LTMK+++G+E VTL+ DP+L
Subjt:  LGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL

TYK06549.1 transposon Tf2-1 polyprotein isoform X1 [Cucumis melo var. makuwa]6.3e-4530.02Show/hide
Query:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL
        + ++E R++ +   +  +  E+ K+PT+E  +  + + M  +R + +  E ++  + E                           +KE  SS++  SG  
Subjt:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL

Query:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE
                    ++   D    + +E+P+F G D   W+ + ERYF+ H++ D+ K   M V  +   G AL+W+R  +  E    SW   +  L  RF 
Subjt:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE

Query:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------
        +   + + +RF+ ++QE +V DY + F+ L    L D+PD +++  FMNGL   +RAEVR+ +PKG+ E+M  A+ VE    E+N V L NFA       
Subjt:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------

Query:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV
                         +T+F                TMK KG +++R V++ I  G  +N IS+ L   L+LP   TG Y V LG+G  ++G  I   V
Subjt:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV

Query:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL
         +Q  N  V E+FLPLE+G  +D VLGM WL +LG   VDWK LT+      + ++++GDP+L
Subjt:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL

XP_016900762.1 PREDICTED: uncharacterized protein LOC107991016 [Cucumis melo]7.2e-8952.19Show/hide
Query:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI
        MEN +EAL   LKK+ D EV  IPT  +NN+T L+ +MA           L+    ++  PQ +           R+P+      LELPMFDGTD+ MWI
Subjt:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI

Query:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC
        LKMERYF+ H IDD A  ++MD I LCMSGQAL WFRC Q+   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV +YCS+FE LG  LLP++   
Subjt:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC

Query:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV
        +LEAKFMNGLK E+R +VRML PK I +IM +AR  E+KNNVALN         + K KGT+++RSV+VK+ S   YNLIS+NLAT+LKL  D  GDY V
Subjt:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV

Query:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMK
         LG+GK ++GD I RGV+LQ  N    EDF PL+MGE+ + +LG  WLV LG+MEVDWK L MK+K
Subjt:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMK

TrEMBL top hitse value%identityAlignment
A0A0A0LUB3 Retrotrans_gag domain-containing protein3.0e-5649.26Show/hide
Query:  MENRIEALQTNLKKLTD-EVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWIL
        MEN +EAL+  LKK+ D EV  IP  +NN T LL +MA L    +  + L+Q     + P                        LELPMFDGTD+ MWIL
Subjt:  MENRIEALQTNLKKLTD-EVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWIL

Query:  KMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCI
        KMERYF+ H IDD A  ++M+ I LCMSGQAL WFRC Q+  NPP SW EFR +L+KRF +G  +  RFI LQQEGSV +YCS+FE LG  LLP++  C+
Subjt:  KMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCI

Query:  LEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIAS
        +EAKFMNGLK E+R EVRML  +GI +IM +AR  E KNNVA         F + K KGT+++ S+++ + S
Subjt:  LEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIAS

A0A1S4DXQ7 uncharacterized protein LOC1079910163.5e-8952.19Show/hide
Query:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI
        MEN +EAL   LKK+ D EV  IPT  +NN+T L+ +MA           L+    ++  PQ +           R+P+      LELPMFDGTD+ MWI
Subjt:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI

Query:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC
        LKMERYF+ H IDD A  ++MD I LCMSGQAL WFRC Q+   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV +YCS+FE LG  LLP++   
Subjt:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC

Query:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV
        +LEAKFMNGLK E+R +VRML PK I +IM +AR  E+KNNVALN         + K KGT+++RSV+VK+ S   YNLIS+NLAT+LKL  D  GDY V
Subjt:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV

Query:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMK
         LG+GK ++GD I RGV+LQ  N    EDF PL+MGE+ + +LG  WLV LG+MEVDWK L MK+K
Subjt:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMK

A0A5A7T6B1 Transposon Ty3-G Gag-Pol polyprotein5.2e-4530.02Show/hide
Query:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL
        + ++E R++ +   +  +  E+ K+PT+E  +  + + M  +R + +  E ++  + E                           +KE  SS++  SG  
Subjt:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL

Query:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE
                    ++   D    + +E+P+F G D   W+ + ERYF+ H++ D+ K   M V  +   G AL+W+R  +  E    SW   +  L  RF 
Subjt:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE

Query:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------
        +   + + +RF+ ++QE +V DY + F+ L    L D+PD +++  FMNGL   +RAEVR+ +PKG+ E+M  A+ VE    E+N V L NFA       
Subjt:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------

Query:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV
                         +T+F                TMK KG +++R V++ I  G  +N IS+ L   L+LP   TG Y V LG+G  ++G  I   V
Subjt:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV

Query:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL
         +Q  N  V E+FLPLE+G  +D VLGM WL +LG   VDWK LT+      + + ++GDP+L
Subjt:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL

A0A5D3BJD9 Aminoacyl-tRNA ligase1.2e-9452.63Show/hide
Query:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI
        MEN +EAL   LKK+ D EV  IPT  +NN+T L+ +MA           L+    ++  PQ +           R+P+      LELPMFDGTD+ MWI
Subjt:  MENRIEALQTNLKKLTD-EVQKIPTM-ENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWI

Query:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC
        LKMERYF+ H IDD A  ++MD I LCMSGQAL WFRC Q+   PP SWDEFR +L+ RF +   +  +F+ L+QEGSV +YCS+FE LG  LLP++   
Subjt:  LKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDC

Query:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV
        +LEAKFMNGLK E+R +VRML PK I +IM +AR  E+KNNVALN         + K KGT+++RSV+VK+ S   YNLIS+NLAT+LKL  D  GDY V
Subjt:  ILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCV

Query:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL
         LG+GK ++GD I RGV+LQ  N    EDF PL+MGE+ + +LG  WLV LG+MEVDWK L MK+K+G+ETVTL+ DP L
Subjt:  DLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL

A0A5D3C860 Transposon Tf2-1 polyprotein isoform X13.1e-4530.02Show/hide
Query:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL
        + ++E R++ +   +  +  E+ K+PT+E  +  + + M  +R + +  E ++  + E                           +KE  SS++  SG  
Subjt:  RNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQE---------------------------KKEPQSSEAKRSGEL

Query:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE
                    ++   D    + +E+P+F G D   W+ + ERYF+ H++ D+ K   M V  +   G AL+W+R  +  E    SW   +  L  RF 
Subjt:  Q-----------QQRLPDSLARRMLELPMFDGTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE

Query:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------
        +   + + +RF+ ++QE +V DY + F+ L    L D+PD +++  FMNGL   +RAEVR+ +PKG+ E+M  A+ VE    E+N V L NFA       
Subjt:  NG--DTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVE----EKNNVAL-NFA-------

Query:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV
                         +T+F                TMK KG +++R V++ I  G  +N IS+ L   L+LP   TG Y V LG+G  ++G  I   V
Subjt:  ---------------DVDTSFK---------------TMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGV

Query:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL
         +Q  N  V E+FLPLE+G  +D VLGM WL +LG   VDWK LT+      + ++++GDP+L
Subjt:  MLQFQNLFVFEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G67020.1 unknown protein1.4e-0527.81Show/hide
Query:  NKMENRIEALQTNLKKL-TDEVQ---KIPTMENNITALLEEMAQLRLRQ-EATETLIQKMQEKKEPQSSEAKRSGELQQQRLPD---SLARRMLELPMFD
        ++ E  ++ L    K+L  D+VQ   K+ +M++ +  ++  + ++ +R+ +  E      Q  +   SS   + G  +   L D   SL RR +E+P+FD
Subjt:  NKMENRIEALQTNLKKL-TDEVQ---KIPTMENNITALLEEMAQLRLRQ-EATETLIQKMQEKKEPQSSEAKRSGELQQQRLPD---SLARRMLELPMFD

Query:  GTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE
        G+    W  K+ER+F+  R  D+ K   +D++AL + G AL WF   +        W+ F   L  RF+
Subjt:  GTDSPMWILKMERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFE

AT3G29750.1 Eukaryotic aspartyl protease family protein1.3e-1930.51Show/hide
Query:  FIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEE-------------IMRKARFVEEKNNVALNFADVD------
        +  +QQEGSVRDY  +FE L  L    +P    E  F+ GL+  ++  VR L+P GI               +  K   V++K  V     +++      
Subjt:  FIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRAEVRMLQPKGIEE-------------IMRKARFVEEKNNVALNFADVD------

Query:  -----------TSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGE-N
                   T  K M+F G + D  VVV I SG   N I   LA  LKLP   T    V LG  + I+      G+ L  Q + + E+FL L++ + +
Subjt:  -----------TSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGE-N

Query:  IDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTL
        +D +LG  WL  LGE  V+W+         ++ +TL
Subjt:  IDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTL

AT3G30770.1 Eukaryotic aspartyl protease family protein9.5e-0732Show/hide
Query:  TSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGE-NIDFVLGMSWLV
        T  K M+F G +    VVV I SG   N IS  LA  LKLP   T    V LG  + I+      G+ L  Q + + E+FL L++ + ++D +LG     
Subjt:  TSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGE-NIDFVLGMSWLV

Query:  TLGEMEVDWKALTMKMKMGRETVTL
         L    + W          ++ VTL
Subjt:  TLGEMEVDWKALTMKMKMGRETVTL

AT3G42723.1 aminoacyl-tRNA ligases;ATP binding;nucleotide binding1.1e-1526.09Show/hide
Query:  ERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTM----YDRFIALQQEGSVRDYCSQFE--LLGGLLLPDI
        E YF  + I +  +   + ++   + G    W + +   +N P SW EF+  + +  E   TM       +  +QQEGSVR+Y  +FE   LG ++LP  
Subjt:  ERYFKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTM----YDRFIALQQEGSVRDYCSQFE--LLGGLLLPDI

Query:  PDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGT-LEDRSVV--------VKIASGVAYNLISQNLATEL
            LEA F+ GL+  ++  VR L+P GI ++M  A+++EE N++ +  + +    +   +  T  E RS+V        +K     A N  +  L  E 
Subjt:  PDCILEAKFMNGLKAEVRAEVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGT-LEDRSVV--------VKIASGVAYNLISQNLATEL

Query:  KLP------------WDYTGDYCVDLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGEN-IDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTL
        +LP            + Y     V        +  R  + + L+  ++ + ED+   ++  + +D +LG  WL  LGE EV+W+  +      ++ VTL
Subjt:  KLP------------WDYTGDYCVDLGNGKTIEGDRIFRGVMLQFQNLFVFEDFLPLEMGEN-IDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAGACGGAACAAAATGGAAAACCGAATAGAAGCACTCCAAACGAACTTGAAGAAGCTTACCGATGAAGTTCAGAAGATTCCAACAATGGAGAACAACATCACGGC
TCTGTTAGAAGAAATGGCCCAGCTTCGGCTGCGCCAGGAAGCCACTGAGACATTGATCCAGAAGATGCAAGAGAAGAAGGAACCACAGTCCAGTGAAGCGAAACGAAGTG
GAGAACTACAACAACAACGCTTACCTGACTCCCTAGCCCGCCGGATGCTGGAATTGCCTATGTTCGATGGAACCGATTCGCCCATGTGGATCCTGAAAATGGAACGCTAC
TTTAAATTTCACCGCATCGACGACACTGCCAAGACCAAGATCATGGACGTTATCGCACTCTGTATGTCAGGCCAAGCCCTAGACTGGTTCCGATGCGTTCAACATGGGGA
AAATCCGCCGGGATCGTGGGATGAGTTTCGGCATGCTTTGTTTAAGCGATTTGAAAACGGCGACACCATGTATGACAGGTTCATTGCCTTACAGCAAGAGGGGAGCGTGA
GGGACTATTGCAGCCAGTTCGAGTTACTTGGGGGGCTCCTCCTTCCAGACATTCCTGACTGCATTCTTGAAGCAAAATTTATGAACGGCTTAAAGGCAGAGGTTCGAGCG
GAGGTTCGGATGTTACAACCAAAAGGTATAGAGGAAATCATGAGAAAGGCGAGGTTCGTGGAAGAAAAGAACAACGTTGCGCTGAACTTTGCCGACGTGGATACGTCGTT
TAAGACCATGAAGTTCAAAGGCACGCTCGAGGATAGGTCTGTGGTCGTTAAGATCGCCAGTGGGGTAGCCTACAACTTAATCTCTCAAAATTTGGCTACGGAATTGAAGC
TTCCGTGGGACTACACCGGCGATTACTGTGTGGATTTGGGTAACGGGAAGACGATCGAAGGAGACAGAATTTTCCGTGGAGTGATGCTGCAATTCCAGAATCTCTTCGTT
TTCGAGGACTTCTTGCCGCTTGAAATGGGAGAGAATATTGATTTCGTATTGGGAATGTCGTGGCTGGTGACCCTAGGCGAAATGGAGGTTGACTGGAAAGCTCTCACGAT
GAAGATGAAGATGGGGAGAGAGACTGTGACACTACAAGGAGACCCAGCTCTGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAGACGGAACAAAATGGAAAACCGAATAGAAGCACTCCAAACGAACTTGAAGAAGCTTACCGATGAAGTTCAGAAGATTCCAACAATGGAGAACAACATCACGGC
TCTGTTAGAAGAAATGGCCCAGCTTCGGCTGCGCCAGGAAGCCACTGAGACATTGATCCAGAAGATGCAAGAGAAGAAGGAACCACAGTCCAGTGAAGCGAAACGAAGTG
GAGAACTACAACAACAACGCTTACCTGACTCCCTAGCCCGCCGGATGCTGGAATTGCCTATGTTCGATGGAACCGATTCGCCCATGTGGATCCTGAAAATGGAACGCTAC
TTTAAATTTCACCGCATCGACGACACTGCCAAGACCAAGATCATGGACGTTATCGCACTCTGTATGTCAGGCCAAGCCCTAGACTGGTTCCGATGCGTTCAACATGGGGA
AAATCCGCCGGGATCGTGGGATGAGTTTCGGCATGCTTTGTTTAAGCGATTTGAAAACGGCGACACCATGTATGACAGGTTCATTGCCTTACAGCAAGAGGGGAGCGTGA
GGGACTATTGCAGCCAGTTCGAGTTACTTGGGGGGCTCCTCCTTCCAGACATTCCTGACTGCATTCTTGAAGCAAAATTTATGAACGGCTTAAAGGCAGAGGTTCGAGCG
GAGGTTCGGATGTTACAACCAAAAGGTATAGAGGAAATCATGAGAAAGGCGAGGTTCGTGGAAGAAAAGAACAACGTTGCGCTGAACTTTGCCGACGTGGATACGTCGTT
TAAGACCATGAAGTTCAAAGGCACGCTCGAGGATAGGTCTGTGGTCGTTAAGATCGCCAGTGGGGTAGCCTACAACTTAATCTCTCAAAATTTGGCTACGGAATTGAAGC
TTCCGTGGGACTACACCGGCGATTACTGTGTGGATTTGGGTAACGGGAAGACGATCGAAGGAGACAGAATTTTCCGTGGAGTGATGCTGCAATTCCAGAATCTCTTCGTT
TTCGAGGACTTCTTGCCGCTTGAAATGGGAGAGAATATTGATTTCGTATTGGGAATGTCGTGGCTGGTGACCCTAGGCGAAATGGAGGTTGACTGGAAAGCTCTCACGAT
GAAGATGAAGATGGGGAGAGAGACTGTGACACTACAAGGAGACCCAGCTCTGTAA
Protein sequenceShow/hide protein sequence
MGRRNKMENRIEALQTNLKKLTDEVQKIPTMENNITALLEEMAQLRLRQEATETLIQKMQEKKEPQSSEAKRSGELQQQRLPDSLARRMLELPMFDGTDSPMWILKMERY
FKFHRIDDTAKTKIMDVIALCMSGQALDWFRCVQHGENPPGSWDEFRHALFKRFENGDTMYDRFIALQQEGSVRDYCSQFELLGGLLLPDIPDCILEAKFMNGLKAEVRA
EVRMLQPKGIEEIMRKARFVEEKNNVALNFADVDTSFKTMKFKGTLEDRSVVVKIASGVAYNLISQNLATELKLPWDYTGDYCVDLGNGKTIEGDRIFRGVMLQFQNLFV
FEDFLPLEMGENIDFVLGMSWLVTLGEMEVDWKALTMKMKMGRETVTLQGDPAL