; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039262 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039262
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionBED-type domain-containing protein
Genome locationscaffold10:40732817..40744777
RNA-Seq ExpressionSpg039262
SyntenySpg039262
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
InterPro domainsIPR003656 - Zinc finger, BED-type
IPR012337 - Ribonuclease H-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_038884678.1 uncharacterized protein LOC120075395 isoform X1 [Benincasa hispida]1.2e-13544.26Show/hide
Query:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE
        K+GMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQI+QLLGFK  EKLKRQKKGSKNAVSCFPSREE
Subjt:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE

Query:  IDDGV-----------------------------------------------------------------------------------------------
        IDDG+                                                                                               
Subjt:  IDDGV-----------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------H
                                                                                                           H
Subjt:  ---------------------------------------------------------------------------------------------------H

Query:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF
        QTFTSGAWMQS+LSKYGAGLEV KI ADPLFWSKCDHITMGTKPLLSVLQFLESEEKP+ GFIYDAFEK+KNSVMLAFN+KES+YLPYLKAI+HVL KEF
Subjt:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF

Query:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------
        QS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGRPVALHGRD LAP             + +LA      
Subjt:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------

Query:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE
                                                   RRLET KARCSIDA+D   LEAID NM+DWV      EDEHK WVD+KVT+QET VE
Subjt:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE

Query:  HKLSSMDNCIDSTD
        HKLS+MD+CID TD
Subjt:  HKLSSMDNCIDSTD

XP_038884679.1 uncharacterized protein LOC120075395 isoform X2 [Benincasa hispida]1.2e-13544.26Show/hide
Query:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE
        K+GMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQI+QLLGFK  EKLKRQKKGSKNAVSCFPSREE
Subjt:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE

Query:  IDDGV-----------------------------------------------------------------------------------------------
        IDDG+                                                                                               
Subjt:  IDDGV-----------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------H
                                                                                                           H
Subjt:  ---------------------------------------------------------------------------------------------------H

Query:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF
        QTFTSGAWMQS+LSKYGAGLEV KI ADPLFWSKCDHITMGTKPLLSVLQFLESEEKP+ GFIYDAFEK+KNSVMLAFN+KES+YLPYLKAI+HVL KEF
Subjt:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF

Query:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------
        QS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGRPVALHGRD LAP             + +LA      
Subjt:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------

Query:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE
                                                   RRLET KARCSIDA+D   LEAID NM+DWV      EDEHK WVD+KVT+QET VE
Subjt:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE

Query:  HKLSSMDNCIDSTD
        HKLS+MD+CID TD
Subjt:  HKLSSMDNCIDSTD

XP_038884682.1 uncharacterized protein LOC120075395 isoform X3 [Benincasa hispida]2.9e-13444.16Show/hide
Query:  MVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDD
        MVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQI+QLLGFK  EKLKRQKKGSKNAVSCFPSREEIDD
Subjt:  MVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDD

Query:  GV--------------------------------------------------------------------------------------------------
        G+                                                                                                  
Subjt:  GV--------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------------------------HQTF
                                                                                                        HQTF
Subjt:  ------------------------------------------------------------------------------------------------HQTF

Query:  TSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSP
        TSGAWMQS+LSKYGAGLEV KI ADPLFWSKCDHITMGTKPLLSVLQFLESEEKP+ GFIYDAFEK+KNSVMLAFN+KES+YLPYLKAI+HVL KEFQS 
Subjt:  TSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSP

Query:  LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA---------
        LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGRPVALHGRD LAP             + +LA         
Subjt:  LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA---------

Query:  ----------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVEHKL
                                                RRLET KARCSIDA+D   LEAID NM+DWV      EDEHK WVD+KVT+QET VEHKL
Subjt:  ----------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVEHKL

Query:  SSMDNCIDSTD
        S+MD+CID TD
Subjt:  SSMDNCIDSTD

XP_038884685.1 uncharacterized protein LOC120075395 isoform X4 [Benincasa hispida]1.2e-13544.26Show/hide
Query:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE
        K+GMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQI+QLLGFK  EKLKRQKKGSKNAVSCFPSREE
Subjt:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE

Query:  IDDGV-----------------------------------------------------------------------------------------------
        IDDG+                                                                                               
Subjt:  IDDGV-----------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------H
                                                                                                           H
Subjt:  ---------------------------------------------------------------------------------------------------H

Query:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF
        QTFTSGAWMQS+LSKYGAGLEV KI ADPLFWSKCDHITMGTKPLLSVLQFLESEEKP+ GFIYDAFEK+KNSVMLAFN+KES+YLPYLKAI+HVL KEF
Subjt:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF

Query:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------
        QS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGRPVALHGRD LAP             + +LA      
Subjt:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------

Query:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE
                                                   RRLET KARCSIDA+D   LEAID NM+DWV      EDEHK WVD+KVT+QET VE
Subjt:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE

Query:  HKLSSMDNCIDSTD
        HKLS+MD+CID TD
Subjt:  HKLSSMDNCIDSTD

XP_038884686.1 uncharacterized protein LOC120075395 isoform X5 [Benincasa hispida]2.0e-13544.2Show/hide
Query:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE
        K+GMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCE VPEEVKVQI+QLLGFK  EKLKRQKKGSKNAVSCFPSREE
Subjt:  KKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREE

Query:  IDDGV-----------------------------------------------------------------------------------------------
        IDDG+                                                                                               
Subjt:  IDDGV-----------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------------------------------H
                                                                                                           H
Subjt:  ---------------------------------------------------------------------------------------------------H

Query:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF
        QTFTSGAWMQS+LSKYGAGLEV KI ADPLFWSKCDHITMGTKPLLSVLQFLESEEKP+ GFIYDAFEK+KNSVMLAFN+KES+YLPYLKAI+HVL KEF
Subjt:  QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEF

Query:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------
        QS LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGRPVALHGRD LAP             + +LA      
Subjt:  QSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA------

Query:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE
                                                   RRLET KARCSIDA+D   LEAID NM+DWV      EDEHK WVD+KVT+QET VE
Subjt:  -------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVE

Query:  HKLSSMDNCIDSTDE
        HKLS+MD+CID T E
Subjt:  HKLSSMDNCIDSTDE

TrEMBL top hitse value%identityAlignment
A0A6J1BVZ0 uncharacterized protein LOC111006240 isoform X25.6e-13141.81Show/hide
Query:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK
        M+E +E LTD++FKEKR                   G+VP RASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK
Subjt:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK

Query:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGV-------------------------------------------------------------
        +QI+QLLGFK  EKLKRQKK +KNAV CFPSRE IDD V                                                             
Subjt:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGV-------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------HQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAF
                                        HQTFTSG WMQS+ SK+GAGLEVAKITADPLFWSKCDH+T GTKPLLSVLQFLESEEKPS GFIYDAF
Subjt:  --------------------------------HQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAF

Query:  EKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVA
        EK+KNSVMLAFN KES Y P+LKAI+HVL KEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVM  +NINFYEEAVGDFGR VA
Subjt:  EKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVA

Query:  LHGRDLLAP-------------VSKLA-------------------------------------------------RRLETGKARCSIDALDTACLEAID
        LHGR+ LAP             + +LA                                                 RRLE GK RCSI ALD  CLEAID
Subjt:  LHGRDLLAP-------------VSKLA-------------------------------------------------RRLETGKARCSIDALDTACLEAID

Query:  ANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVEHKLSSMDNCIDSTDERGS
          MEDW+ D+EV+EDEHKRW+++KVTSQET VEHK S++++CID+TDER S
Subjt:  ANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVEHKLSSMDNCIDSTDERGS

A0A6J1BWT8 uncharacterized protein LOC111006240 isoform X15.6e-13141.81Show/hide
Query:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK
        M+E +E LTD++FKEKR                   G+VP RASDPGWAHGIMVNGGRQKIKCKYC+KVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK
Subjt:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK

Query:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGV-------------------------------------------------------------
        +QI+QLLGFK  EKLKRQKK +KNAV CFPSRE IDD V                                                             
Subjt:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGV-------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------HQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAF
                                        HQTFTSG WMQS+ SK+GAGLEVAKITADPLFWSKCDH+T GTKPLLSVLQFLESEEKPS GFIYDAF
Subjt:  --------------------------------HQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAF

Query:  EKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVA
        EK+KNSVMLAFN KES Y P+LKAI+HVL KEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVM  +NINFYEEAVGDFGR VA
Subjt:  EKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVA

Query:  LHGRDLLAP-------------VSKLA-------------------------------------------------RRLETGKARCSIDALDTACLEAID
        LHGR+ LAP             + +LA                                                 RRLE GK RCSI ALD  CLEAID
Subjt:  LHGRDLLAP-------------VSKLA-------------------------------------------------RRLETGKARCSIDALDTACLEAID

Query:  ANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVEHKLSSMDNCIDSTDERGS
          MEDW+ D+EV+EDEHKRW+++KVTSQET VEHK S++++CID+TDER S
Subjt:  ANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVEHKLSSMDNCIDSTDERGS

A0A6J1H0E4 uncharacterized protein LOC111459278 isoform X12.0e-12842.49Show/hide
Query:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK
        M ESEELLTDMKFKEKR                   GM PPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNV PCEEVPEEVK
Subjt:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK

Query:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGVH------------------------------------------------------------
        VQI+QLLGFK   KLKR  KGSKNA SCFPSREEIDDGVH                                                            
Subjt:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGVH------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYD
                                           Q FTSGAWMQS+LSK GAGLEVAKITADP+FWSKCDHITMGTKPLLSVLQFLESEE+PS GFIYD
Subjt:  -----------------------------------QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYD

Query:  AFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGR
        AFEK+K++VMLAFN+KESVYLPYLKAI+HVLLKEFQS LH+AAYYLNPSIFYSP TF+ SKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGR
Subjt:  AFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSP-TFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGR

Query:  PVALHGRDLLAP-------------VSKLA--------------------------------------------------RRLETGKARCSIDALDTACL
        PVALHGRD LAP             + +LA                                                  RRLET K RCSIDALD   L
Subjt:  PVALHGRDLLAP-------------VSKLA--------------------------------------------------RRLETGKARCSIDALDTACL

Query:  EAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQ
        E I ANMEDWVED+E LEDE +RWVD+K TSQ
Subjt:  EAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQ

A0A6J1JMG6 uncharacterized protein LOC111487192 isoform X21.7e-12742.88Show/hide
Query:  MPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCF
        M C V KGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNV PCEEVPEEVKVQI+QLLGFK   KLKR KKGSKNA SC 
Subjt:  MPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCF

Query:  PSREEIDDGVH-----------------------------------------------------------------------------------------
         SREEIDDGVH                                                                                         
Subjt:  PSREEIDDGVH-----------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINH
              Q FTSGAWMQS+ SK GAGLEVAKITADP+FWSKC+HITMGTKPLLSV+QFLESEEKPS GFIYDAFEK+KNSVMLAFN+KESVYLPYLKAI+H
Subjt:  ------QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINH

Query:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA
        VLLKEFQS LH+AAYYLNPSIFYSPTF+ SKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGRPVALHGRD LAP             + +LA
Subjt:  VLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP-------------VSKLA

Query:  --------------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVT
                                                          RRLET KARCSIDALD   LE I ANMEDWVED+E LEDEH+RWVD+K T
Subjt:  --------------------------------------------------RRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVT

Query:  SQ
        SQ
Subjt:  SQ

A0A6J1JSR3 uncharacterized protein LOC111487192 isoform X17.3e-13142.82Show/hide
Query:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK
        M ESEELLTDMKFKEKR                   GMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNV PCEEVPEEVK
Subjt:  MVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVK

Query:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGVH------------------------------------------------------------
        VQI+QLLGFK   KLKR KKGSKNA SC  SREEIDDGVH                                                            
Subjt:  VQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGVH------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYD
                                           Q FTSGAWMQS+ SK GAGLEVAKITADP+FWSKC+HITMGTKPLLSV+QFLESEEKPS GFIYD
Subjt:  -----------------------------------QTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYD

Query:  AFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRP
        AFEK+KNSVMLAFN+KESVYLPYLKAI+HVLLKEFQS LH+AAYYLNPSIFYSPTF+ SKVIQKGLLDCIEALEPDITSQVM+TNNINFYEEAVGDFGRP
Subjt:  AFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRP

Query:  VALHGRDLLAP-------------VSKLA--------------------------------------------------RRLETGKARCSIDALDTACLE
        VALHGRD LAP             + +LA                                                  RRLET KARCSIDALD   LE
Subjt:  VALHGRDLLAP-------------VSKLA--------------------------------------------------RRLETGKARCSIDALDTACLE

Query:  AIDANMEDWVEDIEVLEDEHKRWVDLKVTSQ
         I ANMEDWVED+E LEDEH+RWVD+K TSQ
Subjt:  AIDANMEDWVEDIEVLEDEHKRWVDLKVTSQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G22220.1 hAT transposon superfamily9.1e-2533.99Show/hide
Query:  TSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSP
        TS  W     SK   GL + +   D  FW         T P+L VL+ + SE KP++G++Y A  ++K ++      +E  Y+ Y K I+   L   Q P
Subjt:  TSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSP

Query:  LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAPVSKLARRLET--GKARCSIDALDTACL
        L+ A +YLNP  FYS        I   ++DCIE L PD+  Q +V  +IN Y+ AVG FGR +A+  RD + P    +   E+    +R +I  L   C 
Subjt:  LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAPVSKLARRLET--GKARCSIDALDTACL

Query:  EAI
         +I
Subjt:  EAI

AT3G22220.1 hAT transposon superfamily1.5e-1134.74Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKN-AVSCFPSRE
        P+  D  W H  +   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VP+EV++ ++Q +      + KR+K   +   ++ FP  E
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKN-AVSCFPSRE

AT3G22220.2 hAT transposon superfamily9.1e-2533.99Show/hide
Query:  TSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSP
        TS  W     SK   GL + +   D  FW         T P+L VL+ + SE KP++G++Y A  ++K ++      +E  Y+ Y K I+   L   Q P
Subjt:  TSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSP

Query:  LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAPVSKLARRLET--GKARCSIDALDTACL
        L+ A +YLNP  FYS        I   ++DCIE L PD+  Q +V  +IN Y+ AVG FGR +A+  RD + P    +   E+    +R +I  L   C 
Subjt:  LHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAPVSKLARRLET--GKARCSIDALDTACL

Query:  EAI
         +I
Subjt:  EAI

AT3G22220.2 hAT transposon superfamily1.5e-1134.74Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKN-AVSCFPSRE
        P+  D  W H  +   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VP+EV++ ++Q +      + KR+K   +   ++ FP  E
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKN-AVSCFPSRE

AT4G15020.1 hAT transposon superfamily3.0e-2030.05Show/hide
Query:  EIDDGVHQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAIN
        E+   +    TS  W +   S+  +GL +  +T D  FW     +   T PLL  L+ + SE++P++G++Y A  ++K+++      +E  Y+ Y K I+
Subjt:  EIDDGVHQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAIN

Query:  HVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP
            ++   PL  A ++LNP +FY+        +   +LDCIE L PD   Q  +   +  Y+ A G FGR +A+  RD + P
Subjt:  HVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP

AT4G15020.1 hAT transposon superfamily8.8e-1238.1Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSK
        P+  D  W H  I   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VPE+V++ ++Q +      + KR K  S+
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSK

AT4G15020.2 hAT transposon superfamily3.0e-2030.05Show/hide
Query:  EIDDGVHQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAIN
        E+   +    TS  W +   S+  +GL +  +T D  FW     +   T PLL  L+ + SE++P++G++Y A  ++K+++      +E  Y+ Y K I+
Subjt:  EIDDGVHQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAIN

Query:  HVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP
            ++   PL  A ++LNP +FY+        +   +LDCIE L PD   Q  +   +  Y+ A G FGR +A+  RD + P
Subjt:  HVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP

AT4G15020.2 hAT transposon superfamily8.8e-1238.1Show/hide
Query:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSK
        P+  D  W H  I   G R +++C YC K+  GGGI+R+K+HLAG++G    C++VPE+V++ ++Q +      + KR K  S+
Subjt:  PRASDPGWAH-GIMVNGGRQKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSK

AT5G33406.1 hAT dimerisation domain-containing protein / transposase-related1.2e-2128.96Show/hide
Query:  IDDGVHQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINH
        + D + +   S  W  S  +K   G+++        FW    H      PL+ VL+ ++ E KP +G+IY A +++K ++M +F  KE  Y    + I+ 
Subjt:  IDDGVHQTFTSGAWMQSHLSKYGAGLEVAKITADPLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINH

Query:  VLLKEFQSPLHVAAYYLNPSIFY-SPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP
            +   PLH A YYLNP   Y  P  +  + +  G L C+  L P I +Q  +   ++ +++A G FG P+A+  R  ++P
Subjt:  VLLKEFQSPLHVAAYYLNPSIFY-SPTFLSSKVIQKGLLDCIEALEPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
GATTTCGTTCAGACGTTATTTGATCTTTCAGTTTTTCATTTTCAGCTCCATCCATCTGTTTCGGTTCTCAATTACTTGGCCGGATTCCGCTCCTTCTCCGGGATTTCGTC
CGATATTTCGGATTTAACATGGGGTAAGTCAGGTTATGAACATCCAATGGTGGAGTCTGAGGAGCTTCTCACGGATATGAAATTCAAGGAAAAACGAGTGAGTGGGTCTG
TTTGCAACTTTGCATTGGCAGGCATGCCTTGTTTTGTTAAAAAGGGTATGGTACCTCCACGGGCTTCTGATCCTGGTTGGGCTCATGGAATTATGGTCAATGGGGGTCGC
CAGAAGATTAAATGCAAATACTGTAATAAAGTTATGCTTGGGGGCGGCATATCCAGACTAAAGCAACATCTAGCTGGGGAAAGGGGAAATGTAGCTCCATGTGAGGAAGT
TCCAGAAGAAGTTAAGGTGCAGATTAAACAACTTTTAGGCTTTAAGTTTTCGGAGAAGCTGAAGCGGCAGAAGAAAGGTAGCAAAAATGCAGTATCATGCTTCCCAAGTA
GGGAGGAAATAGATGATGGGGTGCACCAGACGTTCACCAGTGGTGCTTGGATGCAGTCACACTTGTCGAAGTATGGGGCTGGACTTGAGGTGGCAAAGATCACTGCTGAT
CCACTCTTCTGGTCGAAGTGTGATCATATCACAATGGGAACGAAACCTTTACTTTCTGTGTTGCAGTTTCTTGAATCAGAGGAGAAACCATCTGTCGGGTTTATATATGA
TGCATTTGAAAAATCAAAGAACAGTGTCATGCTCGCTTTCAACCGGAAGGAATCTGTCTACTTGCCATATTTGAAAGCCATTAACCATGTTTTGCTGAAGGAATTTCAGA
GCCCTCTTCACGTGGCTGCATACTACCTAAATCCATCAATATTCTATAGTCCTACATTTTTATCCAGCAAAGTTATTCAAAAGGGTTTACTTGATTGCATCGAAGCCTTA
GAGCCAGATATAACATCCCAGGTTATGGTTACAAACAACATAAATTTCTATGAGGAAGCTGTTGGAGATTTTGGCCGGCCAGTGGCATTACATGGTCGAGATTTATTGGC
CCCAGTGTCCAAGCTAGCTAGGAGACTGGAGACTGGTAAAGCAAGGTGCTCAATAGATGCACTTGATACTGCTTGTTTGGAAGCCATTGATGCGAACATGGAAGATTGGG
TGGAGGATATTGAGGTATTGGAGGATGAGCACAAGAGGTGGGTGGATCTGAAGGTCACTAGTCAGGAGACCTTGGTGGAACATAAATTGTCCAGTATGGATAATTGTATT
GACAGCACAGATGAGAGAGGCAGTTAG
mRNA sequenceShow/hide mRNA sequence
GATTTCGTTCAGACGTTATTTGATCTTTCAGTTTTTCATTTTCAGCTCCATCCATCTGTTTCGGTTCTCAATTACTTGGCCGGATTCCGCTCCTTCTCCGGGATTTCGTC
CGATATTTCGGATTTAACATGGGGTAAGTCAGGTTATGAACATCCAATGGTGGAGTCTGAGGAGCTTCTCACGGATATGAAATTCAAGGAAAAACGAGTGAGTGGGTCTG
TTTGCAACTTTGCATTGGCAGGCATGCCTTGTTTTGTTAAAAAGGGTATGGTACCTCCACGGGCTTCTGATCCTGGTTGGGCTCATGGAATTATGGTCAATGGGGGTCGC
CAGAAGATTAAATGCAAATACTGTAATAAAGTTATGCTTGGGGGCGGCATATCCAGACTAAAGCAACATCTAGCTGGGGAAAGGGGAAATGTAGCTCCATGTGAGGAAGT
TCCAGAAGAAGTTAAGGTGCAGATTAAACAACTTTTAGGCTTTAAGTTTTCGGAGAAGCTGAAGCGGCAGAAGAAAGGTAGCAAAAATGCAGTATCATGCTTCCCAAGTA
GGGAGGAAATAGATGATGGGGTGCACCAGACGTTCACCAGTGGTGCTTGGATGCAGTCACACTTGTCGAAGTATGGGGCTGGACTTGAGGTGGCAAAGATCACTGCTGAT
CCACTCTTCTGGTCGAAGTGTGATCATATCACAATGGGAACGAAACCTTTACTTTCTGTGTTGCAGTTTCTTGAATCAGAGGAGAAACCATCTGTCGGGTTTATATATGA
TGCATTTGAAAAATCAAAGAACAGTGTCATGCTCGCTTTCAACCGGAAGGAATCTGTCTACTTGCCATATTTGAAAGCCATTAACCATGTTTTGCTGAAGGAATTTCAGA
GCCCTCTTCACGTGGCTGCATACTACCTAAATCCATCAATATTCTATAGTCCTACATTTTTATCCAGCAAAGTTATTCAAAAGGGTTTACTTGATTGCATCGAAGCCTTA
GAGCCAGATATAACATCCCAGGTTATGGTTACAAACAACATAAATTTCTATGAGGAAGCTGTTGGAGATTTTGGCCGGCCAGTGGCATTACATGGTCGAGATTTATTGGC
CCCAGTGTCCAAGCTAGCTAGGAGACTGGAGACTGGTAAAGCAAGGTGCTCAATAGATGCACTTGATACTGCTTGTTTGGAAGCCATTGATGCGAACATGGAAGATTGGG
TGGAGGATATTGAGGTATTGGAGGATGAGCACAAGAGGTGGGTGGATCTGAAGGTCACTAGTCAGGAGACCTTGGTGGAACATAAATTGTCCAGTATGGATAATTGTATT
GACAGCACAGATGAGAGAGGCAGTTAG
Protein sequenceShow/hide protein sequence
DFVQTLFDLSVFHFQLHPSVSVLNYLAGFRSFSGISSDISDLTWGKSGYEHPMVESEELLTDMKFKEKRVSGSVCNFALAGMPCFVKKGMVPPRASDPGWAHGIMVNGGR
QKIKCKYCNKVMLGGGISRLKQHLAGERGNVAPCEEVPEEVKVQIKQLLGFKFSEKLKRQKKGSKNAVSCFPSREEIDDGVHQTFTSGAWMQSHLSKYGAGLEVAKITAD
PLFWSKCDHITMGTKPLLSVLQFLESEEKPSVGFIYDAFEKSKNSVMLAFNRKESVYLPYLKAINHVLLKEFQSPLHVAAYYLNPSIFYSPTFLSSKVIQKGLLDCIEAL
EPDITSQVMVTNNINFYEEAVGDFGRPVALHGRDLLAPVSKLARRLETGKARCSIDALDTACLEAIDANMEDWVEDIEVLEDEHKRWVDLKVTSQETLVEHKLSSMDNCI
DSTDERGS