CuGenDBv2

Lag0019522 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019522
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr5:42991446..42999401
RNA-Seq Expression: Lag0019522
Synteny: Lag0019522
Gene Ontology terms: NA
InterPro domains: IPR039495 - TATA box-binding protein-associated factor RNA polymerase I subunit A-like
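
The genome location above is written as chromosome:start..end. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; this page does not state the convention), the interval can be parsed and its span computed as follows. The helper names `parse_location` and `span_length` are hypothetical and are not part of any database API.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr5:42991446..42999401")
print(chrom, start, end, span_length(start, end))
```

Under that assumption the gene region spans 7,956 bp on chr5. This is the genomic span, not the length of the coding sequence.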


Homology
GenBank top hits (e value, % identity, alignment)
KAG7031054.1 hypothetical protein SDJN02_05093 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.3e-209 | % identity: 84.3
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEG+TEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPI+SDR+I NSDGCSVSNS GDGASY+S+
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVM+ KL+HVDSE HT +SFE DH IKVE+  Q FE  DF  +S EKDENEASFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLNDYYKDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAALVEHFD  N +LLSTCYE+ILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK
        VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWREL MCFLK SQ EEDRVS ACSIG+ GHKL SSLN++ NLK  TEKNLRN WRLRCRWWLT HF   
Subjt:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK

Query:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        ITSET  G LEL TYKAACA HMYGSN+KYV E
Subjt:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE

XP_022142927.1 uncharacterized protein LOC111012919 [Momordica charantia] | e value: 2.7e-212 | % identity: 85.48
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEH+SVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQ DR+ISNS GCSVSNSHGDGA Y+SN
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVMNDKLVHVDSE H ++S EVD D+KVE+  QNFEA DF ++SAEK+ENEAS SDNGG+Q+YVSIFSALEGLDPLLLPLHLP SI+NWENAISLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLN YYKDAVKHL+LALNSNPPILVALLPLIQLLLIGGRVDKAL E+E IC DSNA LPFRLRAALVEHFDR NDVLLS+CYEQILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHL-DGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGH
        V MHRNGNY+LESLLEMI LHL DGTC EYD WRELA+CFLK SQ EEDRVSTACSIGT  H LMSS N++SNLK LTEK  RNTWRLRCRWW T+HF H
Subjt:  VHMHRNGNYSLESLLEMIALHL-DGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGH

Query:  KITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        KI SETL GNLEL TYKAACA HMYGSN+KYV E
Subjt:  KITSETLAGNLELWTYKAACASHMYGSNYKYVGE

XP_022941583.1 uncharacterized protein LOC111446895 [Cucurbita moschata] | e value: 2.6e-210 | % identity: 84.53
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEG+TEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPI+SDR+I NSDGCSVSNS GDGASY+S+
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVM+ KL+HVDSE HT +SFE DH IKVE+  Q FE  DF  +S EKDENEASFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLNDYYKDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAALVEHFD  N +LLSTCYE+ILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK
        VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK SQ EEDRVS ACSIG+ GHKL SSLN++ NLK  TEKNLRN WRLRCRWWLT HF   
Subjt:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK

Query:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        ITSET  G LEL TYKAACA HMYGSN+KYV E
Subjt:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE

XP_022979683.1 uncharacterized protein LOC111479331 [Cucurbita maxima] | e value: 3.7e-209 | % identity: 84.53
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEG+TEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPI+SDR+I NSDGCSVSNS GDGASY+S+
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVM+ KL+HVDSE HT++SFE DH IKVE+  Q FE  DF V+S EKDENEASFSDNGG+Q+ VSIFSALEGLDPLLLPLHLPPS+ENWENA+SLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLNDYYKDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENIC DSNA LPFRL+AALVEHFD  N VLLSTCYE+ILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK
        V MHRNGNYSLESLLEMIALHLDGT AEYDTWRELAMCFLK SQ EEDRVS ACSIG+ GHKL SSLN++ NLK  TEKNLRN WRLRCRWWLT HF   
Subjt:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK

Query:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        ITSET  G LEL TYKAACA HMYGSN+KYV E
Subjt:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE

XP_023536647.1 uncharacterized protein LOC111797853 [Cucurbita pepo subsp. pepo] | e value: 3.7e-209 | % identity: 83.83
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEG+TEDAHQAALCLMQEH+SVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSP++SDR+I NSDGCSVSNS GDGASY+S+
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVM+ KL+ VDSE HT++SFE DH IKVE+  Q FE  DF V+S EKDENE SFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLNDYYKDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAALVEHFD  N +LLSTCYE+ILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK
        VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK SQ EEDRVS ACSIG+ GHKL SSLN++ NLK  TEKNLRN WRLRCRWWLT HF   
Subjt:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK

Query:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        ITSET  G+LEL TYKAACA HMYGSN+KYV E
Subjt:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE
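
In the alignments above, the unlabeled row between Query and Subjt is the match line: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, the counts behind a percent-identity figure can be recovered from that row. The function name `midline_counts` is hypothetical, and the example uses only the final 33-column block of the last hit, so its ratio is not expected to equal the whole-alignment % identity reported by the search tool.

```python
def midline_counts(midline):
    """Return (identities, positives, columns) for one alignment match line."""
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, len(midline)

# Match line of the final block of the XP_023536647.1 alignment, copied from above.
midline = "ITSET  G+LEL TYKAACA HMYGSN+KYV E"
ident, pos, cols = midline_counts(midline)
print(f"identities {ident}/{cols}, positives {pos}/{cols}")
```

Summing these counts over every block of an alignment and dividing identities by the total number of columns gives a figure comparable to the % identity column, assuming the tool counts gap columns in the denominator.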

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KXN5 Uncharacterized protein | e value: 4.0e-193 | % identity: 83.95
Query:  MQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSNSETSVMNDKLVHVDSEEHTKSSFEVD---
        MQ  ESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPI SD +I NSDGCS SNSHGDGASY S +ETSVMN KLV VDSE HT++SF+VD   
Subjt:  MQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSNSETSVMNDKLVHVDSEEHTKSSFEVD---

Query:  HDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCSEFLNDYYKDAVKHLNLALNSNPPILV
        H+IKVES  QNFEAQDFCV SAEKDENEASFSDNGG+Q+YVSIFSALEGLDPLLLPLHLPPSIENWENAISLC EFLNDYYKDAVKHL+LALNSNPPILV
Subjt:  HDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCSEFLNDYYKDAVKHLNLALNSNPPILV

Query:  ALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTC
        ALLPLIQLLLIGGR+DKAL+EME  C DSNA LPFRLRAALVEHFDR N+VLLSTCYEQ LKKDPTCCHS+GKLV MHRNGNY+LESLLEMIALHLDGT 
Subjt:  ALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEMIALHLDGTC

Query:  AEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHKITSET-LAGNLELWTYKAACASHMYG
         EYDTWRELA+CFL+  Q EEDRVS ACSIGT GHKL+SSLN++SN+K LTEKN RNTWRLRCRWWLT+HFGHKIT ET + GNLEL TYKAAC  H+YG
Subjt:  AEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHKITSET-LAGNLELWTYKAACASHMYG

Query:  SNYKY
        +N+KY
Subjt:  SNYKY

A0A1S3BS63 uncharacterized protein LOC103492916 | e value: 1.9e-198 | % identity: 80.41
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMV +E+ILFCLEEG  EDAHQ  L LMQ  ES NDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQ  SPI SD +I NSDGCS+SNSHG GA   SN
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVD---HDIKVESQ--NFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAIS
        +E+SVMNDK+VHVD E HT++S +VD   H+IKVE+   NFEAQDFCV+SAEKDENEASFSDNGG+Q+YVSIFSALEGLDPLLLPL LPPSIENWENAIS
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVD---HDIKVESQ--NFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAIS

Query:  LCSEFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSL
        LC EFLNDYYKDAVKHL LALNSNPPILVALLPLIQLLLIGGR+DKAL+EME  C DSNA LPFRLRAALVEHFDR N+VLLSTCYEQ LKKDPTC HS+
Subjt:  LCSEFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSL

Query:  GKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHF
        GKLV MHRNGNY+LESLLEMIALHLDGT  EYDTWRELA+CFLK  Q EEDRVSTACSIGT GHKL+SSL ++SN+K LTEKN RNTWRLRCRWWLT+HF
Subjt:  GKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHF

Query:  GHKITSE-TLAGNLELWTYKAACASHMYGSNYKY
        GH+IT E ++ GNLEL TYKAAC  H+YG+N+KY
Subjt:  GHKITSE-TLAGNLELWTYKAACASHMYGSNYKY

A0A6J1CPA4 uncharacterized protein LOC111012919 | e value: 1.3e-212 | % identity: 85.48
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEH+SVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQ DR+ISNS GCSVSNSHGDGA Y+SN
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVMNDKLVHVDSE H ++S EVD D+KVE+  QNFEA DF ++SAEK+ENEAS SDNGG+Q+YVSIFSALEGLDPLLLPLHLP SI+NWENAISLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLN YYKDAVKHL+LALNSNPPILVALLPLIQLLLIGGRVDKAL E+E IC DSNA LPFRLRAALVEHFDR NDVLLS+CYEQILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHL-DGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGH
        V MHRNGNY+LESLLEMI LHL DGTC EYD WRELA+CFLK SQ EEDRVSTACSIGT  H LMSS N++SNLK LTEK  RNTWRLRCRWW T+HF H
Subjt:  VHMHRNGNYSLESLLEMIALHL-DGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGH

Query:  KITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        KI SETL GNLEL TYKAACA HMYGSN+KYV E
Subjt:  KITSETLAGNLELWTYKAACASHMYGSNYKYVGE

A0A6J1FSH8 uncharacterized protein LOC111446895 | e value: 1.2e-210 | % identity: 84.53
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEG+TEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPI+SDR+I NSDGCSVSNS GDGASY+S+
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVM+ KL+HVDSE HT +SFE DH IKVE+  Q FE  DF  +S EKDENEASFSDNG +Q+ VSIFSALEGLDPLLLPLHLP S+ENWENA+SLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLNDYYKDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENICRDSNA LPFRLRAALVEHFD  N +LLSTCYE+ILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK
        VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK SQ EEDRVS ACSIG+ GHKL SSLN++ NLK  TEKNLRN WRLRCRWWLT HF   
Subjt:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK

Query:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        ITSET  G LEL TYKAACA HMYGSN+KYV E
Subjt:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE

A0A6J1IWZ8 uncharacterized protein LOC111479331 | e value: 1.8e-209 | % identity: 84.53
Query:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN
        DRFMVHVEFILFCLEEG+TEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFST+PEEIQWRDSLQ+HSPI+SDR+I NSDGCSVSNS GDGASY+S+
Subjt:  DRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGASYRSN

Query:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS
        SETSVM+ KL+HVDSE HT++SFE DH IKVE+  Q FE  DF V+S EKDENEASFSDNGG+Q+ VSIFSALEGLDPLLLPLHLPPS+ENWENA+SLC 
Subjt:  SETSVMNDKLVHVDSEEHTKSSFEVDHDIKVES--QNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCS

Query:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL
        EFLNDYYKDAVKHL LALNSNPPILVALLP IQLLLIGGRVDKAL+EMENIC DSNA LPFRL+AALVEHFD  N VLLSTCYE+ILKKDPTCCHSLGKL
Subjt:  EFLNDYYKDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKL

Query:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK
        V MHRNGNYSLESLLEMIALHLDGT AEYDTWRELAMCFLK SQ EEDRVS ACSIG+ GHKL SSLN++ NLK  TEKNLRN WRLRCRWWLT HF   
Subjt:  VHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHK

Query:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE
        ITSET  G LEL TYKAACA HMYGSN+KYV E
Subjt:  ITSETLAGNLELWTYKAACASHMYGSNYKYVGE

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G53200.1 unknown protein | e value: 3.4e-59 | % identity: 35.96
Query:  KLDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRD-----SLQFHSPIQSDRLISNSDGCSVSNSHGD
        K +R +V  E I   +E     +A+   + LMQ  +    P +N+ IG++F ++W +   +E+Q  D     S+   S   S  L+  S       S   
Subjt:  KLDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRD-----SLQFHSPIQSDRLISNSDGCSVSNSHGD

Query:  GASYRSNSETSVM-NDKLVHVDSEEHTKSSFEVDHDIKVESQNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWEN
          S R +SETSVM N K+ H+   +   S   +D  +KV S  +      +  A  +ENEAS  D G  ++  ++ + L  +DP LLP   P   + +  
Subjt:  GASYRSNSETSVM-NDKLVHVDSEEHTKSSFEVDHDIKVESQNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWEN

Query:  AISLCSEFLNDYYKDAVKHLNLALNSNPPI-LVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTC
         ++      + YYK+AVK++   L S P + L AL PL+Q+LLIGG VD+A+  +E +C   + V PFR++A ++E F R +D +L+ CYE ILK DP C
Subjt:  AISLCSEFLNDYYKDAVKHLNLALNSNPPI-LVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTC

Query:  CHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK-FSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNT-WRLRCRW
          +L KL+ M     YS ESL EMIALH++ +  E + W+ELA CF   F   +EDR+S  C  G+E  +   + +V  N    T K    T W LR +W
Subjt:  CHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK-FSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNT-WRLRCRW

Query:  WLTQHFG-----HKITSETLAGNLELWTYKAACASHMYGSNYKYV
        WL +HF       +I + TL G+ E+ TYKAACAS++YG  + YV
Subjt:  WLTQHFG-----HKITSETLAGNLELWTYKAACASHMYGSNYKYV

AT1G53200.2 unknown protein | e value: 1.2e-59 | % identity: 35.96
Query:  KLDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRD-----SLQFHSPIQSDRLISNSDGCSVSNSHGD
        +L+R +V  E I   +E     +A+   + LMQ  +    P +N+ IG++F ++W +   +E+Q  D     S+   S   S  L+  S       S   
Subjt:  KLDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRD-----SLQFHSPIQSDRLISNSDGCSVSNSHGD

Query:  GASYRSNSETSVM-NDKLVHVDSEEHTKSSFEVDHDIKVESQNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWEN
          S R +SETSVM N K+ H+   +   S   +D  +KV S  +      +  A  +ENEAS  D G  ++  ++ + L  +DP LLP   P   + +  
Subjt:  GASYRSNSETSVM-NDKLVHVDSEEHTKSSFEVDHDIKVESQNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWEN

Query:  AISLCSEFLNDYYKDAVKHLNLALNSNPPI-LVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTC
         ++      + YYK+AVK++   L S P + L AL PL+Q+LLIGG VD+A+  +E +C   + V PFR++A ++E F R +D +L+ CYE ILK DP C
Subjt:  AISLCSEFLNDYYKDAVKHLNLALNSNPPI-LVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTC

Query:  CHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK-FSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNT-WRLRCRW
          +L KL+ M     YS ESL EMIALH++ +  E + W+ELA CF   F   +EDR+S  C  G+E  +   + +V  N    T K    T W LR +W
Subjt:  CHSLGKLVHMHRNGNYSLESLLEMIALHLDGTCAEYDTWRELAMCFLK-FSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNT-WRLRCRW

Query:  WLTQHFG-----HKITSETLAGNLELWTYKAACASHMYGSNYKYV
        WL +HF       +I + TL G+ E+ TYKAACAS++YG  + YV
Subjt:  WLTQHFG-----HKITSETLAGNLELWTYKAACASHMYGSNYKYV


Sequences
CDS sequence
ATGAACCTTGGCTCTATCCTTGAGAGGATAGCTGGCGAGAAATTGGACAGATTTATGGTCCATGTGGAATTTATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGC
ACATCAGGCTGCTTTATGCCTCATGCAAGAGCATGAATCTGTGAATGATCCAATGTCAAATATGATTATAGGATTGACATTTCGGCAGCTGTGGTTCTCTACCATTCCGG
AAGAGATTCAGTGGAGGGACTCTCTCCAGTTTCACTCCCCAATTCAATCAGATAGGTTGATTTCAAACTCAGATGGGTGTTCAGTCAGCAACTCTCATGGAGATGGTGCC
TCATATCGCAGTAATTCAGAGACTTCTGTCATGAATGATAAATTAGTTCATGTTGATAGCGAGGAGCACACAAAATCTTCTTTTGAGGTCGATCATGACATAAAAGTGGA
AAGTCAAAACTTTGAGGCACAAGATTTTTGTGTGAATTCTGCAGAAAAAGATGAAAATGAAGCCTCTTTCTCAGATAATGGAGGTCATCAGTACTATGTTTCAATTTTTT
CTGCTCTTGAGGGTTTGGATCCACTATTGTTGCCTCTACATTTACCACCTTCCATTGAGAATTGGGAGAATGCCATTAGTTTATGCAGCGAGTTTCTGAATGACTATTAT
AAGGATGCAGTGAAACACCTAAACCTTGCTCTTAACTCAAATCCCCCAATATTGGTTGCCTTACTTCCTCTTATACAGTTGTTGTTGATTGGAGGTCGAGTTGACAAGGC
ACTCAATGAAATGGAAAATATATGTCGTGATTCAAATGCAGTACTTCCCTTCAGATTGAGGGCTGCACTTGTGGAACATTTTGATCGTGGTAATGATGTCTTGCTTTCAA
CTTGTTATGAGCAAATATTGAAGAAGGATCCAACCTGTTGTCATTCACTGGGAAAACTTGTTCACATGCATAGAAACGGCAATTACAGTCTTGAATCTCTATTGGAAATG
ATAGCTTTGCATTTAGATGGTACATGTGCAGAATATGATACATGGAGAGAGTTGGCTATGTGTTTTCTGAAATTTTCTCAATTTGAAGAGGATAGAGTATCAACAGCATG
TTCAATTGGGACTGAAGGGCATAAGCTGATGTCCTCATTGAATGTTAGCAGTAACCTTAAGTTTTTGACTGAAAAGAACTTGAGAAACACATGGAGATTGCGTTGTCGAT
GGTGGTTGACGCAGCATTTCGGTCATAAAATAACATCAGAAACTTTGGCTGGTAATTTGGAGCTTTGGACTTACAAAGCAGCATGCGCAAGTCATATGTATGGAAGCAAC
TACAAATATGTGGGAGAGGAGAGAGGATCAGATCAACTTGGTCCTTGCATTTAG
mRNA sequence
ATGAACCTTGGCTCTATCCTTGAGAGGATAGCTGGCGAGAAATTGGACAGATTTATGGTCCATGTGGAATTTATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGC
ACATCAGGCTGCTTTATGCCTCATGCAAGAGCATGAATCTGTGAATGATCCAATGTCAAATATGATTATAGGATTGACATTTCGGCAGCTGTGGTTCTCTACCATTCCGG
AAGAGATTCAGTGGAGGGACTCTCTCCAGTTTCACTCCCCAATTCAATCAGATAGGTTGATTTCAAACTCAGATGGGTGTTCAGTCAGCAACTCTCATGGAGATGGTGCC
TCATATCGCAGTAATTCAGAGACTTCTGTCATGAATGATAAATTAGTTCATGTTGATAGCGAGGAGCACACAAAATCTTCTTTTGAGGTCGATCATGACATAAAAGTGGA
AAGTCAAAACTTTGAGGCACAAGATTTTTGTGTGAATTCTGCAGAAAAAGATGAAAATGAAGCCTCTTTCTCAGATAATGGAGGTCATCAGTACTATGTTTCAATTTTTT
CTGCTCTTGAGGGTTTGGATCCACTATTGTTGCCTCTACATTTACCACCTTCCATTGAGAATTGGGAGAATGCCATTAGTTTATGCAGCGAGTTTCTGAATGACTATTAT
AAGGATGCAGTGAAACACCTAAACCTTGCTCTTAACTCAAATCCCCCAATATTGGTTGCCTTACTTCCTCTTATACAGTTGTTGTTGATTGGAGGTCGAGTTGACAAGGC
ACTCAATGAAATGGAAAATATATGTCGTGATTCAAATGCAGTACTTCCCTTCAGATTGAGGGCTGCACTTGTGGAACATTTTGATCGTGGTAATGATGTCTTGCTTTCAA
CTTGTTATGAGCAAATATTGAAGAAGGATCCAACCTGTTGTCATTCACTGGGAAAACTTGTTCACATGCATAGAAACGGCAATTACAGTCTTGAATCTCTATTGGAAATG
ATAGCTTTGCATTTAGATGGTACATGTGCAGAATATGATACATGGAGAGAGTTGGCTATGTGTTTTCTGAAATTTTCTCAATTTGAAGAGGATAGAGTATCAACAGCATG
TTCAATTGGGACTGAAGGGCATAAGCTGATGTCCTCATTGAATGTTAGCAGTAACCTTAAGTTTTTGACTGAAAAGAACTTGAGAAACACATGGAGATTGCGTTGTCGAT
GGTGGTTGACGCAGCATTTCGGTCATAAAATAACATCAGAAACTTTGGCTGGTAATTTGGAGCTTTGGACTTACAAAGCAGCATGCGCAAGTCATATGTATGGAAGCAAC
TACAAATATGTGGGAGAGGAGAGAGGATCAGATCAACTTGGTCCTTGCATTTAG
Protein sequence
MNLGSILERIAGEKLDRFMVHVEFILFCLEEGNTEDAHQAALCLMQEHESVNDPMSNMIIGLTFRQLWFSTIPEEIQWRDSLQFHSPIQSDRLISNSDGCSVSNSHGDGA
SYRSNSETSVMNDKLVHVDSEEHTKSSFEVDHDIKVESQNFEAQDFCVNSAEKDENEASFSDNGGHQYYVSIFSALEGLDPLLLPLHLPPSIENWENAISLCSEFLNDYY
KDAVKHLNLALNSNPPILVALLPLIQLLLIGGRVDKALNEMENICRDSNAVLPFRLRAALVEHFDRGNDVLLSTCYEQILKKDPTCCHSLGKLVHMHRNGNYSLESLLEM
IALHLDGTCAEYDTWRELAMCFLKFSQFEEDRVSTACSIGTEGHKLMSSLNVSSNLKFLTEKNLRNTWRLRCRWWLTQHFGHKITSETLAGNLELWTYKAACASHMYGSN
YKYVGEERGSDQLGPCI
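
The CDS and protein sequences listed above can be cross-checked by translating the CDS. The sketch below assumes the standard genetic code (an assumption; the page does not name a translation table) and a hypothetical helper `translate`. For brevity it uses only the first 27 and last 15 nucleotides copied from the CDS above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":          # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGAACCTTGGCTCTATCCTTGAGAGG"   # first 9 codons of the CDS
cds_end = "GGTCCTTGCATTTAG"                 # last 5 codons, including the stop
print(translate(cds_start), translate(cds_end))
```

The two fragments translate to MNLGSILER and GPCI, which match the first nine and last four residues of the protein sequence above. Passing the full CDS, with line breaks removed, to the same function would extend the check to the whole sequence.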