CuGenDBv2

Lag0003098 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0003098
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr4:48040208..48047513
RNA-Seq Expression: Lag0003098
Synteny: Lag0003098
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology

GenBank top hits | e value | % identity
XP_008462359.1 PREDICTED: uncharacterized protein LOC103500733 isoform X2 [Cucumis melo] | 2.7e-70 | 84.7
Query:  MSEEPI-KLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES
        MSEE + KLYA KP KAQ+KQFQEQHK  DASSS     ASS+M S S+SSP  PQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEE+
Subjt:  MSEEPI-KLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES

Query:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        APDSAKT KIAA VVEE LA+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILKEFIRAKS PN+
Subjt:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

XP_022954191.1 uncharacterized protein LOC111456527 isoform X1 [Cucurbita moschata] | 1.2e-70 | 85.16
Query:  EEPIKLYAKKPKKAQVKQFQEQHKVRDA--SSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA
        EEP KLYA KPKKAQVKQFQEQHKV  A  SSSPA PA+S+   + S+SS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEE+A
Subjt:  EEPIKLYAKKPKKAQVKQFQEQHKVRDA--SSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA

Query:  PDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        PDSAKTAKIAA VVEE  A+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILKEFIRAKS PN+
Subjt:  PDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

XP_022992414.1 uncharacterized protein LOC111488728 isoform X1 [Cucurbita maxima] | 3.5e-70 | 85.25
Query:  EEPIKLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSN---MGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES
        EEP KLYA KPKKAQVKQFQEQHKV  ASSS  AP ASSN     S S+SS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEE+
Subjt:  EEPIKLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSN---MGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES

Query:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        APDSAKTAKIAA VVEE  A+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKD EEKKRIDEEKAILKEFIRAKS PN+
Subjt:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

XP_023548106.1 uncharacterized protein LOC111806841 isoform X1 [Cucurbita pepo subsp. pepo] | 1.2e-70 | 85.16
Query:  EEPIKLYAKKPKKAQVKQFQEQHKVRDA--SSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA
        EEP KLYA KPKKAQVKQFQEQHKV  A  SSSPA PA+S+   + S+SS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEE+A
Subjt:  EEPIKLYAKKPKKAQVKQFQEQHKVRDA--SSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA

Query:  PDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        PDSAKTAKIAA VVEE  A+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILKEFIRAKS PN+
Subjt:  PDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

XP_038898399.1 uncharacterized protein LOC120086050 [Benincasa hispida] | 3.0e-69 | 84.32
Query:  EEPIKLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASS-----PSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEE
        EEP KLYA KPKKAQVKQFQEQHKVRDASSSP AP ASSNM S SAS+     PS PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVA+E
Subjt:  EEPIKLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASS-----PSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEE

Query:  ESAPDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        ++APDSA   KIA  VVEE    PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILK+FIRAKS PN+
Subjt:  ESAPDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

TrEMBL top hits | e value | % identity
A0A1S3CGT4 uncharacterized protein LOC103500733 isoform X2 | 1.3e-70 | 84.7
Query:  MSEEPI-KLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES
        MSEE + KLYA KP KAQ+KQFQEQHK  DASSS     ASS+M S S+SSP  PQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEE+
Subjt:  MSEEPI-KLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES

Query:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        APDSAKT KIAA VVEE LA+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILKEFIRAKS PN+
Subjt:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

A0A1S3CIB4 uncharacterized protein LOC103500733 isoform X1 | 3.2e-69 | 84.24
Query:  MSEEPI-KLYAKKP-KKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEE
        MSEE + KLYA KP K AQ+KQFQEQHK  DASSS     ASS+M S S+SSP  PQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEE
Subjt:  MSEEPI-KLYAKKP-KKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEE

Query:  SAPDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        +APDSAKT KIAA VVEE LA+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILKEFIRAKS PN+
Subjt:  SAPDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

A0A6J1GQ86 uncharacterized protein LOC111456527 isoform X1 | 5.9e-71 | 85.16
Query:  EEPIKLYAKKPKKAQVKQFQEQHKVRDA--SSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA
        EEP KLYA KPKKAQVKQFQEQHKV  A  SSSPA PA+S+   + S+SS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEE+A
Subjt:  EEPIKLYAKKPKKAQVKQFQEQHKVRDA--SSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA

Query:  PDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        PDSAKTAKIAA VVEE  A+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILKEFIRAKS PN+
Subjt:  PDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

A0A6J1JXH4 uncharacterized protein LOC111488728 isoform X1 | 1.7e-70 | 85.25
Query:  EEPIKLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSN---MGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES
        EEP KLYA KPKKAQVKQFQEQHKV  ASSS  AP ASSN     S S+SS S PQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDE V EEE+
Subjt:  EEPIKLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSN---MGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES

Query:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        APDSAKTAKIAA VVEE  A+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKD EEKKRIDEEKAILKEFIRAKS PN+
Subjt:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

E5GCA6 Uncharacterized protein | 1.3e-70 | 84.7
Query:  MSEEPI-KLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES
        MSEE + KLYA KP KAQ+KQFQEQHK  DASSS     ASS+M S S+SSP  PQPPKESFARRYKFLWPMLLTVNLAVGAY+FMRTKKQDEHVAEEE+
Subjt:  MSEEPI-KLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEES

Query:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
        APDSAKT KIAA VVEE LA+PAIVEPVKVREPIPVDQQRELFKWILEEKRK+KPKDREEKKRIDEEKAILKEFIRAKS PN+
Subjt:  APDSAKTAKIAATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G55160.1 unknown protein | 1.3e-38 | 56.77
Query:  MSEEPIKLYAKKPKK----AQVK----QFQEQHKVRDASSSPAAPAASS-NMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD
        MSEE  KL+  KPKK    AQ+K     F        +  SPAA AA+S  MG GS      P PPKESFARRYK++WP+LLTVNLAVG YLF RTKK+D
Subjt:  MSEEPIKLYAKKPKK----AQVK----QFQEQHKVRDASSSPAAPAASS-NMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQD

Query:  EHVAEEESAPDSAKTAKIAATV-VEEPLARPAIVEPV--KVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFP
             EE+A   AK++ +AA V VE+ L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EEKKR DEEKAILK+FI +K+ P
Subjt:  EHVAEEESAPDSAKTAKIAATV-VEEPLARPAIVEPV--KVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFP

AT1G55160.2 unknown protein | 2.3e-35 | 62.41
Query:  MGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESAPDSAKTAKIAATV-VEEPLARPAIVEPV--KVREPIPVDQQR
        MG GS      P PPKESFARRYK++WP+LLTVNLAVG YLF RTKK+D     EE+A   AK++ +AA V VE+ L+   + EPV  K REPIP  QQR
Subjt:  MGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESAPDSAKTAKIAATV-VEEPLARPAIVEPV--KVREPIPVDQQR

Query:  ELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFP
        ELFKW+LEEKRK+ PK+ EEKKR DEEKAILK+FI +K+ P
Subjt:  ELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFP

AT1G55160.3 unknown protein | 1.5e-34 | 50.23
Query:  MSEEPIKLYAKKPKK----AQVK----QFQEQHKVRDASSSPAAPAASS-NMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVG-----------
        MSEE  KL+  KPKK    AQ+K     F        +  SPAA AA+S  MG GS      P PPKESFARRYK++WP+LLTVNLAVG           
Subjt:  MSEEPIKLYAKKPKK----AQVK----QFQEQHKVRDASSSPAAPAASS-NMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVG-----------

Query:  --------------AYLFMRTKKQDEHVAEEESAPDSAKTAKIAATV-VEEPLARPAIVEPV--KVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRI
                      +YLF RTKK+D     EE+A   AK++ +AA V VE+ L+   + EPV  K REPIP  QQRELFKW+LEEKRK+ PK+ EEKKR 
Subjt:  --------------AYLFMRTKKQDEHVAEEESAPDSAKTAKIAATV-VEEPLARPAIVEPV--KVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRI

Query:  DEEKAILKEFIRAKSFP
        DEEKAILK+FI +K+ P
Subjt:  DEEKAILKEFIRAKSFP

AT2G19530.1 unknown protein | 3.7e-17 | 36.46
Query:  SSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA---------------------------PDSAKTAKIAATVVEEPLARP
        SSPS  +PP++  ++  K  W   +  NL   AY+F   +++D    E++                              D AK A+ A    EE    P
Subjt:  SSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESA---------------------------PDSAKTAKIAATVVEEPLARP

Query:  AI-------------------VEPVKV-REPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
         +                    E VKV R+PIP D+Q+ELFKWILEEKRK++PKDR+EKK+IDEEKAILK+FIRA+  P +
Subjt:  AI-------------------VEPVKV-REPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
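
The % identity values listed with each hit are reported by the database. As a rough illustration of how such a figure relates to the match line printed between the Query and Subjt rows, here is a minimal sketch. It assumes the usual BLAST convention for that line (a letter marks an identical residue, "+" a conservative substitution, a space a mismatch or gap). The function name `midline_stats` is hypothetical, and the example uses only the last 12 columns of the first GenBank alignment, so its result is not the full-alignment identity.

```python
def midline_stats(query, midline):
    """Count identities and positives from a BLAST-style match line.

    Assumes query and midline are the same length and column-aligned.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A letter in the match line marks a column where both residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    positives = identities + midline.count("+")
    return identities, positives, len(query)


# Last 12 alignment columns of the XP_008462359.1 hit (query row and match line).
query_tail = "KEFIRAKSFPNV"
match_tail = "KEFIRAKS PN+"

ident, pos, length = midline_stats(query_tail, match_tail)
print(ident, pos, length)                 # 10 11 12
print(round(100 * ident / length, 1))     # 83.3
```

Applied to a whole alignment, the same ratio (identities divided by alignment length, gaps included) gives the kind of percentage shown in the hit lists, provided the rows are column-aligned.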


Sequences

CDS sequence
ATGAGCGAAGAACCTATTAAGCTCTACGCCAAAAAACCCAAGAAAGCCCAGGTTAAACAATTTCAAGAGCAGCACAAAGTCAGAGACGCTTCTTCTTCACCGGCGGCACC
AGCAGCATCGTCGAACATGGGATCAGGATCTGCGTCTTCGCCTTCAGCGCCGCAGCCTCCGAAGGAATCGTTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGA
CTGTGAACCTTGCTGTTGGAGCTTATCTGTTTATGAGGACAAAGAAGCAAGATGAACATGTAGCTGAAGAAGAGTCTGCCCCGGATTCAGCTAAAACCGCCAAGATTGCT
GCTACTGTTGTTGAGGAACCATTGGCTAGACCAGCTATTGTGGAGCCTGTGAAGGTAAGAGAACCAATTCCAGTGGATCAGCAGCGTGAACTTTTCAAGTGGATTTTGGA
AGAAAAGCGCAAGATGAAGCCAAAAGATCGTGAAGAGAAGAAGCGCATTGACGAAGAGAAAGCAATTCTCAAGGAGTTCATCCGAGCAAAATCTTTTCCAAATGTGTAA

mRNA sequence
ATGAGCGAAGAACCTATTAAGCTCTACGCCAAAAAACCCAAGAAAGCCCAGGTTAAACAATTTCAAGAGCAGCACAAAGTCAGAGACGCTTCTTCTTCACCGGCGGCACC
AGCAGCATCGTCGAACATGGGATCAGGATCTGCGTCTTCGCCTTCAGCGCCGCAGCCTCCGAAGGAATCGTTTGCAAGGCGATATAAGTTCTTATGGCCCATGCTTTTGA
CTGTGAACCTTGCTGTTGGAGCTTATCTGTTTATGAGGACAAAGAAGCAAGATGAACATGTAGCTGAAGAAGAGTCTGCCCCGGATTCAGCTAAAACCGCCAAGATTGCT
GCTACTGTTGTTGAGGAACCATTGGCTAGACCAGCTATTGTGGAGCCTGTGAAGGTAAGAGAACCAATTCCAGTGGATCAGCAGCGTGAACTTTTCAAGTGGATTTTGGA
AGAAAAGCGCAAGATGAAGCCAAAAGATCGTGAAGAGAAGAAGCGCATTGACGAAGAGAAAGCAATTCTCAAGGAGTTCATCCGAGCAAAATCTTTTCCAAATGTGTAA

Protein sequence
MSEEPIKLYAKKPKKAQVKQFQEQHKVRDASSSPAAPAASSNMGSGSASSPSAPQPPKESFARRYKFLWPMLLTVNLAVGAYLFMRTKKQDEHVAEEESAPDSAKTAKIA
ATVVEEPLARPAIVEPVKVREPIPVDQQRELFKWILEEKRKMKPKDREEKKRIDEEKAILKEFIRAKSFPNV
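
As a quick consistency check between the CDS and protein records, the following is a minimal sketch of translating a coding sequence with the standard genetic code (NCBI translation table 1). The function name `translate` and the compact codon-table encoding are illustrative choices, not something CuGenDB provides. Only the first 30 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with bases ordered T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip(("".join(codon) for codon in product(BASES, repeat=3)), AMINO_ACIDS)
)


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        # Codons with ambiguous bases (e.g. N) are reported as 'X'.
        residue = CODON_TABLE.get(cds[i:i + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nucleotides of the CDS listed above.
print(translate("ATGAGCGAAGAACCTATTAAGCTCTACGCC"))  # MSEEPIKLYA
```

The printed residues match the start of the protein sequence (MSEEPIKLYA...). Joining the five CDS lines into one string and passing it to `translate` would allow the same comparison over the full length.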