; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009074 (gene) of Snake gourd v1 genome

Gene IDTan0009074
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF4050 domain-containing protein
Genome locationLG11:7459777..7462116
RNA-Seq ExpressionTan0009074
SyntenyTan0009074
Gene Ontology termsNA
InterPro domainsIPR025124 - Domain of unknown function DUF4050


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022134091.1 uncharacterized protein LOC111006447 [Momordica charantia]2.9e-4387.27Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        +EM+K SS+SKEK   G  SSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWI+KQSQQQQRMERESIISWSTAYEDLLSTNEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW D+GLFD
Subjt:  DIWQDEGLFD

XP_022957458.1 uncharacterized protein LOC111458848 isoform X1 [Cucurbita moschata]2.5e-4286.36Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

XP_022957466.1 uncharacterized protein LOC111458848 isoform X2 [Cucurbita moschata]2.5e-4286.36Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

XP_022957473.1 uncharacterized protein LOC111458848 isoform X3 [Cucurbita moschata]2.5e-4286.36Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

XP_038886790.1 uncharacterized protein LOC120076904 [Benincasa hispida]6.2e-4690Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNK SSHSKE LTVGRSSSSSEVKKP EKDLSSSTF+NQAAIRWHE RKKW+DK SQQQQRMERESIISWSTAYEDLLST+EPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

TrEMBL top hitse value%identityAlignment
A0A0A0LK63 Uncharacterized protein2.0e-4589.09Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNK SSHSKEK T+GRSSSSSEVKKPAEKDLSS TFVNQAAI WHESRKKW+DK SQQQQRMERES+ISWSTAYEDLLSTN+PF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

A0A6J1BWZ9 uncharacterized protein LOC1110064471.4e-4387.27Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        +EM+K SS+SKEK   G  SSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWI+KQSQQQQRMERESIISWSTAYEDLLSTNEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW D+GLFD
Subjt:  DIWQDEGLFD

A0A6J1GZA2 uncharacterized protein LOC111458848 isoform X21.2e-4286.36Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

A0A6J1H0B4 uncharacterized protein LOC111458848 isoform X31.2e-4286.36Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

A0A6J1JTG3 uncharacterized protein LOC1114873411.2e-4286.36Show/hide
Query:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV
        MEMNKGSSHSKEK ++GRSS SSEVKKP+EKDLSSSTFVNQAAI WHESRKKWIDKQ   QQRME+ES+ISWSTAYEDLLS+NEPF+EPIPL EMVDFLV
Subjt:  MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLV

Query:  DIWQDEGLFD
        DIW DEGLFD
Subjt:  DIWQDEGLFD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G54880.1 unknown protein1.1e-2151.89Show/hide
Query:  SSHSKEKLTVGR--SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQ
        S+ +K  L + +      S VK  +E  L   T VN  A  W E+R+KW+  QS+Q++   ++ IISWST YEDLLST+EPF+E IPL EMVDFLVDIW 
Subjt:  SSHSKEKLTVGR--SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQ

Query:  DEGLFD
        DEGL+D
Subjt:  DEGLFD

AT5G03440.1 unknown protein1.0e-1747.83Show/hide
Query:  SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD
        SS+SS  K+ + +++    FVN A I W E RKKW+   S +   M  E +I ++  YEDLL++N PF +PIPL EMVDFL DIW  +GLF+
Subjt:  SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD

AT5G03440.2 unknown protein1.0e-1747.83Show/hide
Query:  SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD
        SS+SS  K+ + +++    FVN A I W E RKKW+   S +   M  E +I ++  YEDLL++N PF +PIPL EMVDFL DIW  +GLF+
Subjt:  SSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD

AT5G25360.1 unknown protein6.1e-1537.17Show/hide
Query:  EMNKGSSHSKEKLT----VGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVD
        EM+  +  S+  ++       +S+S+    P E       FVN     W+++R++W+   + Q++   RE  ISW+  YE LL  N+ F+ PIPL EMVD
Subjt:  EMNKGSSHSKEKLT----VGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVD

Query:  FLVDIWQDEGLFD
        FLVD+W+ EGL+D
Subjt:  FLVDIWQDEGLFD

AT5G25360.2 unknown protein6.1e-1537.17Show/hide
Query:  EMNKGSSHSKEKLT----VGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVD
        EM+  +  S+  ++       +S+S+    P E       FVN     W+++R++W+   + Q++   RE  ISW+  YE LL  N+ F+ PIPL EMVD
Subjt:  EMNKGSSHSKEKLT----VGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVD

Query:  FLVDIWQDEGLFD
        FLVD+W+ EGL+D
Subjt:  FLVDIWQDEGLFD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAATGAATAAAGGAAGCTCCCATTCAAAAGAGAAACTGACTGTGGGTCGTTCTTCATCTTCCAGTGAAGTGAAAAAGCCTGCAGAAAAAGATTTGAGCTCCTCCAC
ATTTGTTAATCAGGCTGCAATTCGTTGGCATGAGAGTAGAAAGAAGTGGATTGATAAACAATCTCAGCAACAACAAAGAATGGAGAGGGAATCAATCATAAGCTGGTCAA
CAGCATACGAAGATCTGCTCTCCACCAACGAGCCCTTCACTGAGCCAATACCTCTCACAGAGATGGTAGATTTCTTGGTTGATATTTGGCAAGATGAAGGGCTTTTCGAT
TAG
mRNA sequenceShow/hide mRNA sequence
ATGGAAATGAATAAAGGAAGCTCCCATTCAAAAGAGAAACTGACTGTGGGTCGTTCTTCATCTTCCAGTGAAGTGAAAAAGCCTGCAGAAAAAGATTTGAGCTCCTCCAC
ATTTGTTAATCAGGCTGCAATTCGTTGGCATGAGAGTAGAAAGAAGTGGATTGATAAACAATCTCAGCAACAACAAAGAATGGAGAGGGAATCAATCATAAGCTGGTCAA
CAGCATACGAAGATCTGCTCTCCACCAACGAGCCCTTCACTGAGCCAATACCTCTCACAGAGATGGTAGATTTCTTGGTTGATATTTGGCAAGATGAAGGGCTTTTCGAT
TAG
Protein sequenceShow/hide protein sequence
MEMNKGSSHSKEKLTVGRSSSSSEVKKPAEKDLSSSTFVNQAAIRWHESRKKWIDKQSQQQQRMERESIISWSTAYEDLLSTNEPFTEPIPLTEMVDFLVDIWQDEGLFD