; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012449 (gene) of Snake gourd v1 genome

Gene IDTan0012449
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionDUF3143 domain-containing protein
Genome locationLG01:1542203..1544232
RNA-Seq ExpressionTan0012449
SyntenyTan0012449
Gene Ontology termsNA
InterPro domainsIPR021489 - Protein of unknown function DUF3143


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576682.1 hypothetical protein SDJN03_24256, partial [Cucurbita argyrosperma subsp. sororia]1.5e-8092.07Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAFQSLC NV+PSP SN PE+SSIWRPISTK  QL NPISQS RKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPC+EAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QS EDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

XP_008439322.1 PREDICTED: uncharacterized protein LOC103484146 [Cucumis melo]1.1e-7890.24Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAF SLCNNV+PSPTSN  E+SSIWRPISTK  QL NPISQ  RK+ +IVSSKSSEAEELS+PEDEWLNKLPEKKKPLYSHSLPCVEAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

XP_022922878.1 uncharacterized protein LOC111430721 [Cucurbita moschata]1.5e-8092.07Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAFQSLC NV+PSP SN PE+SSIWRPISTK  QL NPISQS RKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPC+EAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QS EDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

XP_022984153.1 uncharacterized protein LOC111482568 [Cucurbita maxima]1.3e-7991.46Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAFQSLC NV+PSP SN PE+SSIWRPISTK  QL NPISQS RKSLSIVSSKSSEAEELSA EDEWLNKLPEKKKPLYSHSLPC+EAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QS EDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

XP_038880287.1 uncharacterized protein LOC120071904 [Benincasa hispida]7.5e-8091.46Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAFQSLCNNV+PSPTSN  E+SSIWRPISTK  QL NPISQS RK+ +IVSSKSSEAEELS PEDEWLNKLPEKKKPLYSHSLPCVEAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

TrEMBL top hitse value%identityAlignment
A0A1S3AY50 uncharacterized protein LOC1034841465.3e-7990.24Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAF SLCNNV+PSPTSN  E+SSIWRPISTK  QL NPISQ  RK+ +IVSSKSSEAEELS+PEDEWLNKLPEKKKPLYSHSLPCVEAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

A0A5D3DHV1 Uncharacterized protein5.3e-7990.24Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAF SLCNNV+PSPTSN  E+SSIWRPISTK  QL NPISQ  RK+ +IVSSKSSEAEELS+PEDEWLNKLPEKKKPLYSHSLPCVEAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QSKEDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

A0A6J1CHI2 uncharacterized protein LOC1110116931.4e-7687.2Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSI FAF SLCNNV+PS  SN+PE+S IWR IS K RQ  NPIS S RKS ++VSS+SSEAEELSAPEDEWL+KLPEKKKPLYSHSLPC+EAWLKSLGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

A0A6J1E806 uncharacterized protein LOC1114307217.4e-8192.07Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAFQSLC NV+PSP SN PE+SSIWRPISTK  QL NPISQS RKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPC+EAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QS EDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

A0A6J1J9Q7 uncharacterized protein LOC1114825686.2e-8091.46Show/hide
Query:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY
        MSITFAFQSLC NV+PSP SN PE+SSIWRPISTK  QL NPISQS RKSLSIVSSKSSEAEELSA EDEWLNKLPEKKKPLYSHSLPC+EAWLK+LGFY
Subjt:  MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFY

Query:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        QS EDRA+WLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
Subjt:  QSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G52960.1 unknown protein9.9e-5462.35Show/hide
Query:  MSITFAFQSLCNN-VYPSPTSNFP----ELSSIWRPISTKQRQLHNPISQS-SRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWL
        M+   AF+ + ++ ++P   S+ P     + S    IS + R   +  S +  RK L++VSSKSS+AEE+S  EDEWL KLPEK KPLYSHSLPC+EAWL
Subjt:  MSITFAFQSLCNN-VYPSPTSNFP----ELSSIWRPISTKQRQLHNPISQS-SRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWL

Query:  KSLGFYQSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP
        + LGFYQSK+DRA+WLI+KP+WHAQLSLDVTDL IRY+KSGPGNLE+D+ERRFSYALSRED ENA+LGGP
Subjt:  KSLGFYQSKEDRALWLIEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCATCACTTTTGCTTTTCAGTCTCTCTGTAACAATGTTTACCCATCTCCCACTTCAAATTTTCCCGAACTTTCATCGATTTGGCGTCCCATTTCCACTAAACAACG
ACAATTACACAACCCCATCTCTCAATCCTCAAGGAAAAGCTTGAGCATTGTCTCTTCCAAGTCATCCGAAGCAGAGGAGCTCTCTGCTCCAGAGGACGAGTGGCTGAACA
AGCTTCCAGAAAAGAAGAAGCCCTTGTACTCTCACAGCTTGCCTTGCGTCGAGGCCTGGTTGAAGAGCTTAGGATTTTACCAAAGTAAAGAGGACCGAGCCTTGTGGTTA
ATCGAGAAGCCTGAATGGCACGCCCAGCTCTCCCTTGATGTCACCGACCTTTATATAAGATATCTAAAGAGTGGACCAGGAAATCTTGAGAAAGACGTGGAGAGGAGATT
TAGCTATGCACTAAGCAGAGAAGATATTGAGAATGCTGTACTTGGAGGACCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTCCATCACTTTTGCTTTTCAGTCTCTCTGTAACAATGTTTACCCATCTCCCACTTCAAATTTTCCCGAACTTTCATCGATTTGGCGTCCCATTTCCACTAAACAACG
ACAATTACACAACCCCATCTCTCAATCCTCAAGGAAAAGCTTGAGCATTGTCTCTTCCAAGTCATCCGAAGCAGAGGAGCTCTCTGCTCCAGAGGACGAGTGGCTGAACA
AGCTTCCAGAAAAGAAGAAGCCCTTGTACTCTCACAGCTTGCCTTGCGTCGAGGCCTGGTTGAAGAGCTTAGGATTTTACCAAAGTAAAGAGGACCGAGCCTTGTGGTTA
ATCGAGAAGCCTGAATGGCACGCCCAGCTCTCCCTTGATGTCACCGACCTTTATATAAGATATCTAAAGAGTGGACCAGGAAATCTTGAGAAAGACGTGGAGAGGAGATT
TAGCTATGCACTAAGCAGAGAAGATATTGAGAATGCTGTACTTGGAGGACCTTGA
Protein sequenceShow/hide protein sequence
MSITFAFQSLCNNVYPSPTSNFPELSSIWRPISTKQRQLHNPISQSSRKSLSIVSSKSSEAEELSAPEDEWLNKLPEKKKPLYSHSLPCVEAWLKSLGFYQSKEDRALWL
IEKPEWHAQLSLDVTDLYIRYLKSGPGNLEKDVERRFSYALSREDIENAVLGGP