CuGenDBv2

Tan0017721 (gene) of Snake gourd v1 genome

Gene ID: Tan0017721
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Mucin-5AC like
Genome location: LG04:85983990..85984744
RNA-Seq Expression: Tan0017721
Synteny: Tan0017721
Gene Ontology terms: NA
InterPro domains: NA
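
The genome location string can be split into a linkage group and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (common for genome browsers, but not stated on this page); the helper names `parse_location` and `span_length` are hypothetical.

```python
import re


def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("LG04:85983990..85984744")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the gene spans 755 bp on LG04.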


Homology
GenBank top hits (accession, description, e-value, % identity, alignment)
KAG6606765.1 hypothetical protein SDJN03_00107, partial [Cucurbita argyrosperma subsp. sororia], e-value 1.5e-61, identity 79.63%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFD--DSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT
        MDA+E H+SK P LFDQILPPRLEDAGLEDCALPPDSIREAFFKAASA+KSTATA LS  D  DSDGY VED WSPTA+LPTD++ GILPE DPPAAC T
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFD--DSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT

Query:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        +KGLKLPEF  D VVVG MEERRGKG CVVDVLEGLE+GDE KK+K    EEE+PILAEGFA
Subjt:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

XP_008447517.1 PREDICTED: uncharacterized protein LOC103489947 [Cucumis melo], e-value 1.1e-56, identity 75.62%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK
        MDA+E H + QP  FDQILPPRLEDAGLED ALPPDSIREAFFKAASAVKS ATALLSP DD +    +DPWSPT+ LPTDI+ GILP+ D PA C T K
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK

Query:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        GLKLPEFG+DEVV+G MEERRGK  CVVD LEGLEIGDE +KEKK   EEEKPIL EGFA
Subjt:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

XP_022949149.1 uncharacterized protein LOC111452587 [Cucurbita moschata], e-value 4.1e-62, identity 80.25%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFD--DSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT
        MDA+E H+SK P LFDQILPPRLEDAGLEDCALPPDSIREAFFKAASA+KSTATA LS  D  DSDGY VED WSPTA+LPTD++ GILPE DPPAAC T
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFD--DSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT

Query:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        +KGLKLPEF  D VVVG MEERRGKG CVVDVLEGLE+GDE KK+KK   EEE+PILAEGFA
Subjt:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

XP_023523630.1 uncharacterized protein LOC111787807 [Cucurbita pepo subsp. pepo], e-value 2.6e-61, identity 79.63%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLS--PFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT
        MDA+E H+SK P LFDQILPPRLEDAGLEDCALPPDSIREAFFKAASA+KSTATA LS    DDSDGY VED WSP A+LPTD++ GILPE DPPAAC T
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLS--PFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT

Query:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        +KGLKLPEF  D VVVG MEERRGKG CVVDVLEGLE+GDE KK+KK   EEE+PILAEGFA
Subjt:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

XP_038900451.1 uncharacterized protein LOC120087668 [Benincasa hispida], e-value 1.9e-56, identity 75.62%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK
        MDA+E+H +  P +FDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSP DD      +DPWSPT+ LPTD++ GILP+RD PA C TEK
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK

Query:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        GLKLPE G DEVV+G MEERRGK  CVVD LEGLEIGDE K  +KKS +EEKPIL EGFA
Subjt:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

TrEMBL top hits (accession, description, e-value, % identity, alignment)
A0A0A0LDD6 Uncharacterized protein, e-value 1.1e-44, identity 68.12%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK
        MDA+E H + QPS F QILPPRLEDAGLED ALPPDSIREAFFKAASAVKS ATA LS  DD D    + P SPT+ALPTD         D PA C T+K
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK

Query:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        GL+LPEFG+DEVV+G MEERRGKG CVVD LEGLEIGD+ +KE  K  E++KP+L EGFA
Subjt:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

A0A1S3BI85 uncharacterized protein LOC103489947, e-value 5.5e-57, identity 75.62%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK
        MDA+E H + QP  FDQILPPRLEDAGLED ALPPDSIREAFFKAASAVKS ATALLSP DD +    +DPWSPT+ LPTDI+ GILP+ D PA C T K
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK

Query:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        GLKLPEFG+DEVV+G MEERRGK  CVVD LEGLEIGDE +KEKK   EEEKPIL EGFA
Subjt:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

A0A5A7U9B5 Uncharacterized protein, e-value 5.5e-57, identity 75.62%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK
        MDA+E H + QP  FDQILPPRLEDAGLED ALPPDSIREAFFKAASAVKS ATALLSP DD +    +DPWSPT+ LPTDI+ GILP+ D PA C T K
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEK

Query:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        GLKLPEFG+DEVV+G MEERRGK  CVVD LEGLEIGDE +KEKK   EEEKPIL EGFA
Subjt:  GLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

A0A6J1GBZ4 uncharacterized protein LOC111452587, e-value 2.0e-62, identity 80.25%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFD--DSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT
        MDA+E H+SK P LFDQILPPRLEDAGLEDCALPPDSIREAFFKAASA+KSTATA LS  D  DSDGY VED WSPTA+LPTD++ GILPE DPPAAC T
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFD--DSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT

Query:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
        +KGLKLPEF  D VVVG MEERRGKG CVVDVLEGLE+GDE KK+KK   EEE+PILAEGFA
Subjt:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA

A0A6J1K989 uncharacterized protein LOC111492831, e-value 2.8e-53, identity 77.18%
Query:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLS--PFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT
        MDA+E H+SK P LFDQILPPRLEDAGLEDCALPPDSI EAFFKAASA+KSTAT  LS    DDSDGY VED WSPTAAL TD++ GI PE DPPAAC T
Subjt:  MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLS--PFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGT

Query:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKS
        +KGLKLPEF  D VVVG MEERRGKG C VDVLEGLE+GDE KK+KK +
Subjt:  EKGLKLPEFGRDEVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKS

SwissProt top hits (accession, description, e-value, % identity, alignment)
No hits found

Arabidopsis top hits (accession, description, e-value, % identity, alignment)
AT1G15230.1 unknown protein, e-value 2.4e-20, identity 50.32%
Query:  LFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEKGL-KLPEFGR--D
        L D ILPP L DAGLEDCALPP+SI+EAF KAA+AVKS A ++    ++ DG C+ DP   TA     II G   ERD    C   KG+ KL E  +  D
Subjt:  LFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEKGL-KLPEFGR--D

Query:  EVVVGEMEERRGKGCCVVDVLEGLEI-GDEDKKEKKKSSEEE-----KPILAEGF
         VV GE EE  GK C  VD L+ L++ G E   EKK  S+E+     KPIL EGF
Subjt:  EVVVGEMEERRGKGCCVVDVLEGLEI-GDEDKKEKKKSSEEE-----KPILAEGF
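
In the alignments above, the middle line follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below counts these for the first ten columns of the first GenBank alignment only, so it is an illustration of how the % identity column is derived and will not reproduce the full-length values reported for each hit. The function name `midline_stats` is hypothetical.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identical, positives, columns) for a BLAST-style midline.

    Letters count as identities; '+' counts as a positive (conservative)
    match; spaces are mismatches or gaps.
    """
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return identical, positives, len(midline)


# First ten alignment columns of the KAG6606765.1 hit (query and midline).
query = "MDASERHKSK"
midline = "MDA+E H+SK"

identical, positives, columns = midline_stats(midline)
print(f"identities {identical}/{columns} ({100 * identical / columns:.1f}%)")
print(f"positives  {positives}/{columns} ({100 * positives / columns:.1f}%)")
```

For this ten-column excerpt the counts are 7 identities and 9 positives.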


Sequences
CDS sequence
ATGGACGCATCAGAACGCCACAAATCCAAACAACCAAGTCTCTTCGATCAAATCCTCCCTCCCCGTCTCGAAGACGCCGGCCTCGAGGATTGCGCCCTTCCTCCCGATTC
CATTCGTGAAGCCTTCTTCAAGGCCGCCTCCGCCGTCAAATCCACGGCCACCGCTCTTCTTTCCCCCTTCGACGATTCCGACGGCTACTGTGTCGAGGATCCATGGTCGC
CTACTGCCGCTCTACCTACCGACATCATCGCTGGAATCTTGCCGGAGCGCGATCCTCCAGCGGCTTGCGGGACGGAGAAGGGATTGAAATTGCCGGAGTTTGGTCGGGAT
GAGGTCGTTGTTGGGGAAATGGAGGAGAGGAGAGGGAAGGGTTGCTGTGTGGTAGATGTATTGGAAGGGTTGGAGATTGGTGATGAAGACAAGAAGGAGAAGAAGAAGAG
CAGTGAAGAAGAGAAACCTATTTTAGCGGAAGGTTTTGCTTGA
mRNA sequence
AATTCCCAAATCTTCCAAATTTAGGGTTTCTCATTCTCATGGACGCATCAGAACGCCACAAATCCAAACAACCAAGTCTCTTCGATCAAATCCTCCCTCCCCGTCTCGAA
GACGCCGGCCTCGAGGATTGCGCCCTTCCTCCCGATTCCATTCGTGAAGCCTTCTTCAAGGCCGCCTCCGCCGTCAAATCCACGGCCACCGCTCTTCTTTCCCCCTTCGA
CGATTCCGACGGCTACTGTGTCGAGGATCCATGGTCGCCTACTGCCGCTCTACCTACCGACATCATCGCTGGAATCTTGCCGGAGCGCGATCCTCCAGCGGCTTGCGGGA
CGGAGAAGGGATTGAAATTGCCGGAGTTTGGTCGGGATGAGGTCGTTGTTGGGGAAATGGAGGAGAGGAGAGGGAAGGGTTGCTGTGTGGTAGATGTATTGGAAGGGTTG
GAGATTGGTGATGAAGACAAGAAGGAGAAGAAGAAGAGCAGTGAAGAAGAGAAACCTATTTTAGCGGAAGGTTTTGCTTGATATTCTGCAGTTTGATTAAGCTCTGCGAA
ATTCTTTGGTGCAAATTTTAATGGAGTTCTTGTTATTGATTGTGCATAACGTGGAGTTAGAGAGATTTTGCTTCAATTTTGCTGCCTGCTTCTTTGTGAATGTGCATAGA
GAGAGTCAGAGAGATGAGTTGATTTTTTTTTAAATTTTTTGTTTCTATTTAATCAAACTCCTTTTTAATTTTCTTTTCTCTAAGTTAATAGCCAC
Protein sequence
MDASERHKSKQPSLFDQILPPRLEDAGLEDCALPPDSIREAFFKAASAVKSTATALLSPFDDSDGYCVEDPWSPTAALPTDIIAGILPERDPPAACGTEKGLKLPEFGRD
EVVVGEMEERRGKGCCVVDVLEGLEIGDEDKKEKKKSSEEEKPILAEGFA
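
The protein sequence can be cross-checked against the CDS by translating it with the standard genetic code. This is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies to this gene; `translate` is a hypothetical helper, and only the first ten and last five codons of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence; '*' marks a stop codon.

    Trailing bases that do not form a complete codon are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


cds_start = "ATGGACGCATCAGAACGCCACAAATCCAAA"  # first 10 codons of the CDS
cds_end = "GAAGGTTTTGCTTGA"                   # last 5 codons, including stop

print(translate(cds_start))  # MDASERHKSK
print(translate(cds_end))    # EGFA*
```

Both fragments agree with the corresponding ends of the protein sequence; passing the full CDS to `translate` extends the same check to the whole sequence.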