; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0011366 (gene) of Snake gourd v1 genome

Gene IDTan0011366
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationLG11:8633870..8635081
RNA-Seq ExpressionTan0011366
SyntenyTan0011366
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
GO:0097159 - organic cyclic compound binding (molecular function)
GO:1901363 - heterocyclic compound binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602205.1 hypothetical protein SDJN03_07438, partial [Cucurbita argyrosperma subsp. sororia]1.7e-7188.69Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLL GKSQASTSSSSCS RRILDS+LLPNSEFP   RPD  HFSY+MRSNRQFVT+Q LRRYSGG+GNSEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

KAG7032887.1 hypothetical protein SDJN02_06937, partial [Cucurbita argyrosperma subsp. argyrosperma]8.6e-7188.1Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLL GKSQASTSSSSCS RRILDS+LLPNSEFP   RPD  HF Y+MRSNRQFVT+Q LRRYSGG+GNSEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

XP_022964579.1 uncharacterized protein LOC111464557 [Cucurbita moschata]6.6e-7188.1Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLL GKSQASTSSSSCS RRILDS+LLPNSEFP   RPD  HFSY+MRSNRQFVT+Q LRRYSGG+GNSEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYF+EEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

XP_022990474.1 uncharacterized protein LOC111487326 [Cucurbita maxima]8.3e-7491.07Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLL GKSQASTSSSSCS RRILDSALLPNSEFP   RPD  HFSYLMRSNRQFVT+QCLRRYSGG+GNSEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

XP_023545197.1 uncharacterized protein LOC111804570 [Cucurbita pepo subsp. pepo]1.5e-7087.5Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLL GKSQASTSSSSC  RRILDS+LLPNSEFP   RPD  HFSY+MR NRQFVT+QCLRRYSGG+GNSEPE DQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

TrEMBL top hitse value%identityAlignment
A0A1S3C573 uncharacterized protein LOC1034965828.1e-5977.98Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPR RLPLYR L+G SQ S SSSS  FRRILD +  P       WR D  HFSYLM SN QFVT Q  R +S  +G+SEP+FDQVREVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLE+YEGLLAKLS+SERKALQRSMGLKIEQLKAEL QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

A0A5D3BXL9 Uncharacterized protein8.1e-5977.98Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPR RLPLYR L+G SQ S SSSS  FRRILD +  P       WR D  HFSYLM SN QFVT Q  R +S  +G+SEP+FDQVREVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLE+YEGLLAKLS+SERKALQRSMGLKIEQLKAEL QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1BV77 uncharacterized protein LOC1110060462.1e-6281.55Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        M+ R RL L R L+GKSQASTSSSS S RRI DS L P SEFP +WR    HFS LMR N  FVT+QCLRRYSG + +SEPEFDQVREVD INLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLEM+EGLLAKL ESERKALQRSMGLKIEQLKAELKQLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1HL89 uncharacterized protein LOC1114645573.2e-7188.1Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLL GKSQASTSSSSCS RRILDS+LLPNSEFP   RPD  HFSY+MRSNRQFVT+Q LRRYSGG+GNSEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYF+EEAECARDAVKEVL+MYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

A0A6J1JQ75 uncharacterized protein LOC1114873264.0e-7491.07Show/hide
Query:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR
        MSPRLRL L RLL GKSQASTSSSSCS RRILDSALLPNSEFP   RPD  HFSYLMRSNRQFVT+QCLRRYSGG+GNSEPEFDQ+REVDRINLKFAEAR
Subjt:  MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAR

Query:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKL E+ERKALQRSMGLKIEQLKAEL+QLDE
Subjt:  EEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

SwissProt top hitse value%identityAlignment
Q9M9H3 Embryogenesis-like protein3.2e-2870.71Show/hide
Query:  RRYSGGAGNSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        RRYS G+ +  P  D  + VD INLKFAEAREEIE AM++KETVYF+EEAECARDAV EVLEM++GLL K++E E+ +LQRSMGLKIEQLKAEL+QL+E
Subjt:  RRYSGGAGNSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE

Arabidopsis top hitse value%identityAlignment
AT1G71730.1 unknown protein2.3e-2970.71Show/hide
Query:  RRYSGGAGNSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE
        RRYS G+ +  P  D  + VD INLKFAEAREEIE AM++KETVYF+EEAECARDAV EVLEM++GLL K++E E+ +LQRSMGLKIEQLKAEL+QL+E
Subjt:  RRYSGGAGNSEPEFDQVREVDRINLKFAEAREEIESAMESKETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCCTCGTTTACGCCTTCCTCTCTACAGATTGCTTGTCGGAAAATCTCAGGCCTCTACTTCTTCTTCTTCGTGTAGCTTTCGCCGAATTCTGGACTCTGCTCTGTT
GCCAAATTCTGAGTTTCCTATCATATGGCGGCCGGATGTTTTCCATTTTTCATATCTAATGAGGTCGAATCGTCAGTTCGTAACTATTCAGTGTTTGAGGAGGTACAGTG
GTGGTGCAGGCAACTCGGAGCCCGAATTTGATCAGGTTAGAGAGGTGGACAGGATCAATCTCAAGTTCGCCGAAGCGAGAGAAGAGATAGAGTCGGCCATGGAGTCTAAA
GAGACCGTGTATTTTGATGAAGAGGCCGAGTGTGCTCGGGATGCTGTGAAGGAAGTTTTAGAAATGTACGAGGGGCTTCTTGCGAAGTTGTCCGAGAGCGAGAGGAAGGC
GTTGCAGAGGTCTATGGGGCTTAAGATTGAACAGTTGAAGGCCGAGCTTAAACAGCTTGACGAGTAA
mRNA sequenceShow/hide mRNA sequence
CTTTCACTGTCGCTGAACTGATGCTTCAGCCTTTATGCTTCGTTTTCTCACCGAACCGTGATTTCCTTGGCTCTCCTTGTTCGTAGTATCGAGAAATTGCAAAAATGAGC
CCTCGTTTACGCCTTCCTCTCTACAGATTGCTTGTCGGAAAATCTCAGGCCTCTACTTCTTCTTCTTCGTGTAGCTTTCGCCGAATTCTGGACTCTGCTCTGTTGCCAAA
TTCTGAGTTTCCTATCATATGGCGGCCGGATGTTTTCCATTTTTCATATCTAATGAGGTCGAATCGTCAGTTCGTAACTATTCAGTGTTTGAGGAGGTACAGTGGTGGTG
CAGGCAACTCGGAGCCCGAATTTGATCAGGTTAGAGAGGTGGACAGGATCAATCTCAAGTTCGCCGAAGCGAGAGAAGAGATAGAGTCGGCCATGGAGTCTAAAGAGACC
GTGTATTTTGATGAAGAGGCCGAGTGTGCTCGGGATGCTGTGAAGGAAGTTTTAGAAATGTACGAGGGGCTTCTTGCGAAGTTGTCCGAGAGCGAGAGGAAGGCGTTGCA
GAGGTCTATGGGGCTTAAGATTGAACAGTTGAAGGCCGAGCTTAAACAGCTTGACGAGTAATTGAGGTTCATGTATTTTATTTTATTCTTCCCAATTTTGAACATGTCCA
ATTCCCATTCGTTTTCTTATCTGATTGCTGCTTGTGACTGGGAAAGCAAGTCTGGCGTCAATAACTCAGCTCTAGTGTTAGGAGCGGCGAGTAGGAACTTGTTTAGTTGT
TTCTTCCTTGAGAAAAGATGCTGCTGCTGCTAAAAAAAGAACGCACTTCATTTTTCCAATTTGACGTTTATTCTAAATGGTTTGGTGATCCTAAATCTTGATTATCGTTT
TGTAATTGGGAACTGAACCACCCATGAAGGCAGTATATCTATATACATCTCAATTCAGTACGGCCTTGGGGTTACATAGAATGTCCTGTTGCAGGTTTTGTCTTTGAAAC
GATTAGTTTCTAATTTGAGATTTAACTGTGGGATGAAATTTCTATAGTTAGTGCTGGTTCTCAGGAAAAAGCTAAAGAATAAAGTTCTATGTGAAACTGTGGTGAGGATA
ATTTGGAATGTCGTGGTGCTTTCTCTCTCTCTTCATCTGTTGAAAGCTTGGAAAAAAAGAAATAGAGGAATATTTAATGATGGGCTCTTGGACAGGGACATTTTATGGAG
TG
Protein sequenceShow/hide protein sequence
MSPRLRLPLYRLLVGKSQASTSSSSCSFRRILDSALLPNSEFPIIWRPDVFHFSYLMRSNRQFVTIQCLRRYSGGAGNSEPEFDQVREVDRINLKFAEAREEIESAMESK
ETVYFDEEAECARDAVKEVLEMYEGLLAKLSESERKALQRSMGLKIEQLKAELKQLDE