; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003442 (gene) of Snake gourd v1 genome

Gene IDTan0003442
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionULP_PROTEASE domain-containing protein
Genome locationLG02:22463173..22464162
RNA-Seq ExpressionTan0003442
SyntenyTan0003442
Gene Ontology termsGO:0110165 - cellular anatomical structure (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8649224.1 hypothetical protein Csa_014966 [Cucumis sativus]6.5e-1638.41Show/hide
Query:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY
        M ESD +   ++ + LG DN++V +DVI VE  ++ +P P+KG +ET L QA   FVAWPR LVI+++  K  S    +          V   IK+L RY
Subjt:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY

Query:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI
        A       +++ + +++ IFGK +T+YL P+DI+++C + EI  +C+L YI
Subjt:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI

XP_008451868.1 PREDICTED: uncharacterized protein LOC103493028 isoform X1 [Cucumis melo]4.2e-1534.21Show/hide
Query:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY
        M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWPR LVI++K  K  S   ++          V   IK+L RY
Subjt:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY

Query:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN
        A +    ++++ + +S+ IFGK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S  +  +N++N
Subjt:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN

XP_031740251.1 uncharacterized protein LOC101213947 [Cucumis sativus]6.5e-1638.41Show/hide
Query:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY
        M ESD +   ++ + LG DN++V +DVI VE  ++ +P P+KG +ET L QA   FVAWPR LVI+++  K  S    +          V   IK+L RY
Subjt:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY

Query:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI
        A       +++ + +++ IFGK +T+YL P+DI+++C + EI  +C+L YI
Subjt:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI

XP_038895921.1 uncharacterized protein LOC120084092 isoform X1 [Benincasa hispida]4.2e-1537.5Show/hide
Query:  MVESDEK-EKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDKDSKQKNQLNIVFQVIRF--VSSCIKILYR
        M ESD +   +NE+ LG DNV+  +D++  E  ++ +P P K  ++T L QA   FVAWPR LVI +K  K        +I  Q  ++  V   IK+L R
Subjt:  MVESDEK-EKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDKDSKQKNQLNIVFQVIRF--VSSCIKILYR

Query:  YA-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI
        YA      D+++ + +S++I GK +T+YL  +DI+++C + EI  +C+L YI
Subjt:  YA-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI

XP_038895930.1 uncharacterized protein LOC120084092 isoform X2 [Benincasa hispida]4.2e-1537.5Show/hide
Query:  MVESDEK-EKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDKDSKQKNQLNIVFQVIRF--VSSCIKILYR
        M ESD +   +NE+ LG DNV+  +D++  E  ++ +P P K  ++T L QA   FVAWPR LVI +K  K        +I  Q  ++  V   IK+L R
Subjt:  MVESDEK-EKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDKDSKQKNQLNIVFQVIRF--VSSCIKILYR

Query:  YA-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI
        YA      D+++ + +S++I GK +T+YL  +DI+++C + EI  +C+L YI
Subjt:  YA-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI

TrEMBL top hitse value%identityAlignment
A0A1S3BRX5 uncharacterized protein LOC103493028 isoform X12.0e-1534.21Show/hide
Query:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY
        M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWPR LVI++K  K  S   ++          V   IK+L RY
Subjt:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY

Query:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN
        A +    ++++ + +S+ IFGK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S  +  +N++N
Subjt:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN

A0A1S4DZN2 uncharacterized protein LOC103493028 isoform X22.0e-1534.21Show/hide
Query:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY
        M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWPR LVI++K  K  S   ++          V   IK+L RY
Subjt:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY

Query:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN
        A +    ++++ + +S+ IFGK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S  +  +N++N
Subjt:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN

A0A5D3CYL9 ULP_PROTEASE domain-containing protein2.0e-1534.21Show/hide
Query:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY
        M ESD +   ++ + LG +N++V +D+  VE  ++ +P P+KG++ET L QA   FVAWPR LVI++K  K  S   ++          V   IK+L RY
Subjt:  MVESD-EKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRY

Query:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN
        A +    ++++ + +S+ IFGK +T+YL  +DI+++C + EI  +C+L YI     +C  ++   FV  D    S H  S  +  +N++N
Subjt:  A-EKYGKDNLVVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIV----LCVVDV---FVNNDYLFYSLH-PSMADDLQNIVN

A0A6J1C2V2 uncharacterized protein LOC111007859 isoform X41.9e-1339.01Show/hide
Query:  VNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYAE-KYGKDNL
        V+ V LG DNV+V +D++  E     IP P++G +ET L Q    FVAWPR LVI+S+     S + +Q          V   IK+L RY       ++ 
Subjt:  VNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYAE-KYGKDNL

Query:  VVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI
        V + +S  IFGK + +YL   DIM++C +IEI  +C+L YI
Subjt:  VVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI

A0A6J1C398 uncharacterized protein LOC111007859 isoform X31.9e-1339.01Show/hide
Query:  VNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYAE-KYGKDNL
        V+ V LG DNV+V +D++  E     IP P++G +ET L Q    FVAWPR LVI+S+     S + +Q          V   IK+L RY       ++ 
Subjt:  VNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDK-DSKQKNQLNIVFQVIRFVSSCIKILYRYAE-KYGKDNL

Query:  VVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI
        V + +S  IFGK + +YL   DIM++C +IEI  +C+L YI
Subjt:  VVVPISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGAGAGTGATGAGAAAGAAAAAGTAAATGAAGTGCTTTTAGGGAAAGACAATGTGAAGGTATTTATTGATGTCATACAAGTTGAAAAAGGTAATCTTTGTATTCC
TTTTCCGATGAAGGGGAATATGGAGACGAGTCTTTACCAAGCTCAGAGTTGTTTTGTTGCTTGGCCTCGCAATCTCGTTATTATTTCTAAAAATGACAAGGATTCTAAAC
AAAAGAACCAACTAAACATAGTTTTCCAAGTCATACGATTTGTCTCTTCATGCATCAAGATTCTTTATCGATATGCTGAAAAATATGGTAAAGATAATCTAGTAGTTGTG
CCCATAAGTGATAGAATATTTGGAAAAGGAGAAACTCTTTATCTTATGCCAGAAGATATCATGAAATTTTGTGCAGTGATAGAGATATCAAACACATGCATGTTAGTCTA
CATTGTATTGTGCGTGGTGGATGTATTTGTGAATAATGATTATCTTTTTTATTCCCTACATCCTAGCATGGCAGACGACCTCCAAAACATTGTGAATACGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTCGAGAGTGATGAGAAAGAAAAAGTAAATGAAGTGCTTTTAGGGAAAGACAATGTGAAGGTATTTATTGATGTCATACAAGTTGAAAAAGGTAATCTTTGTATTCC
TTTTCCGATGAAGGGGAATATGGAGACGAGTCTTTACCAAGCTCAGAGTTGTTTTGTTGCTTGGCCTCGCAATCTCGTTATTATTTCTAAAAATGACAAGGATTCTAAAC
AAAAGAACCAACTAAACATAGTTTTCCAAGTCATACGATTTGTCTCTTCATGCATCAAGATTCTTTATCGATATGCTGAAAAATATGGTAAAGATAATCTAGTAGTTGTG
CCCATAAGTGATAGAATATTTGGAAAAGGAGAAACTCTTTATCTTATGCCAGAAGATATCATGAAATTTTGTGCAGTGATAGAGATATCAAACACATGCATGTTAGTCTA
CATTGTATTGTGCGTGGTGGATGTATTTGTGAATAATGATTATCTTTTTTATTCCCTACATCCTAGCATGGCAGACGACCTCCAAAACATTGTGAATACGTAA
Protein sequenceShow/hide protein sequence
MVESDEKEKVNEVLLGKDNVKVFIDVIQVEKGNLCIPFPMKGNMETSLYQAQSCFVAWPRNLVIISKNDKDSKQKNQLNIVFQVIRFVSSCIKILYRYAEKYGKDNLVVV
PISDRIFGKGETLYLMPEDIMKFCAVIEISNTCMLVYIVLCVVDVFVNNDYLFYSLHPSMADDLQNIVNT