CuGenDBv2

Tan0006081 (gene) of Snake gourd v1 genome

Gene ID: Tan0006081
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Cysteine proteinase
Genome location: LG05:39984032..39985732
RNA-Seq Expression: Tan0006081
Synteny: Tan0006081
Gene Ontology terms: NA
InterPro domains: NA
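
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the span can be parsed and measured as follows. `parse_location` and `span_length` are hypothetical helper names, not part of CuGenDB.

```python
import re

def parse_location(location):
    """Split a string such as 'LG05:39984032..39985732' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)

def span_length(start, end):
    # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
    return end - start + 1

seq_id, start, end = parse_location("LG05:39984032..39985732")
print(seq_id, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 1701 bp on LG05.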


Homology
GenBank top hits | e value | % identity
KAG6605202.1 hypothetical protein SDJN03_02519, partial [Cucurbita argyrosperma subsp. sororia] | 5.0e-61 | 77.65
Query:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR
        MAKLAE++S SKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKN+TENRGKTERWEVKNEDE EE E+E  FIEE+E+ELTLGPS+YSNQ +GGG RR
Subjt:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR

Query:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        R+K           S  S SSSSSTTGSAQKSR+FYR GE VG SEMGFVGFL  EDEMRLSQHHPNPWL Q ++LNLT
Subjt:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

KAG7035180.1 hypothetical protein SDJN02_01975, partial [Cucurbita argyrosperma subsp. argyrosperma] | 2.5e-60 | 77.09
Query:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR
        MAKLAE++S SKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKN+ ENRGKTERWEVKNEDE EE E+E  FIEE+E+ELTLGPS+YSNQ +GGG RR
Subjt:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR

Query:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        R+K           S  S SSSSSTTGSAQKSR+FYR GE VG SEMGFVGFL  EDEMRLSQHHPNPWL Q ++LNLT
Subjt:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

XP_022947587.1 uncharacterized protein LOC111451410 [Cucurbita moschata] | 6.5e-61 | 77.65
Query:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR
        MAKLAE++S SKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKN+TENRGKTERWEVKNEDE EE E+E  FIEE+E+ELTLGPS+YSNQ +GGG RR
Subjt:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR

Query:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        R+K           S  S SSSSSTTGSAQKSR+FYR GE VG SEMGFVGFL  EDEMRLSQHHPNPWL Q ++LNLT
Subjt:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

XP_023007023.1 uncharacterized protein LOC111499640 [Cucurbita maxima] | 1.5e-60 | 76.11
Query:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEE--IHFIEENEIELTLGPSSYSNQKIGGGGR
        MAKLAE++S SKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKN+TENRGKTERWEVKNE+EEE ++E    FIEE+E+ELTLGPS+YSNQ +GGG R
Subjt:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEE--IHFIEENEIELTLGPSSYSNQKIGGGGR

Query:  RRKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        RR+K           S  S SSSSSTTGSAQK+R+FYR GE VGGSEMGFVGFL  EDEMRLSQHHPNPWL Q ++LNLT
Subjt:  RRKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

XP_023534353.1 uncharacterized protein LOC111795940 [Cucurbita pepo subsp. pepo] | 2.2e-61 | 77.65
Query:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEG-EEEIHFIEENEIELTLGPSSYSNQKIGGGGRR
        MAKLAE++S SKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKN+TENRGKTERWEVKNEDE+E  E+E  FIEE+E+ELTLGPS+YSNQ +GGG RR
Subjt:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEG-EEEIHFIEENEIELTLGPSSYSNQKIGGGGRR

Query:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        R+K           S  S SSSSSTTGSAQKSR+FYR GE VGGSEMGFVGFL  EDEMRLSQHHPNPWL Q ++LNLT
Subjt:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

TrEMBL top hits | e value | % identity
A0A0A0LJL7 Uncharacterized protein | 3.5e-52 | 69.14
Query:  EQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTE-NRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRRRKKKLG
        EQSYSK NMK AML HEQTFRHQVYELHRLYRIQK+LMKN+ E NRGKTE      E+EEEGEEE  FIEE++IELTLGPS+Y+NQ  GG  R R +++ 
Subjt:  EQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTE-NRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRRRKKKLG

Query:  EMNRGGSDSGISFS-SSSSTTGSAQKSRNFYRD-GEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        +   G SDSG+SFS SSSST GS QK + FYRD GEFV GS+MGF+GFL+V+D++++S HHPNPWLYQ VSLNLT
Subjt:  EMNRGGSDSGISFS-SSSSTTGSAQKSRNFYRD-GEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

A0A1S3C5U9 uncharacterized protein LOC103497380 | 4.1e-53 | 70.29
Query:  EQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTE-NRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRRRKKKLG
        EQSYSK NMK AML HEQTFRHQVYELHRLYRIQK+LMKN+ E NRGKTE      E+EEEGEEE  FIEE+EIELTLGPS+Y+NQ IGG  R R +++ 
Subjt:  EQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTE-NRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRRRKKKLG

Query:  EMNRGGSDSGISFS-SSSSTTGSAQKSRNFYRD-GEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        +   G SDSG+SFS SSSST GS QK + FYRD GEFV GS+MGF+GFL+V+D++++S HHPNPWLYQ VSLNLT
Subjt:  EMNRGGSDSGISFS-SSSSTTGSAQKSRNFYRD-GEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

A0A6J1EXP9 uncharacterized protein LOC111439473 | 6.8e-48 | 68.02
Query:  EQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRRRKKKLGE
        ++SY K N+K AML HE TFRHQVYELHRLYRIQK+LMKN+ ENRGKTE      E+EEE  EEI FI+E+EIELTLGPSSYSNQ   GG RRR+KKLGE
Subjt:  EQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRRRKKKLGE

Query:  MNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        + +GGSD     SSSSSTTGSA K    YRDGE        F+GFLE++DE RL+QHHPNPWL QVVSLNLT
Subjt:  MNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

A0A6J1G6V7 uncharacterized protein LOC111451410 | 3.2e-61 | 77.65
Query:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR
        MAKLAE++S SKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKN+TENRGKTERWEVKNEDE EE E+E  FIEE+E+ELTLGPS+YSNQ +GGG RR
Subjt:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDE-EEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRR

Query:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        R+K           S  S SSSSSTTGSAQKSR+FYR GE VG SEMGFVGFL  EDEMRLSQHHPNPWL Q ++LNLT
Subjt:  RKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

A0A6J1L6K1 uncharacterized protein LOC111499640 | 7.0e-61 | 76.11
Query:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEE--IHFIEENEIELTLGPSSYSNQKIGGGGR
        MAKLAE++S SKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKN+TENRGKTERWEVKNE+EEE ++E    FIEE+E+ELTLGPS+YSNQ +GGG R
Subjt:  MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEE--IHFIEENEIELTLGPSSYSNQKIGGGGR

Query:  RRKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
        RR+K           S  S SSSSSTTGSAQK+R+FYR GE VGGSEMGFVGFL  EDEMRLSQHHPNPWL Q ++LNLT
Subjt:  RRKKKLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G26620.1 Plant protein of unknown function (DUF863) | 5.4e-05 | 37.29
Query:  MAKLAEEQSYS---KENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGG
        M   A+  SYS   K+ MK+ ML+HE  F++QV+ELHRLYR+QK L++   E +GK    EV N  +       H   ENE +  L      N   G G 
Subjt:  MAKLAEEQSYS---KENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGG

Query:  RRRKKKLGEMNRGGSDSG
          +    G +  GGS +G
Subjt:  RRRKKKLGEMNRGGSDSG

AT5G67390.1 unknown protein | 1.8e-16 | 37.3
Query:  YSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEV--------KNEDEE--------EGEEEIHFIEENEIELTLGPSSYSNQKI
        Y K+ MK AMLKHE+TF+ QVYELHRLY++QK+LMKN+  N+  T+   V        +  D E         G   I  ++E+EIELTLGPS Y   ++
Subjt:  YSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEV--------KNEDEE--------EGEEEIHFIEENEIELTLGPSSYSNQKI

Query:  GGGGRRRKK-KLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNL
            +++KK  L EM  G  +SG   S SSS+TGS+  + N   +               +V  E  +      PWL Q ++LN+
Subjt:  GGGGRRRKK-KLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNL

AT5G67390.2 unknown protein | 1.8e-16 | 37.3
Query:  YSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEV--------KNEDEE--------EGEEEIHFIEENEIELTLGPSSYSNQKI
        Y K+ MK AMLKHE+TF+ QVYELHRLY++QK+LMKN+  N+  T+   V        +  D E         G   I  ++E+EIELTLGPS Y   ++
Subjt:  YSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEV--------KNEDEE--------EGEEEIHFIEENEIELTLGPSSYSNQKI

Query:  GGGGRRRKK-KLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNL
            +++KK  L EM  G  +SG   S SSS+TGS+  + N   +               +V  E  +      PWL Q ++LN+
Subjt:  GGGGRRRKK-KLGEMNRGGSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNL
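
In each alignment above, the unlabeled middle row marks an identical residue with its letter, a conservative substitution with `+`, and a mismatch or gap with a blank. The following is a minimal sketch of deriving a percent identity from such a row, assuming identity is counted as identical columns divided by alignment length (the usual BLAST convention; the page itself does not define it). `midline_stats` is an illustrative name, and the input is the first 16 columns of the first GenBank hit.

```python
def midline_stats(midline):
    """Return (identities, positives, length) for a BLAST-style midline row."""
    length = len(midline)
    # A letter in the midline means the query and subject residues are identical.
    identities = sum(1 for ch in midline if ch.isalpha())
    # Positives are identities plus '+' columns (conservative substitutions).
    positives = identities + midline.count("+")
    return identities, positives, length

midline = "MAKLAE++S SKENMK"
identities, positives, length = midline_stats(midline)
print(f"{100 * identities / length:.2f}% identity over {length} columns")
```

Because trailing blanks in a midline row are easily lost when a page is copied, taking the alignment length from the matching `Query:` row would be a safer choice for full alignments.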


Sequences
CDS sequence
ATGGCAAAGCTAGCAGAAGAACAATCTTACAGTAAAGAGAATATGAAGAACGCCATGTTAAAGCACGAACAAACATTTCGTCACCAAGTGTATGAACTTCACAGATTGTA
CAGAATTCAGAAGCTTTTGATGAAGAACTTAACAGAAAACAGAGGAAAAACAGAGAGATGGGAAGTGAAAAATGAGGACGAAGAGGAAGGTGAAGAAGAGATTCATTTTA
TAGAAGAAAATGAGATTGAATTGACATTGGGACCTTCAAGTTATAGCAATCAAAAAATTGGAGGAGGAGGAAGAAGAAGAAAGAAGAAATTGGGTGAAATGAATAGAGGA
GGTTCAGATTCAGGGATCAGCTTCTCTTCTTCTTCTTCAACCACTGGATCTGCACAAAAAAGTAGAAATTTTTACAGAGATGGAGAATTTGTTGGTGGTTCTGAAATGGG
GTTTGTGGGATTTCTTGAGGTTGAAGATGAGATGAGGCTTTCACAGCACCATCCAAATCCTTGGCTTTATCAAGTTGTAAGCCTTAATCTCACTTGA
mRNA sequence
GAGGAGGCTTTGTCTTTCTCTCTCTCTCTCCCTTGGAGCTGATTTCCTGTTTCTGCGTATTCTCTCTCTCTTTTGAAGCTCTCTCATTATCTCTTTCAGATACATCTCAA
TGGCAAAGCTAGCAGAAGAACAATCTTACAGTAAAGAGAATATGAAGAACGCCATGTTAAAGCACGAACAAACATTTCGTCACCAAGTGTATGAACTTCACAGATTGTAC
AGAATTCAGAAGCTTTTGATGAAGAACTTAACAGAAAACAGAGGAAAAACAGAGAGATGGGAAGTGAAAAATGAGGACGAAGAGGAAGGTGAAGAAGAGATTCATTTTAT
AGAAGAAAATGAGATTGAATTGACATTGGGACCTTCAAGTTATAGCAATCAAAAAATTGGAGGAGGAGGAAGAAGAAGAAAGAAGAAATTGGGTGAAATGAATAGAGGAG
GTTCAGATTCAGGGATCAGCTTCTCTTCTTCTTCTTCAACCACTGGATCTGCACAAAAAAGTAGAAATTTTTACAGAGATGGAGAATTTGTTGGTGGTTCTGAAATGGGG
TTTGTGGGATTTCTTGAGGTTGAAGATGAGATGAGGCTTTCACAGCACCATCCAAATCCTTGGCTTTATCAAGTTGTAAGCCTTAATCTCACTTGATTTTACACTTCTAA
TTTTTCATGTTGTTGATTCAGTTTCTTTTTTTTTTTTTTTTTTCCTTTTTGAGCTTAGTTTTTTTTTTTTTTGGTTTTCTTCTCTGATTGGGAGTCTTCTCTGTTGTACA
AGTTTATAACCAAGTTCATATAATTATCATCAGGC
Protein sequence
MAKLAEEQSYSKENMKNAMLKHEQTFRHQVYELHRLYRIQKLLMKNLTENRGKTERWEVKNEDEEEGEEEIHFIEENEIELTLGPSSYSNQKIGGGGRRRKKKLGEMNRG
GSDSGISFSSSSSTTGSAQKSRNFYRDGEFVGGSEMGFVGFLEVEDEMRLSQHHPNPWLYQVVSLNLT
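
The protein sequence can be cross-checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1) applies to this gene, and uses only the first 39 and last 9 nucleotides of the CDS shown above. `translate` is an illustrative helper, not a CuGenDB function.

```python
BASES = "TCAG"
# Standard genetic code, with codons ordered by BASES at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

print(translate("ATGGCAAAGCTAGCAGAAGAACAATCTTACAGTAAAGAG"))  # start of the CDS
print(translate("CTCACTTGA"))  # end of the CDS, including the stop codon
```

The first call yields `MAKLAEEQSYSKE`, matching the first 13 residues of the protein sequence, and the second yields `LT`, matching its last two residues before the `TGA` stop codon.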