; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0001203 (gene) of Snake gourd v1 genome

Gene IDTan0001203
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionRicin B-type lectin domain-containing protein
Genome locationLG11:12010891..12012739
RNA-Seq ExpressionTan0001203
SyntenyTan0001203
Gene Ontology termsNA
InterPro domainsIPR021410 - The fantastic four family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579017.1 hypothetical protein SDJN03_23465, partial [Cucurbita argyrosperma subsp. sororia]4.1e-4863.8Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML
        +ESCVDL++N T+ D++P ASF E GNSKRD K   M R D+K YPPPIPLLVRTENL SH+PWVMKR YTGDGRLILTEE+V+HHEFFRAHRS+GRLML
Subjt:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML

Query:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        QLV +DD ++ E+ +EE                 + KY NVR RDA FMFGV V AGSLR+VR
Subjt:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR

KAG7016540.1 hypothetical protein SDJN02_21649 [Cucurbita argyrosperma subsp. argyrosperma]1.4e-4864.42Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML
        +ESCVDL++N T+ D+SP ASF E GNSKRD K   M R D+K YPPPIPLLVRTENL SH+PWVMKR YTGDGRLILTEE+V+HHEFFRAHRS+GRLML
Subjt:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML

Query:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        QLV +DD ++ E+ +EE                 + KY NVR RDA FMFGV V AGSLR+VR
Subjt:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR

XP_022939970.1 uncharacterized protein LOC111445672 [Cucurbita moschata]4.9e-4964.42Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML
        +ESCVDL++N T+ D++P ASF E GNSKRD K   MGR D+K YPPPIPLLVRTENL SH+PWVMKR YTGDGRLILTEE+V+HHEFFRAHRS+GRLML
Subjt:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML

Query:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        QLV +DD ++ E+ +EE                 + KY NVR RDA FMFGV V AGSLR+VR
Subjt:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR

XP_022992861.1 uncharacterized protein LOC111489066 [Cucurbita maxima]1.3e-4964.24Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML
        +ESCVDL+ +HT+ D++P ASF E GNSKRDNK   MGR ++K YPPPIPLLVRTENLASH+PWVMKR YTGDGRLILTEE+V+HHEFFRAHRS+GRLML
Subjt:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML

Query:  QLVAVDDYQVSEESNEEMGMRT-------------AFYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        QLV +DD ++ EE +EE                   + KY NVR RD  FMFGV V AGSLR+VR
Subjt:  QLVAVDDYQVSEESNEEMGMRT-------------AFYKYGNVRGRDAGFMFGVAVPAGSLRTVR

XP_023551237.1 uncharacterized protein LOC111809117 [Cucurbita pepo subsp. pepo]4.4e-5066.46Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML
        +ESCVDL+++ T+VD +P ASF E GNSKRD K   MGR D+K YPPPIPLLVRTENLASH+PWVMKR YTGDGRLILTEE+V+HHEFFRAHRS+GRLML
Subjt:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML

Query:  QLVAVDDYQVSEESNEEMGMRTA---------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        QLV +DD  + EE +EE+   +          + KY NVR RDA FMFGV V AGSLR+VR
Subjt:  QLVAVDDYQVSEESNEEMGMRTA---------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR

TrEMBL top hitse value%identityAlignment
A0A6J1CJ12 uncharacterized protein LOC1110118044.9e-4764.42Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQ
        MESCVDL+EN T+ D  PP    + + KR+NKMWG     E+EYPPPIPLLVRTENLASHMPWV+KRHYTGDGRLILTEE+VRHHE+FRAHRS+GRLMLQ
Subjt:  MESCVDLRENHTSVDMSPPASFTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQ

Query:  LVAVDD-------YQVSEESNEEMGMRTAFYKYGNV-RGRDAGFMFGVAVP---AGSLRTVRS
        LVA+DD        Q  E+ + E G +    K G V R RD   MFGVAVP   AG+LRTVRS
Subjt:  LVAVDD-------YQVSEESNEEMGMRTAFYKYGNV-RGRDAGFMFGVAVP---AGSLRTVRS

A0A6J1FP95 uncharacterized protein LOC1114456722.4e-4964.42Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML
        +ESCVDL++N T+ D++P ASF E GNSKRD K   MGR D+K YPPPIPLLVRTENL SH+PWVMKR YTGDGRLILTEE+V+HHEFFRAHRS+GRLML
Subjt:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML

Query:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        QLV +DD ++ E+ +EE                 + KY NVR RDA FMFGV V AGSLR+VR
Subjt:  QLVAVDDYQVSEESNEEMGMRTA-----------FYKYGNVRGRDAGFMFGVAVPAGSLRTVR

A0A6J1GVY4 uncharacterized protein LOC1114580312.0e-4062.26Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQ
        +ESCVDL ENHTS D+S        N +RDNKMWGM R D  EYPPPI LLVRTENLAS MPWV+KRHYT DGRLILTEER+R++EFFRAHRSNGRLMLQ
Subjt:  MESCVDLRENHTSVDMSPPASFTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQ

Query:  LVAVDDYQVSEESNEEMGMR---TAFYKYGNVRGRDAGFM----FGVAVPAGSLRTVRS
        LVA DD Q  +ESN E+G               GRDA        G   P G+LRTVRS
Subjt:  LVAVDDYQVSEESNEEMGMR---TAFYKYGNVRGRDAGFM----FGVAVPAGSLRTVRS

A0A6J1JTI2 uncharacterized protein LOC1114873727.1e-3859.49Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQ
        MESCVDLRENHT+ D+S        N KRDNKM GM R D  EYPPPI LLVRTENLAS MPWV+KR YT DGRLILT+ER+RH+EFFRAHRS+GRLMLQ
Subjt:  MESCVDLRENHTSVDMSPPASFTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQ

Query:  LVAVDDYQVSEESNEEM-------GMRTAFYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        LVA DD Q  +ESN E+       G   A    G+     +    G   P G+L TVR
Subjt:  LVAVDDYQVSEESNEEM-------GMRTAFYKYGNVRGRDAGFMFGVAVPAGSLRTVR

A0A6J1JUP6 uncharacterized protein LOC1114890666.2e-5064.24Show/hide
Query:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML
        +ESCVDL+ +HT+ D++P ASF E GNSKRDNK   MGR ++K YPPPIPLLVRTENLASH+PWVMKR YTGDGRLILTEE+V+HHEFFRAHRS+GRLML
Subjt:  MESCVDLRENHTSVDMSPPASFTE-GNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLML

Query:  QLVAVDDYQVSEESNEEMGMRT-------------AFYKYGNVRGRDAGFMFGVAVPAGSLRTVR
        QLV +DD ++ EE +EE                   + KY NVR RD  FMFGV V AGSLR+VR
Subjt:  QLVAVDDYQVSEESNEEMGMRT-------------AFYKYGNVRGRDAGFMFGVAVPAGSLRTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G22110.1 structural constituent of ribosome6.4e-2350.45Show/hide
Query:  ESCVDLRENHTSVDMSPPAS------FTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNG
        ESC D+       D+   AS      F  G  +R+ +         +E+PPPIPLL +T NL  HMPWV+KR  T DGRLIL EE+VRHHE+FRA+RSNG
Subjt:  ESCVDLRENHTSVDMSPPAS------FTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNG

Query:  RLMLQLVAVDD
        RL L LV +DD
Subjt:  RLMLQLVAVDD

AT1G77932.1 Protein of unknown function (DUF3049)9.3e-1466Show/hide
Query:  LASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQLVAVDD
        L SH+P V+KR YT DGRL+L EE+V  +E+FRAHRSNGRLM+QLV++D+
Subjt:  LASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQLVAVDD

AT5G22390.1 Protein of unknown function (DUF3049)2.6e-0836.26Show/hide
Query:  NSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQLVAVDDYQVSEESNEE
        N   D + W   R + KEYPP              M  +  + Y  +GRL+L E R+   EF RA R +GRL L+LV  +D    EE NE+
Subjt:  NSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQLVAVDDYQVSEESNEE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAGTTGTGTTGATTTGAGGGAGAATCACACGTCGGTTGACATGTCGCCGCCGGCGAGCTTCACTGAGGGGAATAGTAAGAGGGATAACAAAATGTGGGGAATGGG
GAGGCTAGATGAGAAAGAGTATCCACCGCCGATTCCATTGCTGGTTCGCACTGAGAATCTGGCTTCCCACATGCCATGGGTGATGAAACGGCACTACACCGGCGACGGTC
GGTTGATACTGACGGAGGAGAGAGTGAGGCACCATGAGTTTTTCCGTGCCCACAGGTCCAACGGGCGCCTGATGCTGCAGCTTGTGGCCGTCGACGACTACCAGGTCTCC
GAGGAGTCGAATGAGGAAATGGGGATGAGGACGGCGTTTTATAAGTATGGGAATGTGAGGGGCAGAGATGCAGGGTTTATGTTCGGAGTGGCAGTGCCTGCAGGGAGCCT
GAGGACGGTTCGAAGCTAA
mRNA sequenceShow/hide mRNA sequence
CTGACTTCGAACTGCTTGTTTCGTTGGTAACACCACCGCGGGAGAAGCGTCGCCATTTTATCTTCCACTCCAATTCCACCACACAGATAGAGGTTGAAGAAGAGTCGAAG
ATCAAACGCTGTTTCGAAGTCCTCAAAAGTTCTCCATTGCCGCCATTGACGACGACCTTCGATTCTCTCCCTTCGTAGCAAGAAGGCTTCCCCTGTCTCCAGAGATTTCG
ACATTGATGATTATATTGGAATGGAGAGTTGTGTTGATTTGAGGGAGAATCACACGTCGGTTGACATGTCGCCGCCGGCGAGCTTCACTGAGGGGAATAGTAAGAGGGAT
AACAAAATGTGGGGAATGGGGAGGCTAGATGAGAAAGAGTATCCACCGCCGATTCCATTGCTGGTTCGCACTGAGAATCTGGCTTCCCACATGCCATGGGTGATGAAACG
GCACTACACCGGCGACGGTCGGTTGATACTGACGGAGGAGAGAGTGAGGCACCATGAGTTTTTCCGTGCCCACAGGTCCAACGGGCGCCTGATGCTGCAGCTTGTGGCCG
TCGACGACTACCAGGTCTCCGAGGAGTCGAATGAGGAAATGGGGATGAGGACGGCGTTTTATAAGTATGGGAATGTGAGGGGCAGAGATGCAGGGTTTATGTTCGGAGTG
GCAGTGCCTGCAGGGAGCCTGAGGACGGTTCGAAGCTAAAGGGGGAAGATGTCAGTAACAGTATCAGAATAGCATTTTGTGTGTTTGTGTAGTTGAAAGAAACCAACTTA
GTTAGGGACTTTGTGGTCTGTTTGGGACTTTGGGTTGGGTTTTTACGCATTCAGTTTATGACAACTTTAGCATATCCGTAATTAGTTAAAGACATATACTATTCAAATTA
GAAGTTGAAATTGCT
Protein sequenceShow/hide protein sequence
MESCVDLRENHTSVDMSPPASFTEGNSKRDNKMWGMGRLDEKEYPPPIPLLVRTENLASHMPWVMKRHYTGDGRLILTEERVRHHEFFRAHRSNGRLMLQLVAVDDYQVS
EESNEEMGMRTAFYKYGNVRGRDAGFMFGVAVPAGSLRTVRS