CuGenDBv2

Tan0020700 (gene) of Snake gourd v1 genome

Gene ID: Tan0020700
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Glycoside hydrolase, family 43
Genome location: LG05:46064876..46085656
RNA-Seq Expression: Tan0020700
Synteny: Tan0020700
Gene Ontology terms:
GO:0005975 - carbohydrate metabolic process (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0004553 - hydrolase activity, hydrolyzing O-glycosyl compounds (molecular function)
InterPro domains: NA
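The genome location field packs a sequence name and a coordinate range into one string. A minimal sketch of splitting it apart is shown here; the function name `parse_location` is hypothetical, and the span calculation assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'SEQID:START..END' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


seqid, start, end = parse_location("LG05:46064876..46085656")

# Assumption: coordinates are 1-based and inclusive, so the span
# covers both endpoints.
span = end - start + 1
print(seqid, start, end, span)  # LG05 46064876 46085656 20781
```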


Homology
GenBank top hits | e-value | % identity
KAG6605231.1 hypothetical protein SDJN03_02548, partial [Cucurbita argyrosperma subsp. sororia] | 6.8e-87 | 85.71
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGD + NG YGGDSSSGEEDGDAQWRAAIDSVTTTSVF+SSLTNGLPATS TTASNSDDDSELN+GPQPPK YQIKAQK+LENILETTLEVVEHS AI C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL
         DSK SEGGIRLFKNAPVGVVFD +DELQRPTKKPKILPGKEINE+SKKFKQRIQSV VEG DIIAAG  AREKSIARLEAKEAAAKAAAKREEERVAEL
Subjt:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKF---SWLKKKWRKKK
        KK     WL    R+ K
Subjt:  KKF---SWLKKKWRKKK

KAG7035197.1 hypothetical protein SDJN02_01992 [Cucurbita argyrosperma subsp. argyrosperma] | 5.8e-86 | 85.25
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGD   NG YGGDSSSGEEDGDAQWRAAIDSVTTTSVF+SSLTNGLPATS TTASNSD DSELN+GPQPPK YQIKAQK+LENILETTLEVVEHS AI C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL
         DSK SEGGIRLFKNAPVGVVFD +DELQRPTKKPKILPGKEINE+SKKFKQRIQSV VEG DIIAAG  AREKSIARLEAKEAAAKAAAKREEERVAEL
Subjt:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKF---SWLKKKWRKKK
        KK     WL    R+ K
Subjt:  KKF---SWLKKKWRKKK

XP_022946955.1 uncharacterized protein LOC111450982 [Cucurbita moschata] | 1.8e-87 | 86.18
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGD+R NG YGGDSSSGEEDGDAQWRAAIDSVTTTSVF+SSLTNGLPATS TTASNSDDDSELN+GPQPPK YQIKAQK+LENILETTLEVVEHS AI C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL
         DSK SEGGIRLFKNAPVGVVFD +DELQRPTKKPKILPGKEINE+SKKFKQRIQSV VEG DIIAAG  AREKSIARLEAKEAAAKAAAKREEERVAEL
Subjt:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKF---SWLKKKWRKKK
        KK     WL    R+ K
Subjt:  KKF---SWLKKKWRKKK

XP_023007376.1 uncharacterized protein LOC111499889 [Cucurbita maxima] | 1.4e-87 | 86.18
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGD+R NG YGGDSSSGEEDGDAQWRAAIDSVTTTSVF+SSLTNGLPATSTTTASNSDDDSELN+GPQPPK YQIKAQK+LENILETTLEVVEHS AI C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL
         DSK SEGGIRLFKNAPVGVVFD +DELQRPTKKPKILPGKEINE+SKKFKQRIQSV VEG DIIAAG   REKSIARLEAKEAAAKAAAKREEERVAEL
Subjt:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKF---SWLKKKWRKKK
        KK     WL    R+ K
Subjt:  KKF---SWLKKKWRKKK

XP_023532576.1 uncharacterized protein LOC111794698 [Cucurbita pepo subsp. pepo] | 8.9e-87 | 85.25
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGD+R NG YGGDSSSGEEDGDAQWRAAIDSVTTTSVF+SSLTNGLPATSTTTASNSDDDSELN+GPQPPK YQIKAQK+LENILETTLEVVEHS AI C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL
         DSK SEGGIRLFKNAPVGVVFD +DELQRPTKKPKILPGKEINE+SKKFKQRIQS+ +EG DIIAAG  AREKSIARLEAKEAAAKAAAK EEERVAEL
Subjt:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKF---SWLKKKWRKKK
        KK     WL    R+ K
Subjt:  KKF---SWLKKKWRKKK

TrEMBL top hits | e-value | % identity
A0A1S3C6L4 uncharacterized protein LOC103497429 | 1.0e-72 | 75.23
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGDRR++  +GGDSSSGEEDGDA+WRAAIDSVT +SVFISSLTNG+PATS  T S  DDD ELN+  QPPK YQIKAQKLL+NILETTLE+VEHS ++ C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  -VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAE
          DSK SEGGIRLFKNAPVGVVFD VDEL RPTKKPKILPGKEINE+SKKFKQ+++SV VEG DII A K   EKSIARLEAKEAA KAAAKREEERVA+
Subjt:  -VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAE

Query:  LKKF---SWLKKKWRKKK
        LKK     WL    R+ K
Subjt:  LKKF---SWLKKKWRKKK

A0A5A7TUM5 Glycoside hydrolase, family 43 | 7.9e-73 | 78.82
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGDRR++  +GGDSSSGEEDGDA+WRAAIDSVT +SVFISSLTNG+PATS  T S  DDD ELN+  QPPK YQIKAQKLL+NILETTLE+VEHS ++ C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  -VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAE
          DSK SEGGIRLFKNAPVGVVFD VDEL RPTKKPKILPGKEINE+SKKFKQ+++SV VEG DII A K   EKSIARLEAKEAA KAAAKREEERVA+
Subjt:  -VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAE

Query:  LKK
        LKK
Subjt:  LKK

A0A6J1D727 uncharacterized protein LOC111017872 | 1.6e-73 | 74.42
Query:  DRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISCVD
        D R NG YGGDSSSGEEDGDAQWRAAIDSV T+SVFISSLTNG+P TSTT AS S+DDSELN+   PPK YQIKA+K+LENILETTLEVVEH  ++   D
Subjt:  DRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISCVD

Query:  SKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAELKK
        SK   GGIRLFKNAP+GVVFD VDEL+RPTK+PKI+PGKEINE+SKKFKQR+QSV V+G DIIA+ K A EKS+ RLEA+EAAAKAAAKREEERVAELKK
Subjt:  SKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAELKK

Query:  F---SWLKKKWRKKK
             WL    R+ K
Subjt:  F---SWLKKKWRKKK

A0A6J1G5G9 uncharacterized protein LOC111450982 | 8.7e-88 | 86.18
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGD+R NG YGGDSSSGEEDGDAQWRAAIDSVTTTSVF+SSLTNGLPATS TTASNSDDDSELN+GPQPPK YQIKAQK+LENILETTLEVVEHS AI C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL
         DSK SEGGIRLFKNAPVGVVFD +DELQRPTKKPKILPGKEINE+SKKFKQRIQSV VEG DIIAAG  AREKSIARLEAKEAAAKAAAKREEERVAEL
Subjt:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKF---SWLKKKWRKKK
        KK     WL    R+ K
Subjt:  KKF---SWLKKKWRKKK

A0A6J1L0C9 uncharacterized protein LOC111499889 | 6.7e-88 | 86.18
Query:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC
        MGD+R NG YGGDSSSGEEDGDAQWRAAIDSVTTTSVF+SSLTNGLPATSTTTASNSDDDSELN+GPQPPK YQIKAQK+LENILETTLEVVEHS AI C
Subjt:  MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISC

Query:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL
         DSK SEGGIRLFKNAPVGVVFD +DELQRPTKKPKILPGKEINE+SKKFKQRIQSV VEG DIIAAG   REKSIARLEAKEAAAKAAAKREEERVAEL
Subjt:  VDSKPSEGGIRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAEL

Query:  KKF---SWLKKKWRKKK
        KK     WL    R+ K
Subjt:  KKF---SWLKKKWRKKK

SwissProt top hits | e-value | % identity
No hits found

Arabidopsis top hits | e-value | % identity
AT3G49890.1 unknown protein | 1.0e-32 | 44.71
Query:  GGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISCVDSKP-SEGG
        GGDSSS  ED D +WRAAI+S+ TT+V+ +S T   PA     A+ S +  +  + P+   H QIK + LL  ++E TL+ VE    ++  + KP ++ G
Subjt:  GGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISCVDSKP-SEGG

Query:  IRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAELKKF---SWL
        +RLFK    G+VFD VDE++ P KKP + P K +   SK+FK+R++S+ V+G+DI+ A   A +K+ ARL+AKE AAK  AK+EEER+AELKK     WL
Subjt:  IRLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAELKKF---SWL

Query:  KKKWRKKK
            R  K
Subjt:  KKKWRKKK


Sequences
CDS sequence
ATGGGTGATCGTCGGACAAACGGCGTCTATGGTGGCGACAGCAGCAGCGGTGAGGAAGACGGCGACGCCCAATGGAGAGCCGCCATCGATTCTGTTACAACCACGTCTGT
GTTTATCTCGTCGTTAACTAATGGTCTTCCTGCTACTTCTACAACCACTGCATCAAACTCGGACGATGATTCCGAGCTTAATATCGGTCCTCAGCCGCCCAAGCATTATC
AAATCAAGGCACAGAAGCTATTGGAGAACATTTTGGAAACTACTCTAGAGGTGGTAGAACATTCCAAAGCTATTTCTTGTGTTGATTCCAAACCCAGTGAAGGTGGAATT
CGTTTGTTTAAAAATGCCCCCGTTGGTGTTGTGTTTGATCGCGTGGATGAGCTTCAACGCCCCACAAAGAAACCAAAAATTCTTCCGGGGAAAGAAATCAACGAGAGATC
GAAGAAGTTCAAGCAGCGTATCCAATCCGTGATCGTTGAAGGAACAGACATAATCGCTGCTGGAAAATGTGCGCGTGAGAAGTCAATTGCTAGGCTTGAAGCTAAAGAAG
CAGCAGCCAAAGCCGCTGCTAAAAGAGAGGAAGAAAGGGTAGCAGAACTGAAAAAGTTTTCTTGGTTGAAGAAGAAATGGAGGAAGAAGAAATGTTGTTTACTTGGTATT
TTTTTGGCCGATGTTAGCATAATTAAAACATGA
mRNA sequence
TTATTTTTAGCCGTTCTTAAAATTTTCCAATATAGATGGGGTTGTTCAAAGTTCATACCCCAAATCTTTTATGGGCTTTTCAATGCAAAGCCCAATATATAAACACCCGA
ATTTGGCAGCTGGATATGTGTTACGTGTCCATGCTCGAGAAACGGAATGGGTGATCGTCGGACAAACGGCGTCTATGGTGGCGACAGCAGCAGCGGTGAGGAAGACGGCG
ACGCCCAATGGAGAGCCGCCATCGATTCTGTTACAACCACGTCTGTGTTTATCTCGTCGTTAACTAATGGTCTTCCTGCTACTTCTACAACCACTGCATCAAACTCGGAC
GATGATTCCGAGCTTAATATCGGTCCTCAGCCGCCCAAGCATTATCAAATCAAGGCACAGAAGCTATTGGAGAACATTTTGGAAACTACTCTAGAGGTGGTAGAACATTC
CAAAGCTATTTCTTGTGTTGATTCCAAACCCAGTGAAGGTGGAATTCGTTTGTTTAAAAATGCCCCCGTTGGTGTTGTGTTTGATCGCGTGGATGAGCTTCAACGCCCCA
CAAAGAAACCAAAAATTCTTCCGGGGAAAGAAATCAACGAGAGATCGAAGAAGTTCAAGCAGCGTATCCAATCCGTGATCGTTGAAGGAACAGACATAATCGCTGCTGGA
AAATGTGCGCGTGAGAAGTCAATTGCTAGGCTTGAAGCTAAAGAAGCAGCAGCCAAAGCCGCTGCTAAAAGAGAGGAAGAAAGGGTAGCAGAACTGAAAAAGTTTTCTTG
GTTGAAGAAGAAATGGAGGAAGAAGAAATGTTGTTTACTTGGTATTTTTTTGGCCGATGTTAGCATAATTAAAACATGAACTTAATTGTTTATGCTACTTGGACATTATA
ATAAAATTAAGAGCAAAAATCGAC
Protein sequence
MGDRRTNGVYGGDSSSGEEDGDAQWRAAIDSVTTTSVFISSLTNGLPATSTTTASNSDDDSELNIGPQPPKHYQIKAQKLLENILETTLEVVEHSKAISCVDSKPSEGGI
RLFKNAPVGVVFDRVDELQRPTKKPKILPGKEINERSKKFKQRIQSVIVEGTDIIAAGKCAREKSIARLEAKEAAAKAAAKREEERVAELKKFSWLKKKWRKKKCCLLGI
FLADVSIIKT
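
The CDS and protein sequences above should correspond codon by codon. The following minimal sketch translates a coding sequence so the two can be compared. It assumes the standard nuclear genetic code applies; the names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB. Only the first 33 and last 15 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, keeping '*' for stops."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))


# First 33 nt and last 15 nt of the CDS listed above.
cds_start = "ATGGGTGATCGTCGGACAAACGGCGTCTATGGT"
cds_end = "ATAATTAAAACATGA"

print(translate(cds_start))  # MGDRRTNGVYG
print(translate(cds_end))    # IIKT*
```

To check the whole record, join the wrapped CDS lines into one string, translate it, and compare the result (minus the trailing `*`) with the protein sequence.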