; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0005040 (gene) of Snake gourd v1 genome

Gene IDTan0005040
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionAdenine nucleotide alpha hydrolases-like superfamily protein
Genome locationLG01:104984509..104986142
RNA-Seq ExpressionTan0005040
SyntenyTan0005040
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA
IPR014729 - Rossmann-like alpha/beta/alpha sandwich fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6608056.1 hypothetical protein SDJN03_01398, partial [Cucurbita argyrosperma subsp. sororia]8.6e-7990Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
        MEGFER+GKKRVMVVVDH+SQSKHAM+WALTHVA KGDLFTLLHIVSHSN    LSETP  DSSSFLANSLGYLCKASRPEVEVEALVIQGPKLET+LSQ
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ

Query:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        VKKLEASVLVVPQKKPSLF CFCG SSSEQL+EQCI+HADCCTIGVRKQT GMGGYLIN+RWQKNFWLLA
Subjt:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

XP_022940999.1 uncharacterized protein LOC111446414 [Cucurbita moschata]1.7e-7990.59Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
        MEGFER+GKKRVMVVVDH+SQSKHAM+WALTHVA KGDLFTLLHIVSHSN    LSETP  DSSSFLANSLGYLCKASRPEVEVEALVIQGPKLET+LSQ
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ

Query:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        VKKLEASVLVVPQKKPSLF CFCGTSSSEQL+EQCI+HADCCTIGVRKQT GMGGYLIN+RWQKNFWLLA
Subjt:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

XP_022981064.1 uncharacterized protein LOC111480323 [Cucurbita maxima]1.3e-7991.76Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
        MEGFER GKKRVMVVVDH+SQSKHAM+WALTHVA KGDLFTLLHIVSHSN    LSETP  DSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ

Query:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        VKKLEASVLVVPQKKPSLF CFCGTSSSEQL+EQCINHADCCTIGVRKQT GMGGYLIN+RWQKNFWLLA
Subjt:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

XP_023523706.1 uncharacterized protein LOC111787863 [Cucurbita pepo subsp. pepo]6.0e-8091.18Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
        MEGFERYGKKRVMVVVDH+SQSKHAM+WALTHVA KGDLFTLLHIVSHSN    LSETP  DSSSFLANSLGYLCKASRPEVEVEALVIQGPKLET+LSQ
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ

Query:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        VKKLEASVLVVPQKKPSLF CFCGTSSSEQL+EQCI+HADCCTIGVRKQT GMGGYLIN+RWQKNFWLLA
Subjt:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

XP_038898922.1 uncharacterized protein LOC120086377 [Benincasa hispida]2.4e-8191.76Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
        ME FERYGKKRVMVVVDHTSQSKHAMMWALTHVA KGDLFTLLHIVSHSN    LSE P+  SSSFLANSLGYLCKASRPEVEVEALVIQGPK+ETVLSQ
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ

Query:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        VKKLEASVLVVPQKKPSLF CFCGTSSSEQL+EQCINHADCCTIGVRKQTNGMGGYLIN+RWQKNFWLLA
Subjt:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

TrEMBL top hitse value%identityAlignment
A0A0A0KYK7 Usp domain-containing protein7.1e-7988.95Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETP--ASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVL
        MEGFERYGKKRVMVVVDHTS SKHAM+WALTHVA KGDL TLLHIVSHS +   LSE P  +S SSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVL
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETP--ASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVL

Query:  SQVKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        SQVKKLEASVLVVPQKKPSLF CFCGT+SSEQL+EQCINHADCCTIGVR+QTNGMGGYLIN+RWQKNFWLLA
Subjt:  SQVKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

A0A1S3B978 uncharacterized protein LOC1034875817.1e-7988.89Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPA-SDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLS
        MEGFERYGKKRVMVVVDHTS SKHAM+WALTHVA KGDL TLLHIVSHS +   LSE P+ S SSSFLANSLGYLCKASRPEVE+EALVIQGPKLETVLS
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPA-SDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLS

Query:  QVKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        QVKKLEA+VLVVPQKKPSLF CFCGT+SSEQL+EQCINHADCCTIGVRKQTNGMGGYLIN+RWQKNFWLLA
Subjt:  QVKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

A0A5A7TPE6 UspA7.1e-7988.89Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPA-SDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLS
        MEGFERYGKKRVMVVVDHTS SKHAM+WALTHVA KGDL TLLHIVSHS +   LSE P+ S SSSFLANSLGYLCKASRPEVE+EALVIQGPKLETVLS
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPA-SDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLS

Query:  QVKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        QVKKLEA+VLVVPQKKPSLF CFCGT+SSEQL+EQCINHADCCTIGVRKQTNGMGGYLIN+RWQKNFWLLA
Subjt:  QVKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

A0A6J1FLX8 uncharacterized protein LOC1114464148.4e-8090.59Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
        MEGFER+GKKRVMVVVDH+SQSKHAM+WALTHVA KGDLFTLLHIVSHSN    LSETP  DSSSFLANSLGYLCKASRPEVEVEALVIQGPKLET+LSQ
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ

Query:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        VKKLEASVLVVPQKKPSLF CFCGTSSSEQL+EQCI+HADCCTIGVRKQT GMGGYLIN+RWQKNFWLLA
Subjt:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

A0A6J1IYE7 uncharacterized protein LOC1114803236.5e-8091.76Show/hide
Query:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
        MEGFER GKKRVMVVVDH+SQSKHAM+WALTHVA KGDLFTLLHIVSHSN    LSETP  DSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ
Subjt:  MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQ

Query:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        VKKLEASVLVVPQKKPSLF CFCGTSSSEQL+EQCINHADCCTIGVRKQT GMGGYLIN+RWQKNFWLLA
Subjt:  VKKLEASVLVVPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G44760.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.2e-5467.9Show/hide
Query:  KRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQVKKLEASVL
        KRVMVVVD +S+SKHAMMWALTH+  KGDL TLLH+VS           P  +++  LA SLG LCKA +PEV+VEALVIQGPKL TVLSQVKKLE SVL
Subjt:  KRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQVKKLEASVL

Query:  VVPQKKPS-LFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        V+ QKK + L SC CG S SE+L+ +CIN ADC TIGVRKQ  G+GGYLIN+RWQKNFWLLA
Subjt:  VVPQKKPS-LFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein5.7e-2031.67Show/hide
Query:  KRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNS------NKHLSE-----TPASDSSSFLANSLGYLCKASRPEVEVEALVIQG-PKLETV
        +R++VVVD  S++K+A++W L+H A+  D   LLH +    S      NK   E      P +  +    ++L  +C+  RPEV+ E + ++G  K  T+
Subjt:  KRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNS------NKHLSE-----TPASDSSSFLANSLGYLCKASRPEVEVEALVIQG-PKLETV

Query:  LSQVKKLEASVLVVPQKKP-------SLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        + + ++ EAS+LV+ QKK         +++      +    +E CIN++ C  I VRK+   +GGY + ++  K+FWLLA
Subjt:  LSQVKKLEASVLVVPQKKP-------SLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein1.7e-1937.13Show/hide
Query:  MVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSF--LANSLGYLCKASRPEVEVEALVIQ--GPKLETVLSQVKKLEASV
        MVVVD TSQ+K+A+ WALTH  +  D  TLLH V+ +   + + ET    +S    L + L   C+  +P V+ E +V++    K +T++ + KK  A V
Subjt:  MVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSF--LANSLGYLCKASRPEVEVEALVIQ--GPKLETVLSQVKKLEASV

Query:  LVVPQKKPS-----LFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        LV+ Q+K +     ++           ++E CI+++DC  I VRK++N  GGYLI ++  K+FWLLA
Subjt:  LVVPQKKPS-----LFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein2.9e-1633.14Show/hide
Query:  RVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQG---PKLETVLSQVKKLEAS
        RVMVVVD    S  A+ WAL H  +  D   LL+        K      +   +  L ++L  LC+  RP +EVE   +QG    K E ++ + K+ + S
Subjt:  RVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQG---PKLETVLSQVKKLEAS

Query:  VLVV-PQKKPSLFSCFCGTSSSEQ-----LMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        +LVV  +KKP ++         ++      ++ C+  A C TI V+ +   +GGYLI ++  KNFWLLA
Subjt:  VLVV-PQKKPSLFSCFCGTSSSEQ-----LMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.1e-1833.71Show/hide
Query:  ERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQG---PKLETVLSQV
        E  G  RVMVVVD    S  A+ WA+TH  +  D   LL+       +K  +      +   L ++L  LC+  RP +EVE   ++G    K + ++ + 
Subjt:  ERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQG---PKLETVLSQV

Query:  KKLEASVLVVPQ-KKPSLFS-----CFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA
        KK + S+LVV Q KKP ++       +      E +++ C+ +A C TI V+ +   +GGYLI ++  KNFWLLA
Subjt:  KKLEASVLVVPQ-KKPSLFS-----CFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGTTTGAGAGGTACGGTAAGAAGAGGGTGATGGTGGTGGTGGATCATACATCGCAGTCCAAGCATGCTATGATGTGGGCTCTAACTCATGTGGCTAAAAAGGG
TGATTTGTTCACTCTTCTTCACATTGTTTCTCACTCAAACTCAAACAAGCACCTCTCTGAAACTCCGGCTTCTGATTCTTCTTCATTTCTTGCTAACTCTCTTGGCTACC
TCTGCAAAGCTTCTAGACCTGAGGTGGAAGTTGAAGCACTTGTGATTCAAGGCCCAAAACTAGAGACAGTTTTGAGCCAAGTGAAGAAGCTTGAGGCATCTGTTTTAGTG
GTGCCCCAGAAAAAGCCCTCTCTCTTTAGCTGCTTCTGCGGGACCAGCAGCTCAGAGCAGCTTATGGAACAGTGCATCAACCATGCAGATTGCTGTACAATTGGGGTTAG
AAAACAGACCAATGGCATGGGCGGTTACCTTATCAACAGCAGGTGGCAGAAGAACTTCTGGCTTCTTGCTTAG
mRNA sequenceShow/hide mRNA sequence
GAAAAACTCAAAGTTTCTATATAACCTGCAACTGCAACCATCTTCCTAGCGAAAAAGGGGTTAGAGCTAACGCCAATATAGCTGAAGAGGTTCATTTTTTAAGGCAACTA
AAAGTGAAAGAAAGGCTTGAATTGAAGACTTCAGAAATCAAAGAAGTGGGGGTTTGATTTTTACCCTACAAGAAAGTAAGAAAGTTTGGGTCTGCACTAGCACTGCAGCA
AAAGCTTTAAACTGAGCGTCCATGGAAGGGTTTGAGAGGTACGGTAAGAAGAGGGTGATGGTGGTGGTGGATCATACATCGCAGTCCAAGCATGCTATGATGTGGGCTCT
AACTCATGTGGCTAAAAAGGGTGATTTGTTCACTCTTCTTCACATTGTTTCTCACTCAAACTCAAACAAGCACCTCTCTGAAACTCCGGCTTCTGATTCTTCTTCATTTC
TTGCTAACTCTCTTGGCTACCTCTGCAAAGCTTCTAGACCTGAGGTGGAAGTTGAAGCACTTGTGATTCAAGGCCCAAAACTAGAGACAGTTTTGAGCCAAGTGAAGAAG
CTTGAGGCATCTGTTTTAGTGGTGCCCCAGAAAAAGCCCTCTCTCTTTAGCTGCTTCTGCGGGACCAGCAGCTCAGAGCAGCTTATGGAACAGTGCATCAACCATGCAGA
TTGCTGTACAATTGGGGTTAGAAAACAGACCAATGGCATGGGCGGTTACCTTATCAACAGCAGGTGGCAGAAGAACTTCTGGCTTCTTGCTTAG
Protein sequenceShow/hide protein sequence
MEGFERYGKKRVMVVVDHTSQSKHAMMWALTHVAKKGDLFTLLHIVSHSNSNKHLSETPASDSSSFLANSLGYLCKASRPEVEVEALVIQGPKLETVLSQVKKLEASVLV
VPQKKPSLFSCFCGTSSSEQLMEQCINHADCCTIGVRKQTNGMGGYLINSRWQKNFWLLA