; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC09G165200 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC09G165200
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
Descriptionadenylylsulfatase HINT3
Genome locationCiama_Chr09:7143861..7150844
RNA-Seq ExpressionCaUC09G165200
SyntenyCaUC09G165200
Gene Ontology termsGO:0006790 - sulfur compound metabolic process (biological process)
GO:0009150 - purine ribonucleotide metabolic process (biological process)
GO:0005777 - peroxisome (cellular component)
GO:0047627 - adenylylsulfatase activity (molecular function)
InterPro domainsIPR001310 - Histidine triad (HIT) protein
IPR011146 - HIT-like domain
IPR036265 - HIT-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0059707.1 putative HIT-like protein MT1300 isoform X3 [Cucumis melo var. makuwa]1.6e-6493.94Show/hide
Query:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS
        MEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+I+RGEAPAFKLY+D+SCLCILDTRPLSNGHSLIIPKSHYSS
Subjt:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS

Query:  LEATPPSVIAAMCSKVPIISNAIMKSTGSGMN
        LEATPPSVIAAMCSKVPIISNAIMKSTGSGMN
Subjt:  LEATPPSVIAAMCSKVPIISNAIMKSTGSGMN

KAE8646299.1 hypothetical protein Csa_016401 [Cucumis sativus]6.4e-7490.73Show/hide
Query:  KLGVYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLC
        ++ VYTYGDSQFDC NFCFN SMEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+IIRGEAPAFKLY+D+SCLC
Subjt:  KLGVYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLC

Query:  ILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
        ILDT+PLSNGH+LIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  ILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS

XP_016901093.1 PREDICTED: uncharacterized HIT-like protein MT1300 isoform X1 [Cucumis melo]1.2e-7593.92Show/hide
Query:  VYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILD
        VYTYGDSQFDC NFCFNWSMEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+I+RGEAPAFKLY+D+SCLCILD
Subjt:  VYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILD

Query:  TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
        TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS

XP_016901094.1 PREDICTED: uncharacterized HIT-like protein MT1300 isoform X2 [Cucumis melo]1.9e-7093.57Show/hide
Query:  FDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGH
        FDC NFCFNWSMEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+I+RGEAPAFKLY+D+SCLCILDTRPLSNGH
Subjt:  FDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGH

Query:  SLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
        SLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  SLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS

XP_031744877.1 adenylylsulfatase HINT3 isoform X1 [Cucumis sativus]1.9e-7392.57Show/hide
Query:  VYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILD
        VYTYGDSQFDC NFCFN SMEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+IIRGEAPAFKLY+D+SCLCILD
Subjt:  VYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILD

Query:  TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
        T+PLSNGH+LIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS

TrEMBL top hitse value%identityAlignment
A0A0A0K757 HIT domain-containing protein2.7e-6293.02Show/hide
Query:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS
        MEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+IIRGEAPAFKLY+D+SCLCILDT+PLSNGH+LIIPKSHYSS
Subjt:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS

Query:  LEATPPSVIAAMCSKVPIISNAIMKSTGS
        LEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  LEATPPSVIAAMCSKVPIISNAIMKSTGS

A0A1S3BRW7 uncharacterized HIT-like protein MT1300 isoform X37.2e-6393.8Show/hide
Query:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS
        MEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+I+RGEAPAFKLY+D+SCLCILDTRPLSNGHSLIIPKSHYSS
Subjt:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS

Query:  LEATPPSVIAAMCSKVPIISNAIMKSTGS
        LEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  LEATPPSVIAAMCSKVPIISNAIMKSTGS

A0A1S4DYN9 uncharacterized HIT-like protein MT1300 isoform X29.4e-7193.57Show/hide
Query:  FDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGH
        FDC NFCFNWSMEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+I+RGEAPAFKLY+D+SCLCILDTRPLSNGH
Subjt:  FDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGH

Query:  SLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
        SLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  SLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS

A0A1S4DYP3 uncharacterized HIT-like protein MT1300 isoform X15.7e-7693.92Show/hide
Query:  VYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILD
        VYTYGDSQFDC NFCFNWSMEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+I+RGEAPAFKLY+D+SCLCILD
Subjt:  VYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILD

Query:  TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
        TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS
Subjt:  TRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGS

A0A5A7UYI3 Putative HIT-like protein MT1300 isoform X37.7e-6593.94Show/hide
Query:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS
        MEARRLAILCSHLCP+NLG SPAPLLNLSSSSCAS SKSEHLIYDSQKG LQDDCVFC+I+RGEAPAFKLY+D+SCLCILDTRPLSNGHSLIIPKSHYSS
Subjt:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS

Query:  LEATPPSVIAAMCSKVPIISNAIMKSTGSGMN
        LEATPPSVIAAMCSKVPIISNAIMKSTGSGMN
Subjt:  LEATPPSVIAAMCSKVPIISNAIMKSTGSGMN

SwissProt top hitse value%identityAlignment
F4K1R2 Adenylylsulfatase HINT39.1e-3965.12Show/hide
Query:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS
        MEARRLAILCSHL P   G +P     L  S C+S S  +  +  S    LQ+DCVFC+IIRGE+P  KLYED+ CLCILDT PLS+GHSLIIPK HY +
Subjt:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS

Query:  LEATPPSVIAAMCSKVPIISNAIMKSTGS
        LE TPPSV+AAMCSKVP+ISNAI+K+TGS
Subjt:  LEATPPSVIAAMCSKVPIISNAIMKSTGS

P94252 Uncharacterized HIT-like protein BB_03791.2e-0939.02Show/hide
Query:  DCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSV---IAAMCSKVPIISNAIMKSTGSGMN
        DC+FC+II  E P++K+YED+  L  LD  PL+ GH+L+IPK H  SL          +  +C K+      I  S   G+N
Subjt:  DCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSV---IAAMCSKVPIISNAIMKSTGSGMN

P9WML0 Uncharacterized HIT-like protein MT13001.9e-1255.17Show/hide
Query:  CVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAM
        CVFC II GEAPA ++YED   L ILD RP + GH+L++PK H   L  TPP  +A M
Subjt:  CVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAM

P9WML1 Uncharacterized HIT-like protein Rv1262c1.9e-1255.17Show/hide
Query:  CVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAM
        CVFC II GEAPA ++YED   L ILD RP + GH+L++PK H   L  TPP  +A M
Subjt:  CVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAM

Q58276 Uncharacterized HIT-like protein MJ08662.6e-0937.18Show/hide
Query:  CVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGSGMN
        C+FC+II GE PA  +YEDE  L  LD  P + GH+L++PK HY   +  P   +      V      + K    G N
Subjt:  CVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGSGMN

Arabidopsis top hitse value%identityAlignment
AT4G16566.1 histidine triad nucleotide-binding 44.0e-0535.42Show/hide
Query:  GLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSH
        G+   C+FC I+R       L+ DE  +   D +P +  H L+IPK H
Subjt:  GLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSH

AT5G48545.1 histidine triad nucleotide-binding 36.5e-4065.12Show/hide
Query:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS
        MEARRLAILCSHL P   G +P     L  S C+S S  +  +  S    LQ+DCVFC+IIRGE+P  KLYED+ CLCILDT PLS+GHSLIIPK HY +
Subjt:  MEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLYEDESCLCILDTRPLSNGHSLIIPKSHYSS

Query:  LEATPPSVIAAMCSKVPIISNAIMKSTGS
        LE TPPSV+AAMCSKVP+ISNAI+K+TGS
Subjt:  LEATPPSVIAAMCSKVPIISNAIMKSTGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGACATGCATTGGGTCTAAATGTGGAAGACTCTGGCCACCTTAGATCAAAGTTGGGCGTATATACTTATGGCGATTCACAGTTTGACTGCAATAATTTCTGCTTCAA
CTGGTCTATGGAGGCCCGGAGGCTTGCCATTCTCTGTTCACATCTGTGCCCAATTAACTTGGGACGTAGCCCAGCTCCTCTCTTAAATCTTTCCTCTTCGAGCTGTGCTT
CTGAATCCAAGAGTGAGCATCTAATTTATGATTCTCAAAAGGGGGGTCTGCAAGATGATTGTGTTTTCTGCAGGATTATACGAGGCGAGGCACCTGCTTTTAAGCTCTAT
GAAGATGAAAGTTGCCTTTGTATATTAGACACAAGACCACTGAGTAATGGGCACTCTCTAATTATACCAAAATCTCATTATTCTTCGTTGGAAGCTACACCTCCTTCTGT
GATAGCTGCAATGTGTTCGAAAGTTCCCATCATTAGCAATGCAATCATGAAGTCTACTGGTAGTGGTATGAACTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGACATGCATTGGGTCTAAATGTGGAAGACTCTGGCCACCTTAGATCAAAGTTGGGCGTATATACTTATGGCGATTCACAGTTTGACTGCAATAATTTCTGCTTCAA
CTGGTCTATGGAGGCCCGGAGGCTTGCCATTCTCTGTTCACATCTGTGCCCAATTAACTTGGGACGTAGCCCAGCTCCTCTCTTAAATCTTTCCTCTTCGAGCTGTGCTT
CTGAATCCAAGAGTGAGCATCTAATTTATGATTCTCAAAAGGGGGGTCTGCAAGATGATTGTGTTTTCTGCAGGATTATACGAGGCGAGGCACCTGCTTTTAAGCTCTAT
GAAGATGAAAGTTGCCTTTGTATATTAGACACAAGACCACTGAGTAATGGGCACTCTCTAATTATACCAAAATCTCATTATTCTTCGTTGGAAGCTACACCTCCTTCTGT
GATAGCTGCAATGTGTTCGAAAGTTCCCATCATTAGCAATGCAATCATGAAGTCTACTGGTAGTGGTATGAACTGACATCTACTTGTTTCATTGTATGATTTTGTCTTCC
CTCCATTTTTTCAAATATTCAGTTTTTCTATGTTGGAATCCATGTATGTTTGAAATATTATTTTGGTGAGCGAATTAGGGCTTGAGAGTGATCTCAAGAGGGAATGGTAA
GTACCTCAAAACACCTGGTTTGTCTTGTAAGCTTTCACCCTTTATATTTCAATATAGTATGGTTTTGTTTTAGTTCCTATTAGAACTACACCCATCATTTTCAATCTTGT
CCCCAATTTATTGTTAATTATGTTTTAAAATGTGCAAGGTTTTGTGTTTAACAGGAAAAAACGATGTCAAGAAAAATGGTTCTTTTAACGTAGTTGCATGACTACTATGT
GAAAGGGTGGTTTTAGCCAATCATTTGAACACTTATTAAATGTATAGGAATGAGGTATTGCATTTAGATTTAAAATCGCGTAAAAAATACCTACCAAATTGCAAGGAATT
TTCCTTGCCTAAGGATAATATTCCCCTTTATTTACTTTAAATTGAAATATATATAACTTTTTTATATATAACCTATAAAAAAGTTATATATTCCAAGGAATAATTTGAGA
GACCCCTTCCTCCTTCCTGACTTTCAATCTATCAACTACTATAGGACTACCTTCTCAACACCAAAGAACCTCAAAATAATTGTTGGAGTCACCTCTTGTTCAAGAATACC
TAGAAAAACTCTAACCACTTAACAAATGTACCCCTCCTCCACTCCCTCTGATGATGACTTCCATAACGTTGATGGTTGACCTATCAGATCTTGAACAAACTGCCTTGATA
CAAAGTGCTTCATGCAATTCTTTAGGCTGAGTCACTTTTGACATTGACTTTGACGATGAAAAGGAGATCCCAAGTCCATAAAAAGGAGATTTCCACAAAAGAAAAGGAGT
CCAATTGAGTAAAATAAGACCGAAAGAATTTGACATTGTCACCCAAAGACAAACATAGAATTTAATAAGGGACTACACATCCCTACTATCCCCCTCCACCTCTAAAGATC
CTATCATTTTTTTCTCCCCACGACCCCCACAACAAAGCAGAAATGCCGGCAAACCATAAAAATCTTCTCTTTTGAATCGGCGGATGGACAGGGAACTCCTCAATTATTTG
GCTATAGTCTCCATGATGTGCAAACTGAAAGCTAACCCCCTAAAAAAAATAACTCCAAACAGCCTGAGCGGCGCTACAGTTCCAAAGAATATGATCCAAGCGGCCCTCCA
ATAAAGAATGCAACAAAATAACTCAACCAAAGAAGGCATATTTCTCGGAAGCTGATCAAGAGTATTTTCTTTGTTCTCTATTCTATTAAAAACCCTTATCTCCATGTAAA
GTTGATTACATGTACTGTTATACAAGTGCCACAACGTTTGGAAGCCATCTGATGAAAAAATAATATAGCTCTCTAGATTTCCTATTAAATGCAAGTTGCCTCCTCACTCA
ACCTTCTATTTCTAGCGCAGGACCCTTCATTTAGAATGGATATCCTTGTCTAATTCTTCTTTTATGTTGCATTGTGATGCCGGTGATGTTTTTATAGAGTTCTGAAACTG
ACATCATACCATGAATCTGAAGCAGGAACATGTAATTAGAATCAATATCGATGTCTAGTTTTATCCATTTACATTTTTCAATCTGTGGTGGATGTTTTTTGCTTCAAGTA
TGTCTTTGTAGTCTTTCACTTTTTCTCAATAGTTCAGTTTCTTGCGCAAAAAACCAAAAAATGAAGTAGTAGTTTAAATGACATGAAGAGAAACTTACCTCATGCAGATT
CATTCAACTTATTAGTTAACAATGGTGTGGCTGCTGGTCAAGTTATATTCCATACTCACATTCACATAATTCCACGCAAGGCGAGGGATTGCTTATGGGCATCTGAGAGT
TTGGAGAGAAGAACGCTAAAATTTGATGAGGAGGCATCAAGGCTTGCAAAAAGTATACAAGAAATTTTACACAGCACCAAAGAGAATGATGGCAAGGTTCAAGAATCAAA
TCTCACTGAAAATTAGTAATGTAATTAGCCTGCTGTTGCAATTTTAGCTTGTATTATTTTGTACCATATTCATGTATGTGAATTACAGTTATTCTTTTTAAGCGCCATAC
AGTTGGATACTTAACAAACAATAAGGGAGGAAAAAGAAAGGTTTTCTTGTAGATCTGATTGTAAAGTTTACTTGATGTTTGGTGTGGTGCAAAGTTTATGAGAAATCCCA
TTGGTTGGGGGAAATTTTATTATTTTCACTTCCTAGATTGATGATGTA
Protein sequenceShow/hide protein sequence
MGHALGLNVEDSGHLRSKLGVYTYGDSQFDCNNFCFNWSMEARRLAILCSHLCPINLGRSPAPLLNLSSSSCASESKSEHLIYDSQKGGLQDDCVFCRIIRGEAPAFKLY
EDESCLCILDTRPLSNGHSLIIPKSHYSSLEATPPSVIAAMCSKVPIISNAIMKSTGSGMN