; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10016240 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10016240
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
DescriptionSulfate adenylyltransferase subunit 2
Genome locationChr03:3723099..3725251
RNA-Seq ExpressionHG10016240
SyntenyHG10016240
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134798.1 uncharacterized protein LOC101207146 [Cucumis sativus]1.6e-9288.21Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQ LNLRPP PILALS S+S DPT S LPL RPRN +HNWALLQS LKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKR E+GGG G
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  D-GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS
          GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAE E ISNK+    VS
Subjt:  D-GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS

Query:  ARERVARKWGSD
        A++RVARKWGSD
Subjt:  ARERVARKWGSD

XP_008440068.1 PREDICTED: uncharacterized protein LOC103484656 isoform X1 [Cucumis melo]1.4e-8885.38Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+E EQARKALESALGGKKNEFEKWNNEIKKR E+GGG 
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG

Query:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS
        G GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYL+VAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+
Subjt:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS

Query:  ARERVARKWGSD
        A++RVARKWGSD
Subjt:  ARERVARKWGSD

XP_008440069.1 PREDICTED: uncharacterized protein LOC103484656 isoform X2 [Cucumis melo]5.5e-9085.78Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+EEQARKALESALGGKKNEFEKWNNEIKKR E+GGG G
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYL+VAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+A
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKWGSD
Subjt:  RERVARKWGSD

XP_023545023.1 uncharacterized protein LOC111804448 [Cucurbita pepo subsp. pepo]1.2e-8984.83Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNL PPT  LALSTSIS DPT S LPLLRPRNA H WALLQS LKCN RFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM G GG
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYL+VAKG +LLAV+ NPLLYALRGTRNGLT VTSKILRK+ +SN AEF EISN++VSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKW +D
Subjt:  RERVARKWGSD

XP_038881277.1 uncharacterized protein LOC120072833 [Benincasa hispida]4.8e-9490.05Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNLRPPTPILALSTSISGD T SAL LLRPRNA HNWALLQSNLKCNGRFSCLF DNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM GGGG
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSG WFGWSDDQFWPEAQQTSLAVLGIIVMYL+VAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILR +SASNYAE E+ISNKE    VSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        +ERVA+KWGSD
Subjt:  RERVARKWGSD

TrEMBL top hitse value%identityAlignment
A0A1S3AZU3 uncharacterized protein LOC103484656 isoform X16.6e-8985.38Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+E EQARKALESALGGKKNEFEKWNNEIKKR E+GGG 
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG

Query:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS
        G GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYL+VAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+
Subjt:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS

Query:  ARERVARKWGSD
        A++RVARKWGSD
Subjt:  ARERVARKWGSD

A0A1S3B0A1 uncharacterized protein LOC103484656 isoform X22.7e-9085.78Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+EEQARKALESALGGKKNEFEKWNNEIKKR E+GGG G
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV GIIVMYL+VAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+A
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKWGSD
Subjt:  RERVARKWGSD

A0A6J1BSB9 uncharacterized protein LOC1110053079.5e-8881.52Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        ML++LNL PP P     TSI  + T S +P +RPRN+ HNWA LQ+ LKCN RFSCLFSDNR+EEQARKALESALG KKNEFEKWNNEIKKR EM GGGG
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GG+GGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYL+VAKGELLLAV+FNPLLYALRGTRNGLTF+TSKILRKSSA NYAEF+EISN+EVSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        +E+VARKWGSD
Subjt:  RERVARKWGSD

A0A6J1GG46 uncharacterized protein LOC1114536312.5e-8882.94Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNL P +  LALSTS+S DPT S LPLLRPRNA H WALLQS LKCN RFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM G GG
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYL+VAKG +L+AV+ NPLLYALRGTRNGLT VTSKILRK+ +SN AEF+EISN++VSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKW +D
Subjt:  RERVARKWGSD

A0A6J1IV71 uncharacterized protein LOC1114788501.4e-8682.46Show/hide
Query:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNL PPT  LALSTSIS D T S LPL RPRNA H WALLQS LKCN RFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM G GG
Subjt:  MLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFW EAQQTSLAVLGIIVMYL+VAKG +LLAV+ NPLLYALRGTRNGLT VTSK LRK+ ++N AEF+EISN++VSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKW +D
Subjt:  RERVARKWGSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20130.1 unknown protein9.2e-4355.56Show/hide
Query:  SSALPLLRPR--------NAIHNWALLQSNLKCNGRFSCLFS-DNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG--DGGRGGWFGSGGWF
        SS+ PLL  R        N++  ++L  S  K  GRFSCLFS  N++EEQARK+LESALGGKKNEFEKW+ EIKKR E GGG G   GG GGWFG GGWF
Subjt:  SSALPLLRPR--------NAIHNWALLQSNLKCNGRFSCLFS-DNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG--DGGRGGWFGSGGWF

Query:  GWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKSSASNYAEFEEISNKEVSGHVSARERVARKWGSD
          S D FW EAQQ +  +L I+ +Y++VAKGE++ A V NPLLYALRGTR GL+ ++SK++ R++S  +    EE+  KE S   +A+E V RKWGSD
Subjt:  GWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKSSASNYAEFEEISNKEVSGHVSARERVARKWGSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAACATAACCAAAACCCTGAGGGCCAAGGCTAGCGCTGGTGTTTCTTCTCAGACTCTGCCTATCGCATTTGAAAGCGCAGAGGCCGTGGTTTATTGGCCG
GGAGATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCCCCAACTCCGATTCTGGCCTTATCGACCTCGATTTCCGGTGACCCAACTTCCTCCGCTCTCCCA
TTACTTCGCCCTCGTAATGCAATACACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAATCGAAAAGAGGAA
CAGGCAAGGAAGGCATTAGAAAGTGCACTAGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGGGGAGATGGGTGGTGGTGGTGGT
GATGGTGGACGAGGAGGTTGGTTCGGATCGGGCGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTATA
ATTGTCATGTATCTCTTGGTTGCGAAAGGTGAACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTT
ACTTCAAAAATTTTGAGAAAGTCCTCTGCTAGTAATTATGCTGAGTTTGAGGAGATTTCAAACAAAGAAGTCTCTGGCCATGTCTCTGCCAGAGAGAGAGTTGCA
AGGAAATGGGGGAGCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAAACATAACCAAAACCCTGAGGGCCAAGGCTAGCGCTGGTGTTTCTTCTCAGACTCTGCCTATCGCATTTGAAAGCGCAGAGGCCGTGGTTTATTGGCCG
GGAGATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCCCCAACTCCGATTCTGGCCTTATCGACCTCGATTTCCGGTGACCCAACTTCCTCCGCTCTCCCA
TTACTTCGCCCTCGTAATGCAATACACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAATCGAAAAGAGGAA
CAGGCAAGGAAGGCATTAGAAAGTGCACTAGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGGGGAGATGGGTGGTGGTGGTGGT
GATGGTGGACGAGGAGGTTGGTTCGGATCGGGCGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTATA
ATTGTCATGTATCTCTTGGTTGCGAAAGGTGAACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTT
ACTTCAAAAATTTTGAGAAAGTCCTCTGCTAGTAATTATGCTGAGTTTGAGGAGATTTCAAACAAAGAAGTCTCTGGCCATGTCTCTGCCAGAGAGAGAGTTGCA
AGGAAATGGGGGAGCGATTGA
Protein sequenceShow/hide protein sequence
MQNITKTLRAKASAGVSSQTLPIAFESAEAVVYWPGDAKTMLQVLNLRPPTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEE
QARKALESALGGKKNEFEKWNNEIKKRGEMGGGGGDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGIIVMYLLVAKGELLLAVVFNPLLYALRGTRNGLTFV
TSKILRKSSASNYAEFEEISNKEVSGHVSARERVARKWGSD