; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lsi01G004770 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene IDLsi01G004770
OrganismLagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
DescriptionSulfate adenylyltransferase subunit 2
Genome locationchr01:3966149..3968437
RNA-Seq ExpressionLsi01G004770
SyntenyLsi01G004770
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134798.1 uncharacterized protein LOC101207146 [Cucumis sativus]4.3e-8482.55Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQ LNLRP  PILALS S+S DPT S LPL RPRN +HNWALLQS LKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKR E+GGG G
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  D-GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS
          GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL           GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAE E ISNK+    VS
Subjt:  D-GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS

Query:  ARERVARKWGSD
        A++RVARKWGSD
Subjt:  ARERVARKWGSD

XP_008440068.1 PREDICTED: uncharacterized protein LOC103484656 isoform X1 [Cucumis melo]2.6e-8180.66Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+E EQARKALESALGGKKNEFEKWNNEIKKR E+GGG 
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG

Query:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS
        G GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV            GELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+
Subjt:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS

Query:  ARERVARKWGSD
        A++RVARKWGSD
Subjt:  ARERVARKWGSD

XP_008440069.1 PREDICTED: uncharacterized protein LOC103484656 isoform X2 [Cucumis melo]1.1e-8281.04Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+EEQARKALESALGGKKNEFEKWNNEIKKR E+GGG G
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV            GELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+A
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKWGSD
Subjt:  RERVARKWGSD

XP_023545023.1 uncharacterized protein LOC111804448 [Cucurbita pepo subsp. pepo]2.0e-8179.62Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNL P T  LALSTSIS DPT S LPLLRPRNA H WALLQS LKCN RFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM G GG
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL           G +LLAV+ NPLLYALRGTRNGLT VTSKILRK+ +SN AEF EISN++VSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKW +D
Subjt:  RERVARKWGSD

XP_038881277.1 uncharacterized protein LOC120072833 [Benincasa hispida]1.0e-8584.83Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNLRP TPILALSTSISGD T SAL LLRPRNA HNWALLQSNLKCNGRFSCLF DNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM GGGG
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSG WFGWSDDQFWPEAQQTSLAVL           GELLLAVVFNPLLYALRGTRNGLTFVTSKILR +SASNYAE E+ISNKE    VSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        +ERVA+KWGSD
Subjt:  RERVARKWGSD

TrEMBL top hitse value%identityAlignment
A0A1S3AZU3 uncharacterized protein LOC103484656 isoform X11.3e-8180.66Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+E EQARKALESALGGKKNEFEKWNNEIKKR E+GGG 
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKE-EQARKALESALGGKKNEFEKWNNEIKKRGEMGGGG

Query:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS
        G GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV            GELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+
Subjt:  GDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVS

Query:  ARERVARKWGSD
        A++RVARKWGSD
Subjt:  ARERVARKWGSD

A0A1S3B0A1 uncharacterized protein LOC103484656 isoform X25.2e-8381.04Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQ LNLR   PILALS S+S DPT SALPLLRPRN  HNWALL SNLKCNGRFSCLFS+NR+EEQARKALESALGGKKNEFEKWNNEIKKR E+GGG G
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAV            GELLLAV+FNPLLYALRGTRNGLTFVTSK LRKSSASNYAE EEISNK+    V+A
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKWGSD
Subjt:  RERVARKWGSD

A0A6J1BSB9 uncharacterized protein LOC1110053071.6e-7976.3Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        ML++LNL P  P     TSI  + T S +P +RPRN+ HNWA LQ+ LKCN RFSCLFSDNR+EEQARKALESALG KKNEFEKWNNEIKKR EM GGGG
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GG+GGWFGSGGWFGWSDD FWPEAQQTSLAVL           GELLLAV+FNPLLYALRGTRNGLTF+TSKILRKSSA NYAEF+EISN+EVSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        +E+VARKWGSD
Subjt:  RERVARKWGSD

A0A6J1GG46 uncharacterized protein LOC1114536313.7e-8178.2Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNL P +  LALSTS+S DPT S LPLLRPRNA H WALLQS LKCN RFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM G GG
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL           G +L+AV+ NPLLYALRGTRNGLT VTSKILRK+ +SN AEF+EISN++VSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKW +D
Subjt:  RERVARKWGSD

A0A6J1IV71 uncharacterized protein LOC1114788502.2e-7877.25Show/hide
Query:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG
        MLQVLNL P T  LALSTSIS D T S LPL RPRNA H WALLQS LKCN RFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKR EM G GG
Subjt:  MLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG

Query:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA
         GGRGGWFGSGGWFGWSDDQFW EAQQTSLAVL           G +LLAV+ NPLLYALRGTRNGLT VTSK LRK+ ++N AEF+EISN++VSGHVSA
Subjt:  DGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVL-----------GELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISNKEVSGHVSA

Query:  RERVARKWGSD
        ++RVARKW +D
Subjt:  RERVARKWGSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20130.1 unknown protein3.8e-3854.55Show/hide
Query:  SSALPLLRPR--------NAIHNWALLQSNLKCNGRFSCLFS-DNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG--DGGRGGWFGSGGWF
        SS+ PLL  R        N++  ++L  S  K  GRFSCLFS  N++EEQARK+LESALGGKKNEFEKW+ EIKKR E GGG G   GG GGWFG GGWF
Subjt:  SSALPLLRPR--------NAIHNWALLQSNLKCNGRFSCLFS-DNRKEEQARKALESALGGKKNEFEKWNNEIKKRGEMGGGGG--DGGRGGWFGSGGWF

Query:  GWSDDQFWPEAQQ---TSLAVL--------GELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKSSASNYAEFEEISNKEVSGHVSARERVARKWGSD
          S D FW EAQQ   T LA+L        GE++ A V NPLLYALRGTR GL+ ++SK++ R++S  +    EE+  KE S   +A+E V RKWGSD
Subjt:  GWSDDQFWPEAQQ---TSLAVL--------GELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKSSASNYAEFEEISNKEVSGHVSARERVARKWGSD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAAACATAACCAAAACCCTGAGGGCCAAGGCTAGCGCTGGTGTTTCTTCTCAGACTCTGCCTATCGCATTTGAAAGCGCAGAGGCCGTGGTTTATTGGCCGGGAGA
TGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCCCTAACTCCGATTCTGGCCTTATCGACCTCGATTTCCGGTGACCCAACTTCCTCCGCTCTCCCATTACTTCGCC
CTCGTAATGCAATACACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAATCGAAAAGAGGAACAGGCAAGGAAGGCA
TTAGAAAGTGCACTAGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGGGGAGATGGGTGGTGGTGGTGGTGATGGTGGACGAGGAGGTTG
GTTCGGATCGGGCGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTGAACTGTTGCTTGCTGTTGTTTTCAACC
CACTGCTTTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGTCCTCTGCTAGTAATTATGCTGAGTTTGAGGAGATTTCAAAC
AAAGAAGTCTCTGGCCATGTCTCTGCCAGAGAGAGAGTTGCAAGGAAATGGGGGAGCGATTGA
mRNA sequenceShow/hide mRNA sequence
TTTTTTTTTATCAATTTATCTTTTATGTCAATGTACAAAACATAAAAATATTTTATCGATGAAAATATGGACGTAGAAATTTAAGCTCAGTTTGGTTATATTTCATTAGT
TTATTGCCATAAAGTTATCCCTTTGAATGCAAAACATAACCAAAACCCTGAGGGCCAAGGCTAGCGCTGGTGTTTCTTCTCAGACTCTGCCTATCGCATTTGAAAGCGCA
GAGGCCGTGGTTTATTGGCCGGGAGATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGACCCCTAACTCCGATTCTGGCCTTATCGACCTCGATTTCCGGTGACCCAAC
TTCCTCCGCTCTCCCATTACTTCGCCCTCGTAATGCAATACACAATTGGGCGCTTTTACAGTCCAACCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAATC
GAAAAGAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTAGGGGGAAAGAAAAATGAATTTGAGAAATGGAATAATGAAATAAAGAAAAGAGGGGAGATGGGTGGTGGT
GGTGGTGATGGTGGACGAGGAGGTTGGTTCGGATCGGGCGGATGGTTTGGTTGGTCTGATGACCAATTCTGGCCAGAAGCACAACAGACTAGTCTTGCTGTTTTAGGTGA
ACTGTTGCTTGCTGTTGTTTTCAACCCACTGCTTTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGTCCTCTGCTAGTAATT
ATGCTGAGTTTGAGGAGATTTCAAACAAAGAAGTCTCTGGCCATGTCTCTGCCAGAGAGAGAGTTGCAAGGAAATGGGGGAGCGATTGA
Protein sequenceShow/hide protein sequence
MQNITKTLRAKASAGVSSQTLPIAFESAEAVVYWPGDAKTMLQVLNLRPLTPILALSTSISGDPTSSALPLLRPRNAIHNWALLQSNLKCNGRFSCLFSDNRKEEQARKA
LESALGGKKNEFEKWNNEIKKRGEMGGGGGDGGRGGWFGSGGWFGWSDDQFWPEAQQTSLAVLGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKSSASNYAEFEEISN
KEVSGHVSARERVARKWGSD