CuGenDBv2

Sgr017103 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr017103
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Sulfate adenylyltransferase subunit 2
Genome location: tig00153031:282289..287331
RNA-Seq Expression: Sgr017103
Synteny: Sgr017103
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
                     GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
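
The genome location field packs a contig name and a coordinate range into one string. A minimal sketch of splitting it apart, assuming the `contig:start..end` layout shown in this record and assuming 1-based inclusive coordinates (the record does not state the convention); `parse_location` is a hypothetical helper name, not part of any database API:

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split 'contig:start..end' into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00153031:282289..287331")
# Span length under the assumed 1-based inclusive convention.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene spans 5,043 bp on contig tig00153031.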


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6603816.1 hypothetical protein SDJN03_04425, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 3.2e-85; identity: 80.48%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNLS P  ++ LSTSI+D+ TRS LPL+RPRNA   WA LQSKLKCN RF CLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE+ G  G
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKG +L+AVI NPLLYALRGTRNGLT VTSKI RK  + N AE D +SN +VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

XP_004134798.1 uncharacterized protein LOC101207146 [Cucumis sativus] (e-value: 2.9e-86; identity: 81.6%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGG--G
        MLQ LNL  P P L LS S++D+ T S LPL RPRN + NWA LQSKLKCNGRFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKREEVGG  G
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGG--G

Query:  DGGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVS
         GGGRGGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYL+VAKGELLLAV+FNPLLYALRGTRNGLTFVTSKI RK SA NYAE++ +SN++    VS
Subjt:  DGGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVS

Query:  AKERVARKWGSD
        AK+RVARKWGSD
Subjt:  AKERVARKWGSD

XP_022132456.1 uncharacterized protein LOC111005307 [Momordica charantia] (e-value: 7.9e-92; identity: 85.24%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        ML++LNL+ P P     TSI DESTRS +P IRPRN+  NWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE+GGG  
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GG+GGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTF+TSKI RK SAGNYAE D +SNREVSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        E+VARKWGSD
Subjt:  ERVARKWGSD

XP_023545023.1 uncharacterized protein LOC111804448 [Cucurbita pepo subsp. pepo] (e-value: 8.5e-86; identity: 81.43%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNLS P  +L LSTSI+D+ TRS LPL+RPRNA   WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE+ G  G
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKG +LLAVI NPLLYALRGTRNGLT VTSKI RK  + N AE   +SN +VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

XP_038881277.1 uncharacterized protein LOC120072833 [Benincasa hispida] (e-value: 8.5e-86; identity: 81.9%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNL  P P L LSTSI+ + T S+L L+RPRNA  NWA LQS LKCNGRFSCLF DNRREEQARKALESALGGKKNEFEKWNNEIKKREE+GGG  
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSG WFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSKI R  SA NYAE++ +SN+E    VSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        ERVA+KWGSD
Subjt:  ERVARKWGSD

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3AZU3 uncharacterized protein LOC103484656 isoform X1 (e-value: 3.8e-84; identity: 80.09%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEVGGGD
        MLQ LNL +  P L LS S++D+ T S+LPL+RPRN   NWA L S LKCNGRFSCLFS+NRRE EQARKALESALGGKKNEFEKWNNEIKKREEVGGG 
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEVGGGD

Query:  GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSA
        GGGRGGWFGSGGWFGWSDD FWPEAQQTSLAV GIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSK  RK SA NYAE++ +SN++    V+A
Subjt:  GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSA

Query:  KERVARKWGSD
        K+RVARKWGSD
Subjt:  KERVARKWGSD

A0A1S3B0A1 uncharacterized protein LOC103484656 isoform X2 (e-value: 1.6e-85; identity: 80.48%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQ LNL +  P L LS S++D+ T S+LPL+RPRN   NWA L S LKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGG G
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAV GIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSK  RK SA NYAE++ +SN++    V+AK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKWGSD
Subjt:  ERVARKWGSD

A0A6J1BSB9 uncharacterized protein LOC111005307 (e-value: 3.8e-92; identity: 85.24%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        ML++LNL+ P P     TSI DESTRS +P IRPRN+  NWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE+GGG  
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GG+GGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTF+TSKI RK SAGNYAE D +SNREVSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        E+VARKWGSD
Subjt:  ERVARKWGSD

A0A6J1GG46 uncharacterized protein LOC111453631 (e-value: 1.6e-85; identity: 80.48%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNLS    +L LSTS++D+ TRS LPL+RPRNA   WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE+ G  G
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKG +L+AVI NPLLYALRGTRNGLT VTSKI RK  + N AE D +SN +VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

A0A6J1IV71 uncharacterized protein LOC111478850 (e-value: 1.7e-84; identity: 80.48%)
Query:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNL  P  +L LSTSI+D+ TRS LPL RPRNA   WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE+ G  G
Subjt:  MLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FW EAQQTSLAVLGIIVMYLIVAKG +LLAVI NPLLYALRGTRNGLT VTSK  RK  + N AE D +SN +VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

SwissProt top hits (e-value, % identity, alignment)
No hits found
Arabidopsis top hits (e-value, % identity, alignment)
AT4G22370.1 unknown protein (e-value: 1.2e-08; identity: 39.74%)
Query:  FFADLIKILVNSLVFIVRPTAEDGTTMGLKWRLGWRETLIPLGKGCSIQHNYGFGGKLSIRNVEMIKDALLEVMPMRL
        FF  LI  L   +  IV+PT +DG T+G++W+L   ++ I LGKG S    + + GKL I+NVEM  + +  +  +RL
Subjt:  FFADLIKILVNSLVFIVRPTAEDGTTMGLKWRLGWRETLIPLGKGCSIQHNYGFGGKLSIRNVEMIKDALLEVMPMRL

AT5G20130.1 unknown protein (e-value: 1.7e-44; identity: 60.59%)
Query:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGD---GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKG
        K  GRFSCLFS  N+REEQARK+LESALGGKKNEFEKW+ EIKKREE GGG+   GGG GGWFG GGWF  S DHFW EAQQ +  +L I+ +Y++VAKG
Subjt:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGD---GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKG

Query:  ELLLAVIFNPLLYALRGTRNGLTFVTSKI----SRKISAGNYAEIDGLSNREVSGHVSAKERVARKWGSD
        E++ A + NPLLYALRGTR GL+ ++SK+    + K+S  N  E   +  +E S   +AKE V RKWGSD
Subjt:  ELLLAVIFNPLLYALRGTRNGLTFVTSKI----SRKISAGNYAEIDGLSNREVSGHVSAKERVARKWGSD

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein (e-value: 3.6e-10; identity: 33.83%)
Query:  SSAPQPEKRDGNENEILETVHKVYKEIKNKDITQLSESEVIADEYPDVCDYFPFFRILRTKSEASEFFADLIKILVNSLVFIVRPTAE-DGTTMGLKWRL
        S   +P  R  N+    + V K Y  I  K+  QLS S + +D +    D F F +  R K EA EFF +L+K +  ++ F V    E DG +  + W L
Subjt:  SSAPQPEKRDGNENEILETVHKVYKEIKNKDITQLSESEVIADEYPDVCDYFPFFRILRTKSEASEFFADLIKILVNSLVFIVRPTAE-DGTTMGLKWRL

Query:  GWRETLIPLGKGCSIQHNYGFGGKLSIRNVEMI
         W+   IP  +GCS       GG+L IRN  ++
Subjt:  GWRETLIPLGKGCSIQHNYGFGGKLSIRNVEMI


Sequences
CDS sequence
ATGCATGGCATGGTAGGGCTCGAGTCTGCGATTGATGGCTGCGGTGGCCAAAGTTGGCAGCGTGGAGGGTCCACCGTCCACTTCCGGCCACCCTCCGACGCCTCTGGCGG
TGTGATTTTTCGTAAAGGGGAAGGAAGCATCTCCCATGGCCATTGCCAAATGCCAACCCATTTGTCTCCATTCCAAAACCATCAATATAATCTTCTTCTGCCATTTCTCA
GCCTTCCAATGGCTTTTTCGTCCGCACCAAAGTTCATCCATTTCCCCAACTCACTCTCCCCACCAATTCCAAACCGGCCGCCGTCGTCAACGTGCTCTTTCCTCCTCACA
AACTCCAAGAACTACAACTCCCCCTCTCTTTACACCAGACATTACCCCCCGCCGTCGCCCCCGCCTCCGCCGGCCAAGCGTCGAAGCTCTCTGTGCGTTAAATGCAATGG
TGACGCCGAAAGCTCGGCGCCGCAACCGGAAAAAAGAGATGGCAACGAGAACGAGATTCTCGAAACAGTCCACAAAGTTTACAAGGAAATAAAGAACAAAGACATCACCC
AGTTATCTGAGTCTGAGGTAATAGCAGATGAATATCCCGATGTCTGCGATTACTTCCCTTTTTTCCGAATCCTCCGAACCAAATCGGAAGCGTCGGAATTCTTCGCCGAT
CTCATCAAAATCTTAGTAAACAGTCTAGTATTCATAGTGCGCCCAACAGCAGAAGATGGAACTACGATGGGTCTTAAATGGAGATTAGGGTGGAGGGAAACCCTGATACC
GTTGGGAAAGGGATGCAGCATCCAACATAACTATGGCTTTGGAGGAAAGTTGTCTATTCGAAACGTGGAGATGATTAAGGATGCTCTTCTTGAAGTCATGCCCATGAGAC
TGGTGAATTTGAATTGGGTGCCGAAACAGAAGTCGAGGAGCGCTGCATCCCTGTGTCTGGGTTTCTTCCTCCTCGTTCTCTCGCTCTTCTTTTTCAATTATCCCCCGAGT
GAGTGCCAAAAATACCCAAAACCCGGCGGCCAGCGTCCTTCCCTCAGGCCATCGCAGGGCACAGAGGCCGTGCTTTATCGGCCGGGAGATGCGAAAACCATGCTTCAGGT
TCTCAATCTAAGCTCGCCCAGGCCAACTCTCGGCCTATCGACCTCCATTACTGATGAGTCAACCCGCTCCAGCCTCCCTTTGATTCGCCCTCGAAATGCAATACAAAATT
GGGCGCGTTTACAGTCCAAGCTCAAGTGCAACGGCAGATTCTCCTGCCTTTTCTCGGACAATCGAAGAGAGGAACAGGCTAGGAAGGCATTAGAAAGTGCACTGGGGGGA
AAGAAAAATGAATTTGAGAAATGGAACAATGAAATAAAGAAAAGAGAGGAGGTGGGTGGAGGAGACGGTGGTGGACGCGGAGGTTGGTTCGGATCAGGTGGATGGTTTGG
TTGGTCTGATGACCATTTCTGGCCAGAAGCGCAACAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTACCTCATAGTTGCGAAAGGTGAACTGTTGCTTGCTGTTA
TTTTCAACCCACTGCTGTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCTAAAATTTCGAGAAAGATCTCCGCTGGTAATTATGCTGAGATCGATGGG
CTTTCAAACAGAGAAGTCTCGGGCCATGTCTCTGCTAAAGAGAGAGTTGCAAGAAAATGGGGTAGCGATTGA
mRNA sequence
ATGCATGGCATGGTAGGGCTCGAGTCTGCGATTGATGGCTGCGGTGGCCAAAGTTGGCAGCGTGGAGGGTCCACCGTCCACTTCCGGCCACCCTCCGACGCCTCTGGCGG
TGTGATTTTTCGTAAAGGGGAAGGAAGCATCTCCCATGGCCATTGCCAAATGCCAACCCATTTGTCTCCATTCCAAAACCATCAATATAATCTTCTTCTGCCATTTCTCA
GCCTTCCAATGGCTTTTTCGTCCGCACCAAAGTTCATCCATTTCCCCAACTCACTCTCCCCACCAATTCCAAACCGGCCGCCGTCGTCAACGTGCTCTTTCCTCCTCACA
AACTCCAAGAACTACAACTCCCCCTCTCTTTACACCAGACATTACCCCCCGCCGTCGCCCCCGCCTCCGCCGGCCAAGCGTCGAAGCTCTCTGTGCGTTAAATGCAATGG
TGACGCCGAAAGCTCGGCGCCGCAACCGGAAAAAAGAGATGGCAACGAGAACGAGATTCTCGAAACAGTCCACAAAGTTTACAAGGAAATAAAGAACAAAGACATCACCC
AGTTATCTGAGTCTGAGGTAATAGCAGATGAATATCCCGATGTCTGCGATTACTTCCCTTTTTTCCGAATCCTCCGAACCAAATCGGAAGCGTCGGAATTCTTCGCCGAT
CTCATCAAAATCTTAGTAAACAGTCTAGTATTCATAGTGCGCCCAACAGCAGAAGATGGAACTACGATGGGTCTTAAATGGAGATTAGGGTGGAGGGAAACCCTGATACC
GTTGGGAAAGGGATGCAGCATCCAACATAACTATGGCTTTGGAGGAAAGTTGTCTATTCGAAACGTGGAGATGATTAAGGATGCTCTTCTTGAAGTCATGCCCATGAGAC
TGGTGAATTTGAATTGGGTGCCGAAACAGAAGTCGAGGAGCGCTGCATCCCTGTGTCTGGGTTTCTTCCTCCTCGTTCTCTCGCTCTTCTTTTTCAATTATCCCCCGAGT
GAGTGCCAAAAATACCCAAAACCCGGCGGCCAGCGTCCTTCCCTCAGGCCATCGCAGGGCACAGAGGCCGTGCTTTATCGGCCGGGAGATGCGAAAACCATGCTTCAGGT
TCTCAATCTAAGCTCGCCCAGGCCAACTCTCGGCCTATCGACCTCCATTACTGATGAGTCAACCCGCTCCAGCCTCCCTTTGATTCGCCCTCGAAATGCAATACAAAATT
GGGCGCGTTTACAGTCCAAGCTCAAGTGCAACGGCAGATTCTCCTGCCTTTTCTCGGACAATCGAAGAGAGGAACAGGCTAGGAAGGCATTAGAAAGTGCACTGGGGGGA
AAGAAAAATGAATTTGAGAAATGGAACAATGAAATAAAGAAAAGAGAGGAGGTGGGTGGAGGAGACGGTGGTGGACGCGGAGGTTGGTTCGGATCAGGTGGATGGTTTGG
TTGGTCTGATGACCATTTCTGGCCAGAAGCGCAACAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTACCTCATAGTTGCGAAAGGTGAACTGTTGCTTGCTGTTA
TTTTCAACCCACTGCTGTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCTAAAATTTCGAGAAAGATCTCCGCTGGTAATTATGCTGAGATCGATGGG
CTTTCAAACAGAGAAGTCTCGGGCCATGTCTCTGCTAAAGAGAGAGTTGCAAGAAAATGGGGTAGCGATTGA
Protein sequence
MHGMVGLESAIDGCGGQSWQRGGSTVHFRPPSDASGGVIFRKGEGSISHGHCQMPTHLSPFQNHQYNLLLPFLSLPMAFSSAPKFIHFPNSLSPPIPNRPPSSTCSFLLT
NSKNYNSPSLYTRHYPPPSPPPPPAKRRSSLCVKCNGDAESSAPQPEKRDGNENEILETVHKVYKEIKNKDITQLSESEVIADEYPDVCDYFPFFRILRTKSEASEFFAD
LIKILVNSLVFIVRPTAEDGTTMGLKWRLGWRETLIPLGKGCSIQHNYGFGGKLSIRNVEMIKDALLEVMPMRLVNLNWVPKQKSRSAASLCLGFFLLVLSLFFFNYPPS
ECQKYPKPGGQRPSLRPSQGTEAVLYRPGDAKTMLQVLNLSSPRPTLGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGG
KKNEFEKWNNEIKKREEVGGGDGGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKISAGNYAEIDG
LSNREVSGHVSAKERVARKWGSD
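
The protein sequence can be checked against the CDS by translating codons with the standard genetic code. A minimal sketch using only the last line of the CDS listing, under two assumptions: that this line starts on a codon boundary (it does if each of the 15 preceding lines holds 110 nt), and that the standard nuclear code applies. `translate` is a hypothetical helper written for this illustration:

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code: amino acids listed by codon, with bases in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# Last line of the CDS sequence and last line of the protein sequence.
cds_tail = "CTTTCAAACAGAGAAGTCTCGGGCCATGTCTCTGCTAAAGAGAGAGTTGCAAGAAAATGGGGTAGCGATTGA"
protein_tail = "LSNREVSGHVSAKERVARKWGSD"

peptide = translate(cds_tail)
print(peptide)
print(peptide.rstrip("*") == protein_tail)
```

For this line the translation agrees with the final line of the protein sequence, followed by a single stop codon.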