CuGenDBv2

Sgr014887 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr014887
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Sulfate adenylyltransferase subunit 2
Genome location: tig00001400:52361..57355
RNA-Seq Expression: Sgr014887
Synteny: Sgr014887
Gene Ontology terms:
  GO:0016021 - integral component of membrane (cellular component)
  GO:0016740 - transferase activity (molecular function)
InterPro domains: NA
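
The genome location field above follows a `contig:start..end` pattern. As a minimal sketch, the string can be split into its parts as shown below. The names `GenomeLocation` and `parse_location` are hypothetical, and the span calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location string."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # ASSUMPTION: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Split a location such as 'tig00001400:52361..57355' into its parts."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    contig, start, end = match.groups()
    return GenomeLocation(contig, int(start), int(end))


loc = parse_location("tig00001400:52361..57355")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under that coordinate assumption the locus spans more bases than the CDS listed further down, which would be consistent with the gene model containing introns or untranslated regions.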


Homology
GenBank top hits (e value, % identity, alignment)
XP_004134798.1 uncharacterized protein LOC101207146 [Cucumis sativus] (e value: 3.1e-87; identity: 82.08%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGG--G
        MLQ LNL  P PIL LS S++D+ T S LPL RPRN + NWA LQSKLKCNGRFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKREEVGG  G
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGG--G

Query:  DGGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVS
         GGGRGGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYL+VAKGELLLAV+FNPLLYALRGTRNGLTFVTSKI RK+SA NYAE++ +SN++    VS
Subjt:  DGGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVS

Query:  AKERVARKWGSD
        AK+RVARKWGSD
Subjt:  AKERVARKWGSD

XP_008440069.1 PREDICTED: uncharacterized protein LOC103484656 isoform X2 [Cucumis melo] (e value: 3.5e-86; identity: 80.95%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQ LNL +  PIL LS S++D+ T S+LPL+RPRN   NWA L S LKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGG G
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAV GIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSK  RK+SA NYAE++ +SN++    V+AK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKWGSD
Subjt:  ERVARKWGSD

XP_022132456.1 uncharacterized protein LOC111005307 [Momordica charantia] (e value: 4.2e-92; identity: 85.24%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        ML++LNL+ P P     TSI DESTRS +P IRPRN+  NWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE+GGG  
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GG+GGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTF+TSKI RK+SAGNYAE D +SNREVSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        E+VARKWGSD
Subjt:  ERVARKWGSD

XP_023545023.1 uncharacterized protein LOC111804448 [Cucurbita pepo subsp. pepo] (e value: 7.7e-86; identity: 81.43%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNLS P   L LSTSI+D+ TRS LPL+RPRNA   WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE+ G  G
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKG +LLAVI NPLLYALRGTRNGLT VTSKI RK  + N AE   +SN +VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

XP_038881277.1 uncharacterized protein LOC120072833 [Benincasa hispida] (e value: 4.1e-87; identity: 82.86%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNL  P PIL LSTSI+ + T S+L L+RPRNA  NWA LQS LKCNGRFSCLF DNRREEQARKALESALGGKKNEFEKWNNEIKKREE+GGG  
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSG WFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSKI R TSA NYAE++ +SN+E    VSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        ERVA+KWGSD
Subjt:  ERVARKWGSD

TrEMBL top hits (e value, % identity, alignment)
A0A1S3AZU3 uncharacterized protein LOC103484656 isoform X1 (e value: 4.1e-85; identity: 80.57%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEVGGGD
        MLQ LNL +  PIL LS S++D+ T S+LPL+RPRN   NWA L S LKCNGRFSCLFS+NRRE EQARKALESALGGKKNEFEKWNNEIKKREEVGGG 
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEVGGGD

Query:  GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSA
        GGGRGGWFGSGGWFGWSDD FWPEAQQTSLAV GIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSK  RK+SA NYAE++ +SN++    V+A
Subjt:  GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSA

Query:  KERVARKWGSD
        K+RVARKWGSD
Subjt:  KERVARKWGSD

A0A1S3B0A1 uncharacterized protein LOC103484656 isoform X2 (e value: 1.7e-86; identity: 80.95%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQ LNL +  PIL LS S++D+ T S+LPL+RPRN   NWA L S LKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGG G
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAV GIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSK  RK+SA NYAE++ +SN++    V+AK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKWGSD
Subjt:  ERVARKWGSD

A0A6J1BSB9 uncharacterized protein LOC111005307 (e value: 2.0e-92; identity: 85.24%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        ML++LNL+ P P     TSI DESTRS +P IRPRN+  NWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE+GGG  
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GG+GGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTF+TSKI RK+SAGNYAE D +SNREVSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        E+VARKWGSD
Subjt:  ERVARKWGSD

A0A6J1GG46 uncharacterized protein LOC111453631 (e value: 1.4e-85; identity: 80.48%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNLS     L LSTS++D+ TRS LPL+RPRNA   WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE+ G  G
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FWPEAQQTSLAVLGIIVMYLIVAKG +L+AVI NPLLYALRGTRNGLT VTSKI RK  + N AE D +SN +VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

A0A6J1IV71 uncharacterized protein LOC111478850 (e value: 1.6e-84; identity: 80.48%)
Query:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG
        MLQVLNL  P   L LSTSI+D+ TRS LPL RPRNA   WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE+ G  G
Subjt:  MLQVLNLSSPRPILGLSTSITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDG

Query:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK
        GGRGGWFGSGGWFGWSDD FW EAQQTSLAVLGIIVMYLIVAKG +LLAVI NPLLYALRGTRNGLT VTSK  RK  + N AE D +SN +VSGHVSAK
Subjt:  GGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAK

Query:  ERVARKWGSD
        +RVARKW +D
Subjt:  ERVARKWGSD

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G20130.1 unknown protein (e value: 3.5e-44; identity: 60.48%)
Query:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGD---GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKG
        K  GRFSCLFS  N+REEQARK+LESALGGKKNEFEKW+ EIKKREE GGG+   GGG GGWFG GGWF  S DHFW EAQQ +  +L I+ +Y++VAKG
Subjt:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGD---GGGRGGWFGSGGWFGWSDDHFWPEAQQTSLAVLGIIVMYLIVAKG

Query:  ELLLAVIFNPLLYALRGTRNGLTFVTSKI-SRKTSAGNYAEIDGLSNREVSGHVSAKERVARKWGSD
        E++ A + NPLLYALRGTR GL+ ++SK+  R+ S  +    + +  +E S   +AKE V RKWGSD
Subjt:  ELLLAVIFNPLLYALRGTRNGLTFVTSKI-SRKTSAGNYAEIDGLSNREVSGHVSAKERVARKWGSD

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein (e value: 2.2e-06; identity: 28.57%)
Query:  SSAPQPEKRDGNEKEILETVHKVYKEIKNKDITQLSESEVMADEYPDVCDYFPFFRILRTKSEASEFFADLIKILVNSLVFIVE----------------
        S   +P  R  N+    + V K Y  I  K+  QLS S + +D +    D F F +  R K EA EFF +L+K +  ++ F VE                
Subjt:  SSAPQPEKRDGNEKEILETVHKVYKEIKNKDITQLSESEVMADEYPDVCDYFPFFRILRTKSEASEFFADLIKILVNSLVFIVE----------------

Query:  ---GNLIPLGKGCSIQHNYGFGGKLSIRNVEMIKDALLEV--MPMRLVSEVYTIHSEFELGAETEVEE
           G  IP  +GCS       GG+L IRN  ++ ++ ++   + + L+  +  +  EF  GAE  +E+
Subjt:  ---GNLIPLGKGCSIQHNYGFGGKLSIRNVEMIKDALLEV--MPMRLVSEVYTIHSEFELGAETEVEE
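
In the alignment blocks above, the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below counts identities for a single block from its query and middle lines. The function name `midline_identity` is hypothetical, and the result for one block is not expected to equal the % identity reported for a whole hit, which is computed over the full alignment.

```python
from typing import Tuple


def midline_identity(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical columns, total columns) for one alignment block.

    `query` and `midline` are the residue strings only, with the
    'Query:' label and leading padding already removed.
    """
    # Trailing spaces of the midline are often lost in copied text, so pad it.
    padded = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, padded) if m.isalpha() and m == q
    )
    return identical, len(query)


# Last block of the first GenBank hit in this record.
ident, total = midline_identity("AKERVARKWGSD", "AK+RVARKWGSD")
print(f"{ident}/{total} identical ({100 * ident / total:.2f}%)")
```

Summing the two counts across every block of a hit, then dividing, gives a per-hit figure that can be compared against the listed % identity.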


Sequences
CDS sequence
ATGCATGGCATGGTAGGGCTCGAGTCTGCGATTGATGGCTGCGGTGGCCAAAGTTGGCAGCGTGGAGGGTCCACCGTCCACTTCCGGCCACCCTCCGCCGCCTCTGACGG
TGTGATTTTTCGTAAACCTTCCATGGCTTTTCGTCCGCACCAGTTCATCCATTTCCCCAACTCACTCTCCCCACCAATTCCAAACCGGCCGCCGTCGTCAACGTGCTCTT
CCTCCTCACAACTCAAGAACTACAACTCCCCCTCTCTTTACACCAGACATTACCCCCCGCCGTCGCCGCCGGCCAAGCGTCGAAGCTCTCTGTGCGTTAAATGCAATGGT
GACGCCGAAAGCTCGGCGCCGCAACCGGAAAAAAGAGATGGCAACGAGAAGGAGATTCTCGAAACAGTCCACAAAGTTTACAAGGAAATAAAGAACAAAGACATCACCCA
GTTATCTGAGTCTGAGGTAATGGCAGATGAATATCCCGATGTCTGCGATTACTTCCCTTTTTTCCGAATCCTCCGAACCAAATCGGAAGCATCGGAATTCTTCGCCGATC
TCATCAAAATCTTAGTAAACAGTCTAGTATTCATAGTGGAGGGAAACCTGATACCGTTGGGAAAGGGATGCAGCATCCAACATAACTATGGCTTTGGAGGAAAGTTGTCT
ATTCGAAACGTGGAGATGATTAAGGATGCTCTTCTTGAAGTCATGCCCATGAGACTGGTGAGTGAAGTTTATACAATACACAGTGAATTTGAATTGGGTGCCGAAACAGA
AGTCGAGGAGCGCTGCATCCCTGTGTCTGGGTTTCTTCCTCCTCGTTCTCTCGCTCTTCTTTTTCAATGCCAAAAATACCCAAAACCCGGCGGCCAGCATCCATCCCTCA
GGCCATCGCAGGGCACAGAGGCCGTGCTTTATCGGCCGGGAGATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGCTCGCCCAGGCCAATTCTCGGCCTATCGACCTCC
ATTACTGATGAGTCAACCCGCTCCAGCCTCCCTTTGATTCGCCCTCGAAATGCAATACAAAATTGGGCGCGTTTACAGTCCAAGCTCAAGTGCAACGGCAGATTCTCTTG
CCTTTTCTCGGACAATCGAAGAGAGGAACAGGCTAGGAAGGCATTAGAAAGTGCACTCGGGGGAAAGAAAAATGAATTTGAGAAATGGAACAATGAAATAAAGAAAAGAG
AGGAGGTGGGTGGAGGAGACGGTGGTGGACGCGGAGGTTGGTTTGGATCAGGTGGATGGTTTGGTTGGTCTGATGACCATTTCTGGCCAGAAGCGCAACAGACTAGTCTT
GCTGTTTTAGGTATAATTGTCATGTACCTCATAGTTGCGAAAGGTGAACTGTTGCTTGCTGTTATTTTCAACCCACTGCTGTATGCTTTGCGAGGAACAAGAAATGGATT
GACTTTTGTTACTTCTAAAATTTCGAGAAAGACCTCCGCTGGTAATTATGCTGAGATCGATGGGCTTTCAAACAGAGAAGTCTCGGGCCATGTCTCTGCTAAAGAGAGAG
TTGCAAGAAAATGGGGTAGCGATTGA
mRNA sequence
ATGCATGGCATGGTAGGGCTCGAGTCTGCGATTGATGGCTGCGGTGGCCAAAGTTGGCAGCGTGGAGGGTCCACCGTCCACTTCCGGCCACCCTCCGCCGCCTCTGACGG
TGTGATTTTTCGTAAACCTTCCATGGCTTTTCGTCCGCACCAGTTCATCCATTTCCCCAACTCACTCTCCCCACCAATTCCAAACCGGCCGCCGTCGTCAACGTGCTCTT
CCTCCTCACAACTCAAGAACTACAACTCCCCCTCTCTTTACACCAGACATTACCCCCCGCCGTCGCCGCCGGCCAAGCGTCGAAGCTCTCTGTGCGTTAAATGCAATGGT
GACGCCGAAAGCTCGGCGCCGCAACCGGAAAAAAGAGATGGCAACGAGAAGGAGATTCTCGAAACAGTCCACAAAGTTTACAAGGAAATAAAGAACAAAGACATCACCCA
GTTATCTGAGTCTGAGGTAATGGCAGATGAATATCCCGATGTCTGCGATTACTTCCCTTTTTTCCGAATCCTCCGAACCAAATCGGAAGCATCGGAATTCTTCGCCGATC
TCATCAAAATCTTAGTAAACAGTCTAGTATTCATAGTGGAGGGAAACCTGATACCGTTGGGAAAGGGATGCAGCATCCAACATAACTATGGCTTTGGAGGAAAGTTGTCT
ATTCGAAACGTGGAGATGATTAAGGATGCTCTTCTTGAAGTCATGCCCATGAGACTGGTGAGTGAAGTTTATACAATACACAGTGAATTTGAATTGGGTGCCGAAACAGA
AGTCGAGGAGCGCTGCATCCCTGTGTCTGGGTTTCTTCCTCCTCGTTCTCTCGCTCTTCTTTTTCAATGCCAAAAATACCCAAAACCCGGCGGCCAGCATCCATCCCTCA
GGCCATCGCAGGGCACAGAGGCCGTGCTTTATCGGCCGGGAGATGCGAAAACAATGCTTCAGGTTCTCAATCTAAGCTCGCCCAGGCCAATTCTCGGCCTATCGACCTCC
ATTACTGATGAGTCAACCCGCTCCAGCCTCCCTTTGATTCGCCCTCGAAATGCAATACAAAATTGGGCGCGTTTACAGTCCAAGCTCAAGTGCAACGGCAGATTCTCTTG
CCTTTTCTCGGACAATCGAAGAGAGGAACAGGCTAGGAAGGCATTAGAAAGTGCACTCGGGGGAAAGAAAAATGAATTTGAGAAATGGAACAATGAAATAAAGAAAAGAG
AGGAGGTGGGTGGAGGAGACGGTGGTGGACGCGGAGGTTGGTTTGGATCAGGTGGATGGTTTGGTTGGTCTGATGACCATTTCTGGCCAGAAGCGCAACAGACTAGTCTT
GCTGTTTTAGGTATAATTGTCATGTACCTCATAGTTGCGAAAGGTGAACTGTTGCTTGCTGTTATTTTCAACCCACTGCTGTATGCTTTGCGAGGAACAAGAAATGGATT
GACTTTTGTTACTTCTAAAATTTCGAGAAAGACCTCCGCTGGTAATTATGCTGAGATCGATGGGCTTTCAAACAGAGAAGTCTCGGGCCATGTCTCTGCTAAAGAGAGAG
TTGCAAGAAAATGGGGTAGCGATTGA
Protein sequence
MHGMVGLESAIDGCGGQSWQRGGSTVHFRPPSAASDGVIFRKPSMAFRPHQFIHFPNSLSPPIPNRPPSSTCSSSSQLKNYNSPSLYTRHYPPPSPPAKRRSSLCVKCNG
DAESSAPQPEKRDGNEKEILETVHKVYKEIKNKDITQLSESEVMADEYPDVCDYFPFFRILRTKSEASEFFADLIKILVNSLVFIVEGNLIPLGKGCSIQHNYGFGGKLS
IRNVEMIKDALLEVMPMRLVSEVYTIHSEFELGAETEVEERCIPVSGFLPPRSLALLFQCQKYPKPGGQHPSLRPSQGTEAVLYRPGDAKTMLQVLNLSSPRPILGLSTS
ITDESTRSSLPLIRPRNAIQNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEVGGGDGGGRGGWFGSGGWFGWSDDHFWPEAQQTSL
AVLGIIVMYLIVAKGELLLAVIFNPLLYALRGTRNGLTFVTSKISRKTSAGNYAEIDGLSNREVSGHVSAKERVARKWGSD
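
The protein sequence above should correspond to a translation of the CDS. A minimal sketch of that check follows, assuming the standard genetic code (NCBI translation table 1). The record does not say which table was used, and the names `CODON_TABLE` and `translate` are illustrative. Only the first 30 and last 24 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI table 1), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    seq = cds.upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3))


# First 30 nt and last 24 nt of the CDS listed in this record.
cds_start = "ATGCATGGCATGGTAGGGCTCGAGTCTGCG"
cds_end = "GCAAGAAAATGGGGTAGCGATTGA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS, with line breaks removed, to `translate` and dropping the trailing `*` allows a comparison against the complete protein sequence.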