CuGenDBv2

Lag0013649 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0013649
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Sulfate adenylyltransferase subunit 2
Genome location: chr1:51793527..51798245
RNA-Seq Expression: Lag0013649
Synteny: Lag0013649
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
XP_004134798.1 uncharacterized protein LOC101207146 [Cucumis sativus] | 3.6e-87 | 83.49
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG--
        ML  LNL P  PILALS S SDDP  S LPL RPRN  HNWA LQSKLKCNGRFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKREE GGG  
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG--

Query:  -GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVS
         GGGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYL+VAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRK+SA NY E E ISNK+    VS
Subjt:  -GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVS

Query:  AREKVARKWGSD
        A+++VARKWGSD
Subjt:  AREKVARKWGSD

XP_008440069.1 PREDICTED: uncharacterized protein LOC103484656 isoform X2 [Cucumis melo] | 4.7e-87 | 83.81
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG-G
        ML  LNL  S PILALS S SDDP  SALPLLRPRN THNWA L S LKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREE GGG G
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG-G

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+SA NY E EEISNK+    V+A+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        ++VARKWGSD
Subjt:  EKVARKWGSD

XP_022132456.1 uncharacterized protein LOC111005307 [Momordica charantia] | 1.9e-88 | 82.86
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-
        ML ILNL P  P     TS  D+  RS +P +RPRN  HNWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE GGGG 
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GG+GGWFGSGGWFGWSD+HFWPEAQQTSLAVLGIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTF+TSKILRK+SAGNY EF+EISN+E SG VSA+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        EKVARKWGSD
Subjt:  EKVARKWGSD

XP_022950570.1 uncharacterized protein LOC111453631 [Cucurbita moschata] | 3.0e-86 | 81.43
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        ML +LNL+PS+  LALSTS SDDP RS LPLLRPRN TH WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYLIVAKG +L+AV+ NPLLYALRGTRNGLT VTSKILRK  + N  EF+EISN++ SG VSA+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

XP_038881277.1 uncharacterized protein LOC120072833 [Benincasa hispida] | 3.6e-87 | 85.24
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-
        ML +LNL P TPILALSTS S D   SAL LLRPRN THNWA LQS LKCNGRFSCLF DNRREEQARKALESALGGKKNEFEKWNNEIKKREE GGGG 
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GGRGGWFGSG WFGWSD+ FWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILR TSA NY E E+ISNKE    VSA+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        E+VA+KWGSD
Subjt:  EKVARKWGSD

TrEMBL top hits | e value | % identity
A0A1S3AZU3 uncharacterized protein LOC103484656 isoform X1 | 5.6e-86 | 83.41
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEAGGG-
        ML  LNL  S PILALS S SDDP  SALPLLRPRN THNWA L S LKCNGRFSCLFS+NRRE EQARKALESALGGKKNEFEKWNNEIKKREE GGG 
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEAGGG-

Query:  GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSA
        GGGRGGWFGSGGWFGWSD+ FWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+SA NY E EEISNK+    V+A
Subjt:  GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSA

Query:  REKVARKWGSD
        +++VARKWGSD
Subjt:  REKVARKWGSD

A0A1S3B0A1 uncharacterized protein LOC103484656 isoform X2 | 2.3e-87 | 83.81
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG-G
        ML  LNL  S PILALS S SDDP  SALPLLRPRN THNWA L S LKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREE GGG G
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG-G

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK LRK+SA NY E EEISNK+    V+A+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        ++VARKWGSD
Subjt:  EKVARKWGSD

A0A6J1BSB9 uncharacterized protein LOC111005307 | 9.2e-89 | 82.86
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-
        ML ILNL P  P     TS  D+  RS +P +RPRN  HNWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE GGGG 
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GG+GGWFGSGGWFGWSD+HFWPEAQQTSLAVLGIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTF+TSKILRK+SAGNY EF+EISN+E SG VSA+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        EKVARKWGSD
Subjt:  EKVARKWGSD

A0A6J1GG46 uncharacterized protein LOC111453631 | 1.5e-86 | 81.43
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        ML +LNL+PS+  LALSTS SDDP RS LPLLRPRN TH WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYLIVAKG +L+AV+ NPLLYALRGTRNGLT VTSKILRK  + N  EF+EISN++ SG VSA+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

A0A6J1IV71 uncharacterized protein LOC111478850 | 2.6e-83 | 80
Query:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        ML +LNL P T  LALSTS SDD  RS LPL RPRN TH WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR
        GGRGGWFGSGGWFGWSD+ FW EAQQTSLAVLGIIVMYLIVAKG +LLAV+ NPLLYALRGTRNGLT VTSK LRK  + N  EF+EISN++ SG VSA+
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEASGQVSAR

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT5G20130.1 unknown protein | 3.6e-45 | 61.68
Query:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG----GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKG
        K  GRFSCLFS  N+REEQARK+LESALGGKKNEFEKW+ EIKKREE+GGG    GGG GGWFG GGWF  S +HFW EAQQ +  +L I+ +Y++VAKG
Subjt:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG----GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKG

Query:  ELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKTSAGNYTEFEEISNKEASGQVSAREKVARKWGSD
        E++ A V NPLLYALRGTR GL+ ++SK++ R+ S  +    EE+  KE S   +A+E V RKWGSD
Subjt:  ELLLAVVFNPLLYALRGTRNGLTFVTSKIL-RKTSAGNYTEFEEISNKEASGQVSAREKVARKWGSD

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein | 5.3e-04 | 33.33
Query:  VRKLYEAVKDKNLTKLSDVVADECPDVYNSIPLLRIFRAKLKVMEFFSHLTKTLGNNLQF
        V K Y ++ +KN  +LS  ++ +C    +     + FR K + MEFF  L K++G N++F
Subjt:  VRKLYEAVKDKNLTKLSDVVADECPDVYNSIPLLRIFRAKLKVMEFFSHLTKTLGNNLQF


Sequences
CDS sequence
ATGGCCTTTTCTTCCTCAACAAAGTTCCTCAATTTCCCTACCTCAACCTCAATCTCCCCAAACTCTAACTCTAACAACTACAACTTCACCCCTCTTTACACCACAAGTCG
TTACTCGCTGTCATCGACATCTGAGCGTCGGGGCTATTTGTGCGTAAAATGCAATGGTGGCCCTAAAATGCCACAACAGAACAAAGACAACGACGACAACGATAACGATG
AGACTCTCAAAAGAGTCCGCAAACTTTACGAGGCGGTAAAGGACAAAAACCTCACCAAATTGTCAGATGTAGTAGCTGATGAATGTCCGGATGTCTATAATTCCATCCCT
TTGTTACGAATCTTTCGAGCCAAATTGAAAGTGATGGAATTCTTCTCTCATCTCACCAAAACCTTGGGAAACAACCTACAATTCACAGCAAAGCCAATGACAAAACATGG
ATCTATGAAACTTGGAGAGGATAATGGATCCTTTTGTTCAGCCCAAGCCCATGAGACTGCAATTGCTAATGTTGTGGAGCAGATGGATTTGAATTTGGAGCCAAAACAGA
GGAGATTAGCATCGTTGTATCTGGGTGTCTTCCTCTTTGTTCTCACGCTCTTCTTTTTCCAGTTTTCTTTATCCTGGAGCACAAAGAATAGCCAAGAGGCAAAAACTCAG
GGGCCGCGGGCGGCGCTGGTTCATCAAATTAAGAAACAGAGCGCAGAGGCCGTGGTTTATCGGCCGGGAGATGCGAAAACAATGCTTCACATTCTCAATCTAAACCCTTC
AACTCCGATTCTCGCCCTATCGACCTCGAGTTCTGATGACCCAATTCGCTCCGCCCTCCCTTTACTTCGCCCTCGAAATGATACACATAATTGGGCACGTTTACAGTCCA
AGCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAATCGAAGAGAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAGAAGAATGAATTTGAG
AAATGGAACAATGAAATAAAGAAAAGAGAGGAGGCAGGCGGTGGTGGCGGTGGACGAGGAGGTTGGTTCGGATCTGGTGGATGGTTCGGTTGGTCCGATGAACATTTCTG
GCCAGAAGCACAGCAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAGTTGCAAAAGGTGAACTGTTGCTCGCTGTTGTTTTCAACCCACTGCTGTATG
CTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGACCTCTGCTGGTAATTATACTGAGTTTGAGGAGATTTCAAACAAAGAAGCCTCA
GGCCAGGTCTCTGCTAGAGAGAAAGTTGCAAGGAAATGGGGAAGCGATTGA
mRNA sequence
ATGGCCTTTTCTTCCTCAACAAAGTTCCTCAATTTCCCTACCTCAACCTCAATCTCCCCAAACTCTAACTCTAACAACTACAACTTCACCCCTCTTTACACCACAAGTCG
TTACTCGCTGTCATCGACATCTGAGCGTCGGGGCTATTTGTGCGTAAAATGCAATGGTGGCCCTAAAATGCCACAACAGAACAAAGACAACGACGACAACGATAACGATG
AGACTCTCAAAAGAGTCCGCAAACTTTACGAGGCGGTAAAGGACAAAAACCTCACCAAATTGTCAGATGTAGTAGCTGATGAATGTCCGGATGTCTATAATTCCATCCCT
TTGTTACGAATCTTTCGAGCCAAATTGAAAGTGATGGAATTCTTCTCTCATCTCACCAAAACCTTGGGAAACAACCTACAATTCACAGCAAAGCCAATGACAAAACATGG
ATCTATGAAACTTGGAGAGGATAATGGATCCTTTTGTTCAGCCCAAGCCCATGAGACTGCAATTGCTAATGTTGTGGAGCAGATGGATTTGAATTTGGAGCCAAAACAGA
GGAGATTAGCATCGTTGTATCTGGGTGTCTTCCTCTTTGTTCTCACGCTCTTCTTTTTCCAGTTTTCTTTATCCTGGAGCACAAAGAATAGCCAAGAGGCAAAAACTCAG
GGGCCGCGGGCGGCGCTGGTTCATCAAATTAAGAAACAGAGCGCAGAGGCCGTGGTTTATCGGCCGGGAGATGCGAAAACAATGCTTCACATTCTCAATCTAAACCCTTC
AACTCCGATTCTCGCCCTATCGACCTCGAGTTCTGATGACCCAATTCGCTCCGCCCTCCCTTTACTTCGCCCTCGAAATGATACACATAATTGGGCACGTTTACAGTCCA
AGCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAATCGAAGAGAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAGAAGAATGAATTTGAG
AAATGGAACAATGAAATAAAGAAAAGAGAGGAGGCAGGCGGTGGTGGCGGTGGACGAGGAGGTTGGTTCGGATCTGGTGGATGGTTCGGTTGGTCCGATGAACATTTCTG
GCCAGAAGCACAGCAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAGTTGCAAAAGGTGAACTGTTGCTCGCTGTTGTTTTCAACCCACTGCTGTATG
CTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCAAAAATTTTGAGAAAGACCTCTGCTGGTAATTATACTGAGTTTGAGGAGATTTCAAACAAAGAAGCCTCA
GGCCAGGTCTCTGCTAGAGAGAAAGTTGCAAGGAAATGGGGAAGCGATTGA
Protein sequence
MAFSSSTKFLNFPTSTSISPNSNSNNYNFTPLYTTSRYSLSSTSERRGYLCVKCNGGPKMPQQNKDNDDNDNDETLKRVRKLYEAVKDKNLTKLSDVVADECPDVYNSIP
LLRIFRAKLKVMEFFSHLTKTLGNNLQFTAKPMTKHGSMKLGEDNGSFCSAQAHETAIANVVEQMDLNLEPKQRRLASLYLGVFLFVLTLFFFQFSLSWSTKNSQEAKTQ
GPRAALVHQIKKQSAEAVVYRPGDAKTMLHILNLNPSTPILALSTSSSDDPIRSALPLLRPRNDTHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFE
KWNNEIKKREEAGGGGGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKILRKTSAGNYTEFEEISNKEAS
GQVSAREKVARKWGSD
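
The protein sequence above should correspond to a translation of the CDS sequence. The following is a minimal Python sketch of that check, assuming the standard genetic code (NCBI translation table 1) and no ambiguous bases. The names `translate`, `cds_start` and `cds_end` are illustrative and not part of the database record; only the first 30 and last 51 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered with
# bases T, C, A, G; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate an in-frame nucleotide string, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # raises KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nt and last 51 nt of the CDS listed above (both are in frame).
cds_start = "ATGGCCTTTTCTTCCTCAACAAAGTTCCTC"
cds_end = "GGCCAGGTCTCTGCTAGAGAGAAAGTTGCAAGGAAATGGGGAAGCGATTGA"

print(translate(cds_start))
print(translate(cds_end))
```

Passing the full CDS to `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.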