; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy04g019800 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy04g019800
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionSulfate adenylyltransferase subunit 2
Genome locationChr04:51451472..51459975
RNA-Seq ExpressionLcy04g019800
SyntenyLcy04g019800
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0016740 - transferase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6603816.1 hypothetical protein SDJN03_04425, partial [Cucurbita argyrosperma subsp. sororia]1.3e-8882.86Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        MLQ+LNL+P T  +ALSTSISDDP RSGLPLLRPRN TH WA LQSKLKCN RF CLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYLIVAKG +L+AV+ NPLLYALRGTRNGLT VTSKI+RKN + N AEF+EISN++ SG VSAK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

XP_004134798.1 uncharacterized protein LOC101207146 [Cucumis sativus]5.9e-8984.43Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG--
        MLQ LNL P  PILALS S+SDDP  S LPL RPRN  HNWA LQSKLKCNGRFSCLFSDNR+EEQARKALESALGGKKNEFEKWNNEIKKREE GGG  
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG--

Query:  -GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVS
         GGGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYL+VAKGELLLAVVFNPLLYALRGTRNGLTFVTSKI+RK+SA NYAE E ISNK+    VS
Subjt:  -GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVS

Query:  AKEKVARKWGSD
        AK++VARKWGSD
Subjt:  AKEKVARKWGSD

XP_022132456.1 uncharacterized protein LOC111005307 [Momordica charantia]1.4e-9084.29Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-
        ML+ILNL P  P     TSI D+  RSG+P +RPRN  HNWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE GGGG 
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GG+GGWFGSGGWFGWSD+HFWPEAQQTSLAVLGIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTF+TSKI+RK+SAGNYAEF+EISN+E SG VSAK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        EKVARKWGSD
Subjt:  EKVARKWGSD

XP_022950570.1 uncharacterized protein LOC111453631 [Cucurbita moschata]1.2e-8983.33Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        MLQ+LNL+PS+  LALSTS+SDDP RSGLPLLRPRN TH WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYLIVAKG +L+AV+ NPLLYALRGTRNGLT VTSKI+RKN + N AEF+EISN++ SG VSAK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

XP_023545023.1 uncharacterized protein LOC111804448 [Cucurbita pepo subsp. pepo]9.0e-9084.29Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        MLQ+LNL+P T  LALSTSISDDP RSGLPLLRPRN TH WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYLIVAKG +LLAV+ NPLLYALRGTRNGLT VTSKI+RKN + N AEF EISN++ SG VSAK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

TrEMBL top hitse value%identityAlignment
A0A1S3AZU3 uncharacterized protein LOC103484656 isoform X12.0e-8783.89Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEAGGG-
        MLQ LNL  S PILALS S+SDDP  S LPLLRPRN THNWA L S LKCNGRFSCLFS+NRRE EQARKALESALGGKKNEFEKWNNEIKKREE GGG 
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRRE-EQARKALESALGGKKNEFEKWNNEIKKREEAGGG-

Query:  GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSA
        GGGRGGWFGSGGWFGWSD+ FWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK +RK+SA NYAE EEISNK+    V+A
Subjt:  GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSA

Query:  KEKVARKWGSD
        K++VARKWGSD
Subjt:  KEKVARKWGSD

A0A1S3B0A1 uncharacterized protein LOC103484656 isoform X28.3e-8984.29Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG-G
        MLQ LNL  S PILALS S+SDDP  S LPLLRPRN THNWA L S LKCNGRFSCLFS+NRREEQARKALESALGGKKNEFEKWNNEIKKREE GGG G
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG-G

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAV GIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTFVTSK +RK+SA NYAE EEISNK+    V+AK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        ++VARKWGSD
Subjt:  EKVARKWGSD

A0A6J1BSB9 uncharacterized protein LOC1110053076.8e-9184.29Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-
        ML+ILNL P  P     TSI D+  RSG+P +RPRN  HNWARLQ+KLKCN RFSCLFSDNRREEQARKALESALG KKNEFEKWNNEIKKREE GGGG 
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGG-

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GG+GGWFGSGGWFGWSD+HFWPEAQQTSLAVLGIIVMYLIVAKGELLLAV+FNPLLYALRGTRNGLTF+TSKI+RK+SAGNYAEF+EISN+E SG VSAK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        EKVARKWGSD
Subjt:  EKVARKWGSD

A0A6J1GG46 uncharacterized protein LOC1114536315.7e-9083.33Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        MLQ+LNL+PS+  LALSTS+SDDP RSGLPLLRPRN TH WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GGRGGWFGSGGWFGWSD+ FWPEAQQTSLAVLGIIVMYLIVAKG +L+AV+ NPLLYALRGTRNGLT VTSKI+RKN + N AEF+EISN++ SG VSAK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

A0A6J1IV71 uncharacterized protein LOC1114788507.7e-8782.38Show/hide
Query:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG
        MLQ+LNL P T  LALSTSISDD  RSGLPL RPRN TH WA LQSKLKCN RFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE AG GG
Subjt:  MLQILNLNPSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREE-AGGGG

Query:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK
        GGRGGWFGSGGWFGWSD+ FW EAQQTSLAVLGIIVMYLIVAKG +LLAV+ NPLLYALRGTRNGLT VTSK +RKN + N AEF+EISN++ SG VSAK
Subjt:  GGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAK

Query:  EKVARKWGSD
        ++VARKW +D
Subjt:  EKVARKWGSD

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G20130.1 unknown protein1.9e-4562.87Show/hide
Query:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG----GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKG
        K  GRFSCLFS  N+REEQARK+LESALGGKKNEFEKW+ EIKKREE+GGG    GGG GGWFG GGWF  S +HFW EAQQ +  +L I+ +Y++VAKG
Subjt:  KCNGRFSCLFS-DNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGG----GGGRGGWFGSGGWFGWSDEHFWPEAQQTSLAVLGIIVMYLIVAKG

Query:  ELLLAVVFNPLLYALRGTRNGLTFVTSKIM-RKNSAGNYAEFEEISNKEASGQVSAKEKVARKWGSD
        E++ A V NPLLYALRGTR GL+ ++SK+M R+ S  +    EE+  KE S   +AKE V RKWGSD
Subjt:  ELLLAVVFNPLLYALRGTRNGLTFVTSKIM-RKNSAGNYAEFEEISNKEASGQVSAKEKVARKWGSD

AT5G41470.1 Nuclear transport factor 2 (NTF2) family protein8.1e-0435Show/hide
Query:  VRKLYEVVKDKNLTKLSDVVADECPDVYNSIPLLRIFRAKLKVMEFFSHLTKTFGNNLKF
        V K Y  + +KN  +LS  ++ +C    +     + FR K + MEFF  L K+ G N+KF
Subjt:  VRKLYEVVKDKNLTKLSDVVADECPDVYNSIPLLRIFRAKLKVMEFFSHLTKTFGNNLKF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAAACTTTATGTCTATTCCAGAAGCCATATCAATCCTCTTGCTCTGCCATTTCACCTTTCCAATGGCCTTTTCTTCCTCAACAAAGTTCCTCCATTTCCCT
GCCTCAACCTCAATCTCCCCAAACTCTAACTCTAACAACTACAACTTCACCCCTCTTTACACCACAAGTCGTTACTCGCTGCCATCGACAGCCGAGCGTCGAGGC
TATTTGTGCGTAAAATGCAATGGTGGCCCTGAGATGCCACAACAGAGAAAAGACGACGACGACAACGATAACGATGAGACTCTCAAAAGAGTCCGCAAACTTTAC
GAGGTGGTAAAGGACAAAAACCTCACCAAATTGTCTGATGTAGTAGCAGATGAATGTCCGGATGTCTATAATTCCATCCCTTTGTTACGAATCTTCCGAGCCAAA
TTGAAAGTGATGGAATTCTTCTCTCATCTCACCAAAACCTTCGGGAACAACCTAAAATTCACAGCGAAGCCAATGACAAAACATGGATCTATGGCAAAAACTCAG
GGGCCGCGGGCAGCGCTCGTTCATCAAATTAAGAAACAGAGCGCCGAGGCCGTGGTTTATCGGCCGGGAGATGCGAAAACAATGCTTCAGATTCTCAATCTAAAC
CCTTCAACTCCGATTCTCGCCCTATCGACCTCGATTTCTGATGACCCAATTCGCTCCGGCCTCCCTTTACTTCGCCCTCGAAATGAAACACATAATTGGGCTCGT
TTGCAGTCCAAGCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAACCGAAGAGAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAG
AAGAATGAATTTGAGAAATGGAACAATGAAATAAAGAAAAGAGAGGAGGCAGGCGGCGGTGGCGGTGGACGAGGAGGTTGGTTCGGCTCTGGTGGATGGTTCGGT
TGGTCCGATGAACATTTCTGGCCAGAAGCCCAGCAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAGTTGCAAAAGGTGAACTGTTGCTTGCT
GTTGTTTTCAACCCACTACTGTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCAAAAATTATGAGAAAGAACTCTGCTGGTAATTATGCTGAG
TTTGAGGAGATTTCAAACAAAGAAGCCTCAGGCCAGGTCTCTGCTAAAGAGAAAGTTGCAAGGAAATGGGGAAGCGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCAAACTTTATGTCTATTCCAGAAGCCATATCAATCCTCTTGCTCTGCCATTTCACCTTTCCAATGGCCTTTTCTTCCTCAACAAAGTTCCTCCATTTCCCT
GCCTCAACCTCAATCTCCCCAAACTCTAACTCTAACAACTACAACTTCACCCCTCTTTACACCACAAGTCGTTACTCGCTGCCATCGACAGCCGAGCGTCGAGGC
TATTTGTGCGTAAAATGCAATGGTGGCCCTGAGATGCCACAACAGAGAAAAGACGACGACGACAACGATAACGATGAGACTCTCAAAAGAGTCCGCAAACTTTAC
GAGGTGGTAAAGGACAAAAACCTCACCAAATTGTCTGATGTAGTAGCAGATGAATGTCCGGATGTCTATAATTCCATCCCTTTGTTACGAATCTTCCGAGCCAAA
TTGAAAGTGATGGAATTCTTCTCTCATCTCACCAAAACCTTCGGGAACAACCTAAAATTCACAGCGAAGCCAATGACAAAACATGGATCTATGGCAAAAACTCAG
GGGCCGCGGGCAGCGCTCGTTCATCAAATTAAGAAACAGAGCGCCGAGGCCGTGGTTTATCGGCCGGGAGATGCGAAAACAATGCTTCAGATTCTCAATCTAAAC
CCTTCAACTCCGATTCTCGCCCTATCGACCTCGATTTCTGATGACCCAATTCGCTCCGGCCTCCCTTTACTTCGCCCTCGAAATGAAACACATAATTGGGCTCGT
TTGCAGTCCAAGCTCAAGTGCAACGGCAGATTCTCTTGCCTTTTCTCCGACAACCGAAGAGAGGAACAGGCAAGGAAGGCATTAGAAAGTGCACTTGGGGGAAAG
AAGAATGAATTTGAGAAATGGAACAATGAAATAAAGAAAAGAGAGGAGGCAGGCGGCGGTGGCGGTGGACGAGGAGGTTGGTTCGGCTCTGGTGGATGGTTCGGT
TGGTCCGATGAACATTTCTGGCCAGAAGCCCAGCAGACTAGTCTTGCTGTTTTAGGTATAATTGTCATGTATCTCATAGTTGCAAAAGGTGAACTGTTGCTTGCT
GTTGTTTTCAACCCACTACTGTATGCTTTGCGAGGAACAAGAAATGGATTGACTTTTGTTACTTCAAAAATTATGAGAAAGAACTCTGCTGGTAATTATGCTGAG
TTTGAGGAGATTTCAAACAAAGAAGCCTCAGGCCAGGTCTCTGCTAAAGAGAAAGTTGCAAGGAAATGGGGAAGCGATTGATCTTTCTTCCCATTTACTTTTAAC
CTTCCTGTTTCAACAATTCATTAATATTGCTGTGTAATTTTTTGGAAAGTGTTTGTCTCCTAGAATTAGTTTTTCTTTTTTATGTTGATTGCTTAGAGGAGTCTT
CCAAAGATTAGCTTCCTTTCAAACTGGATACTTGAGTTTTTGCCCTCTTCAACAAAAACTACCCTACCCCATTCTAAAAGAGAAAAATTGAGTTGCATGAACTAA
CATATAGTTAAGTTAGTAGTCAATCGTACTGTAAAAAGATAAAATAATTTACTATTTATTATGAATTTTGTCAATAACTAACACGG
Protein sequenceShow/hide protein sequence
MANFMSIPEAISILLLCHFTFPMAFSSSTKFLHFPASTSISPNSNSNNYNFTPLYTTSRYSLPSTAERRGYLCVKCNGGPEMPQQRKDDDDNDNDETLKRVRKLY
EVVKDKNLTKLSDVVADECPDVYNSIPLLRIFRAKLKVMEFFSHLTKTFGNNLKFTAKPMTKHGSMAKTQGPRAALVHQIKKQSAEAVVYRPGDAKTMLQILNLN
PSTPILALSTSISDDPIRSGLPLLRPRNETHNWARLQSKLKCNGRFSCLFSDNRREEQARKALESALGGKKNEFEKWNNEIKKREEAGGGGGGRGGWFGSGGWFG
WSDEHFWPEAQQTSLAVLGIIVMYLIVAKGELLLAVVFNPLLYALRGTRNGLTFVTSKIMRKNSAGNYAEFEEISNKEASGQVSAKEKVARKWGSD