; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr018122 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr018122
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionprotein CURVATURE THYLAKOID 1B, chloroplastic
Genome locationtig00153107:380122..381888
RNA-Seq ExpressionSgr018122
SyntenySgr018122
Gene Ontology termsGO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025564 - Cyanobacterial aminoacyl-tRNA synthetase, CAAD domain
IPR033344 - Protein CURVATURE THYLAKOID 1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140528.1 protein CURVATURE THYLAKOID 1B, chloroplastic [Cucumis sativus]5.0e-5877.42Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAP-SQIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQSA+PSS +PLPTLP PP P SQIRPWK+TAYCRKIARNVM MA+GE  AEVA    AE+PE +KK QEAW KVEDKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAP-SQIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPL+PGLLELVGIGYTGWF Y N++F PDREA ++KLKETYSEIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

XP_008459821.1 PREDICTED: protein CURVATURE THYLAKOID 1B, chloroplastic [Cucumis melo]3.8e-5878.06Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQS +PSS +PLPTLP PP PS QIRPWK+TAYCRKIARNVMAMA+GE  AEVA    AE+PE IKK QEAW KVEDKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPL+PGLLELVGIGYTGWF Y N++F PDREA ++KLKETYSEIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

XP_022929793.1 protein CURVATURE THYLAKOID 1B, chloroplastic [Cucurbita moschata]5.5e-5776.77Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQSA+PSS +PLPTLP PP PS QIR WK+TAYCRKIARNVMAMA+GE  AEVA    AELPE IKK QEAW KV+DKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPLIPGLLELVGIGYTGWF Y N++F PDREA ++KL+ETY+E+IGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

XP_022992417.1 protein CURVATURE THYLAKOID 1B, chloroplastic [Cucurbita maxima]2.9e-5878.06Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQSA+PSS +PLPTLP PPAPS QIRPWK+TAYCR+IARNVMAMA+GE  AEVA    AELPE IKK QEAW KV+DKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPLIPGLLELVGIGYTGWF Y N++F PDREA ++KL+ETY+EIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

XP_038906765.1 protein CURVATURE THYLAKOID 1B, chloroplastic [Benincasa hispida]3.8e-5878.06Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAP-SQIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQSA+PS+ +PLPTLP PP P SQIRPWK+TAYCRK+ARNVMAMA+GE  AEVA    AE+PE IKK QEAW KVEDKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAP-SQIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPLIPGLLELVGIGYTGWF Y N++F PDREA ++KLKETYSEIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

TrEMBL top hitse value%identityAlignment
A0A0A0KCB7 CAAD domain-containing protein2.4e-5877.42Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAP-SQIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQSA+PSS +PLPTLP PP P SQIRPWK+TAYCRKIARNVM MA+GE  AEVA    AE+PE +KK QEAW KVEDKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAP-SQIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPL+PGLLELVGIGYTGWF Y N++F PDREA ++KLKETYSEIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

A0A1S3CCA4 protein CURVATURE THYLAKOID 1B, chloroplastic1.8e-5878.06Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQS +PSS +PLPTLP PP PS QIRPWK+TAYCRKIARNVMAMA+GE  AEVA    AE+PE IKK QEAW KVEDKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPL+PGLLELVGIGYTGWF Y N++F PDREA ++KLKETYSEIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

A0A5A7T8Q0 Protein CURVATURE THYLAKOID 1B1.8e-5878.06Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQS +PSS +PLPTLP PP PS QIRPWK+TAYCRKIARNVMAMA+GE  AEVA    AE+PE IKK QEAW KVEDKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPL+PGLLELVGIGYTGWF Y N++F PDREA ++KLKETYSEIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

A0A6J1ET79 protein CURVATURE THYLAKOID 1B, chloroplastic2.7e-5776.77Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQSA+PSS +PLPTLP PP PS QIR WK+TAYCRKIARNVMAMA+GE  AEVA    AELPE IKK QEAW KV+DKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPLIPGLLELVGIGYTGWF Y N++F PDREA ++KL+ETY+E+IGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

A0A6J1JPR9 protein CURVATURE THYLAKOID 1B, chloroplastic1.4e-5878.06Show/hide
Query:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG
        + PRQSA+PSS +PLPTLP PPAPS QIRPWK+TAYCR+IARNVMAMA+GE  AEVA    AELPE IKK QEAW KV+DKYAVSSLAV+G V LWASAG
Subjt:  QGPRQSATPSSPLPLPTLPPPPAPS-QIRPWKTTAYCRKIARNVMAMASGEASAEVAN---AELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAG

Query:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        VVSAID+LPLIPGLLELVGIGYTGWF Y N++F PDREA ++KL+ETY+EIIGSS
Subjt:  VVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

SwissProt top hitse value%identityAlignment
O04616 Protein CURVATURE THYLAKOID 1A, chloroplastic2.3e-1334.55Show/hide
Query:  RNVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKL
        + V  + +  +S E ++ +  E I   +E W  +E+K  V       +V +W S+ VV AI+ +PL+P ++ELVG+GYTGWFVY  +LF+  R+   + +
Subjt:  RNVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKL

Query:  KETYSEIIGS
        +    +I GS
Subjt:  KETYSEIIGS

Q8LCA1 Protein CURVATURE THYLAKOID 1B, chloroplastic8.9e-4260.39Show/hide
Query:  SATPSSP--LPLPTLPPPPAPSQIRPWKTTAYCRKIARNVMAMAS---GEASA---EVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGV
        SA+ SSP  + LPTL   P  S  R  K TAYCRKI RNV+  A+   GEA A   E    ELPE +K AQEAW KV+DKYA+ SLA AG+V LW SAG+
Subjt:  SATPSSP--LPLPTLPPPPAPSQIRPWKTTAYCRKIARNVMAMAS---GEASA---EVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGV

Query:  VSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        +SAID+LPL+PG+LELVGIGYTGWF Y N++F+PDREA  +K+K TY +I+GSS
Subjt:  VSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

Q8LDD3 Protein CURVATURE THYLAKOID 1D, chloroplastic6.4e-0828.97Show/hide
Query:  ASAEVANAELPEFIKKAQEAWGKVED-------KYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKET
        A  + +N+E P+   +  +A   + D        Y++       +V L+ ++ +VS+++ +PL P L+E+VG+GYT WF    +LF+ +RE    K+ E 
Subjt:  ASAEVANAELPEFIKKAQEAWGKVED-------KYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKET

Query:  YSEIIGS
          +++GS
Subjt:  YSEIIGS

Q8YX97 Valine--tRNA ligase1.0e-0540.35Show/hide
Query:  LAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKK
        L +AGLV L  +  V   +D +P +    E+VG+GY+ WFV  N+L  P R+ F+ K
Subjt:  LAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKK

Q9M812 Protein CURVATURE THYLAKOID 1C, chloroplastic3.8e-1639.81Show/hide
Query:  NVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLK
        ++M  ASGE+S    + ++   +   Q  W K ED+  +  L  AG+V LWAS  +++AIDKLP+I    ELVGI ++ WF Y  +LF+PDR+   K +K
Subjt:  NVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLK

Query:  ETYSEIIG
        ++ ++I+G
Subjt:  ETYSEIIG

Arabidopsis top hitse value%identityAlignment
AT1G52220.1 FUNCTIONS IN: molecular_function unknown2.7e-1739.81Show/hide
Query:  NVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLK
        ++M  ASGE+S    + ++   +   Q  W K ED+  +  L  AG+V LWAS  +++AIDKLP+I    ELVGI ++ WF Y  +LF+PDR+   K +K
Subjt:  NVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLK

Query:  ETYSEIIG
        ++ ++I+G
Subjt:  ETYSEIIG

AT1G52220.2 FUNCTIONS IN: molecular_function unknown7.8e-1739.81Show/hide
Query:  NVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLK
        ++M  ASGE+S    + ++   I+     W K ED+  +  L  AG+V LWAS  +++AIDKLP+I    ELVGI ++ WF Y  +LF+PDR+   K +K
Subjt:  NVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLK

Query:  ETYSEIIG
        ++ ++I+G
Subjt:  ETYSEIIG

AT2G46820.1 photosystem I P subunit6.3e-4360.39Show/hide
Query:  SATPSSP--LPLPTLPPPPAPSQIRPWKTTAYCRKIARNVMAMAS---GEASA---EVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGV
        SA+ SSP  + LPTL   P  S  R  K TAYCRKI RNV+  A+   GEA A   E    ELPE +K AQEAW KV+DKYA+ SLA AG+V LW SAG+
Subjt:  SATPSSP--LPLPTLPPPPAPSQIRPWKTTAYCRKIARNVMAMAS---GEASA---EVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGV

Query:  VSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        +SAID+LPL+PG+LELVGIGYTGWF Y N++F+PDREA  +K+K TY +I+GSS
Subjt:  VSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

AT2G46820.2 photosystem I P subunit6.3e-4360.39Show/hide
Query:  SATPSSP--LPLPTLPPPPAPSQIRPWKTTAYCRKIARNVMAMAS---GEASA---EVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGV
        SA+ SSP  + LPTL   P  S  R  K TAYCRKI RNV+  A+   GEA A   E    ELPE +K AQEAW KV+DKYA+ SLA AG+V LW SAG+
Subjt:  SATPSSP--LPLPTLPPPPAPSQIRPWKTTAYCRKIARNVMAMAS---GEASA---EVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGV

Query:  VSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS
        +SAID+LPL+PG+LELVGIGYTGWF Y N++F+PDREA  +K+K TY +I+GSS
Subjt:  VSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS

AT4G01150.1 unknown protein1.6e-1434.55Show/hide
Query:  RNVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKL
        + V  + +  +S E ++ +  E I   +E W  +E+K  V       +V +W S+ VV AI+ +PL+P ++ELVG+GYTGWFVY  +LF+  R+   + +
Subjt:  RNVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLVGLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKL

Query:  KETYSEIIGS
        +    +I GS
Subjt:  KETYSEIIGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TGGCAATGGCTTCTTCTTCTTCCTCCACATTCGCTCTCTCTTCCTCTTCCACCATCCTCGACTCCAAGGCCCCCGGCAATCCGCCACGCCGTCGTCCCCCTTGCCGCTCC
CCACTCTCCCGCCGCCGCCTGCCCCGTCGCAGATCCGCCCTTGGAAGACCACCGCCTACTGTCGGAAGATTGCTCGCAATGTCATGGCCATGGCGTCCGGCGAAGCGTCG
GCGGAAGTGGCCAACGCCGAGCTGCCGGAATTTATCAAGAAAGCTCAAGAAGCTTGGGGAAAGGTTGAAGATAAGTATGCCGTAAGTTCGCTTGCAGTGGCTGGTTTGGT
CGGACTTTGGGCCTCCGCCGGCGTCGTCTCGGCAATTGATAAGCTTCCTCTGATTCCTGGCCTGCTTGAGCTTGTAGGCATTGGCTACACTGGGTGGTTTGTATACAACA
ATGTGTTATTCGAACCAGACAGGGAAGCATTTGTGAAGAAACTAAAGGAAACCTACAGTGAGATAATAGGGAGCAGCTAA
mRNA sequenceShow/hide mRNA sequence
TGGCAATGGCTTCTTCTTCTTCCTCCACATTCGCTCTCTCTTCCTCTTCCACCATCCTCGACTCCAAGGCCCCCGGCAATCCGCCACGCCGTCGTCCCCCTTGCCGCTCC
CCACTCTCCCGCCGCCGCCTGCCCCGTCGCAGATCCGCCCTTGGAAGACCACCGCCTACTGTCGGAAGATTGCTCGCAATGTCATGGCCATGGCGTCCGGCGAAGCGTCG
GCGGAAGTGGCCAACGCCGAGCTGCCGGAATTTATCAAGAAAGCTCAAGAAGCTTGGGGAAAGGTTGAAGATAAGTATGCCGTAAGTTCGCTTGCAGTGGCTGGTTTGGT
CGGACTTTGGGCCTCCGCCGGCGTCGTCTCGGCAATTGATAAGCTTCCTCTGATTCCTGGCCTGCTTGAGCTTGTAGGCATTGGCTACACTGGGTGGTTTGTATACAACA
ATGTGTTATTCGAACCAGACAGGGAAGCATTTGTGAAGAAACTAAAGGAAACCTACAGTGAGATAATAGGGAGCAGCTAA
Protein sequenceShow/hide protein sequence
GNGFFFFLHIRSLFLFHHPRLQGPRQSATPSSPLPLPTLPPPPAPSQIRPWKTTAYCRKIARNVMAMASGEASAEVANAELPEFIKKAQEAWGKVEDKYAVSSLAVAGLV
GLWASAGVVSAIDKLPLIPGLLELVGIGYTGWFVYNNVLFEPDREAFVKKLKETYSEIIGSS