; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr014763 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr014763
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionBEL1-like homeodomain 6
Genome locationtig00001047:795076..798301
RNA-Seq ExpressionSgr014763
SyntenySgr014763
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
BAT04807.1 Os08g0300333, partial [Oryza sativa Japonica Group]2.4e-1432.52Show/hide
Query:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFN---LPFP-LRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLR
        +++  L E+++  G RH LHH +RR VGE ++++VHLHLL R   + +   LP P L +    E   H  L  P    QE  LL+  +QV +QH+ V   
Subjt:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFN---LPFP-LRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLR

Query:  RSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLL---QSPQR--HHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDD
           +H  RR  R+ V H E +V    + +H+  HLL   ++PQR  HH       ++D ++ H RL + V+ R+A ++++ +V+  VE V DG  +++ +
Subjt:  RSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLL---QSPQR--HHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDD

Query:  HRNRKQEKIIGVRAHEKADEAAIARAHQDDGKEKTRRNGDAEGDQA
        H  R++E+++G    E  D+ A    H +   E+    G  +GD+A
Subjt:  HRNRKQEKIIGVRAHEKADEAAIARAHQDDGKEKTRRNGDAEGDQA

GER38952.1 BEL1-like homeodomain 6 [Striga asiatica]1.6e-1326.98Show/hide
Query:  VSSQHQPVKLRRSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGF
        +S Q++P +LRR       R  R+  HH +PI+    + +HN+ + L S + + R      +  R VSH RLR+ +  R+   + N +VN  +ET+  GF
Subjt:  VSSQHQPVKLRRSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGF

Query:  RNQKDDHRNRKQEKIIGV----------RAH------------------------EKADEAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGER
        R+Q+D+H   + E+++G            AH                        E A EAA+  A +DDG E+   +G+A G+ A+    +EE +EG  
Subjt:  RNQKDDHRNRKQEKIIGV----------RAH------------------------EKADEAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGER

Query:  LVLGWSVDGEDVSDGVVVGCEKKGCEVVVVAFGQLNCLKLPE---------------------RP-----------------------------------
        + L      E+V+DGVVVG E++G EVVVVA G L  L +                       +P                                   
Subjt:  LVLGWSVDGEDVSDGVVVGCEKKGCEVVVVAFGQLNCLKLPE---------------------RP-----------------------------------

Query:  -------------------LVGQAESGRDGLQE-----------------------TPLTSNIISPPSPNSPHSPALHISMKVLSTTTTMAVLKKVLHSR
                           LV +A S  D + E                          TSN+I PP P SPH P     M+ L TT T AV+KK L   
Subjt:  -------------------LVGQAESGRDGLQE-----------------------TPLTSNIISPPSPNSPHSPALHISMKVLSTTTTMAVLKKVLHSR

Query:  ALLH
        AL H
Subjt:  ALLH

KAB8108081.1 hypothetical protein EE612_043393, partial [Oryza sativa]4.1e-1432.52Show/hide
Query:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFN---LPFP-LRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLR
        +++  L E+++  G RH LHH +RR VGE ++++VHLHLL R   + +   LP P L +    E   H  L  P    QE  LL+  +QV +QH+ V   
Subjt:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFN---LPFP-LRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLR

Query:  RSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLL---QSPQR--HHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDD
           +H  RR  R+ V H E +V    + +H+  HLL   ++PQR  HH       ++D ++ H RL + V+ R+A ++++ +V+  VE V DG  +++ +
Subjt:  RSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLL---QSPQR--HHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDD

Query:  HRNRKQEKIIGVRAHEKADEAAIARAHQDDGKEKTRRNGDAEGDQA
        H  R++E+++G    E  D+ A    H +   E+    G  +GD+A
Subjt:  HRNRKQEKIIGVRAHEKADEAAIARAHQDDGKEKTRRNGDAEGDQA

TVU50424.1 hypothetical protein EJB05_01795 [Eragrostis curvula]1.1e-3234.48Show/hide
Query:  PEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFNLPFPLRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLRRSDRHPHRRH
        PE++   G  H LH  + RGVGE +++NVH+HLL R   + +    L +    E     PLLRPP L QEP L +H  +V  +H+ V+L  +        
Subjt:  PEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFNLPFPLRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLRRSDRHPHRRH

Query:  HRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDDHRNRKQEKIIGVRAH
            V HG+P+V  V + L + GHLL   QR    LG  EL+D DV HP LR+ ++E +A +V+++LV+  VE V+DG  +++ +H + ++E+++G    
Subjt:  HRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDDHRNRKQEKIIGVRAH

Query:  EKAD----------------------------------EAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGERLVLGWSVDGEDVSDGVVVGCE
        + A+                                  +AA+  AH+DDG+E+  R+ DAEG++A++ V+ EED+E     L     GEDV+  V VG  
Subjt:  EKAD----------------------------------EAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGERLVLGWSVDGEDVSDGVVVGCE

Query:  KKGCEVVVVAFGQLNCLKL
        +KG EVVVVA   +  L++
Subjt:  KKGCEVVVVAFGQLNCLKL

TrEMBL top hitse value%identityAlignment
A0A0A9J1X3 Uncharacterized protein5.8e-1467.14Show/hide
Query:  ISPPSPNSPHSPALHISMKVLSTTTTMAVLKKVLHSRALLHQVATSSRANKTPPTGARKAAQTPEDDPQI
        +SPPSPN PH P+   SM    TTTT+AVLKKV HS AL H VATSS AN TPPTGAR AA TP+D PQ+
Subjt:  ISPPSPNSPHSPALHISMKVLSTTTTMAVLKKVLHSRALLHQVATSSRANKTPPTGARKAAQTPEDDPQI

A0A0E0MM00 Uncharacterized protein1.6e-1945.64Show/hide
Query:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFNLPFPLRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLRRSDR
        +AE    E++    HRH LH  +RR VG+R+++NVHLHLL R   L +    LR+    E L   PLLRPPPL QEP L QH  +V  +HQPV+L    R
Subjt:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFNLPFPLRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLRRSDR

Query:  HPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSD
        HP  R     VHH EP+V    + LH+  H L   QRH  RL   EL+D
Subjt:  HPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSD

A0A0P0XEQ1 Os08g0300333 protein (Fragment)1.2e-1432.52Show/hide
Query:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFN---LPFP-LRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLR
        +++  L E+++  G RH LHH +RR VGE ++++VHLHLL R   + +   LP P L +    E   H  L  P    QE  LL+  +QV +QH+ V   
Subjt:  MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFN---LPFP-LRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLR

Query:  RSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLL---QSPQR--HHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDD
           +H  RR  R+ V H E +V    + +H+  HLL   ++PQR  HH       ++D ++ H RL + V+ R+A ++++ +V+  VE V DG  +++ +
Subjt:  RSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLL---QSPQR--HHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDD

Query:  HRNRKQEKIIGVRAHEKADEAAIARAHQDDGKEKTRRNGDAEGDQA
        H  R++E+++G    E  D+ A    H +   E+    G  +GD+A
Subjt:  HRNRKQEKIIGVRAHEKADEAAIARAHQDDGKEKTRRNGDAEGDQA

A0A5A7Q2C7 BEL1-like homeodomain 67.6e-1426.98Show/hide
Query:  VSSQHQPVKLRRSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGF
        +S Q++P +LRR       R  R+  HH +PI+    + +HN+ + L S + + R      +  R VSH RLR+ +  R+   + N +VN  +ET+  GF
Subjt:  VSSQHQPVKLRRSDRHPHRRHHRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGF

Query:  RNQKDDHRNRKQEKIIGV----------RAH------------------------EKADEAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGER
        R+Q+D+H   + E+++G            AH                        E A EAA+  A +DDG E+   +G+A G+ A+    +EE +EG  
Subjt:  RNQKDDHRNRKQEKIIGV----------RAH------------------------EKADEAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGER

Query:  LVLGWSVDGEDVSDGVVVGCEKKGCEVVVVAFGQLNCLKLPE---------------------RP-----------------------------------
        + L      E+V+DGVVVG E++G EVVVVA G L  L +                       +P                                   
Subjt:  LVLGWSVDGEDVSDGVVVGCEKKGCEVVVVAFGQLNCLKLPE---------------------RP-----------------------------------

Query:  -------------------LVGQAESGRDGLQE-----------------------TPLTSNIISPPSPNSPHSPALHISMKVLSTTTTMAVLKKVLHSR
                           LV +A S  D + E                          TSN+I PP P SPH P     M+ L TT T AV+KK L   
Subjt:  -------------------LVGQAESGRDGLQE-----------------------TPLTSNIISPPSPNSPHSPALHISMKVLSTTTTMAVLKKVLHSR

Query:  ALLH
        AL H
Subjt:  ALLH

A0A5J9WT13 Uncharacterized protein5.6e-3334.48Show/hide
Query:  PEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFNLPFPLRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLRRSDRHPHRRH
        PE++   G  H LH  + RGVGE +++NVH+HLL R   + +    L +    E     PLLRPP L QEP L +H  +V  +H+ V+L  +        
Subjt:  PEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFNLPFPLRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLRRSDRHPHRRH

Query:  HRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDDHRNRKQEKIIGVRAH
            V HG+P+V  V + L + GHLL   QR    LG  EL+D DV HP LR+ ++E +A +V+++LV+  VE V+DG  +++ +H + ++E+++G    
Subjt:  HRIGVHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDDHRNRKQEKIIGVRAH

Query:  EKAD----------------------------------EAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGERLVLGWSVDGEDVSDGVVVGCE
        + A+                                  +AA+  AH+DDG+E+  R+ DAEG++A++ V+ EED+E     L     GEDV+  V VG  
Subjt:  EKAD----------------------------------EAAIARAHQDDGKEKTRRNGDAEGDQAQNAVEKEEDDEGERLVLGWSVDGEDVSDGVVVGCE

Query:  KKGCEVVVVAFGQLNCLKL
        +KG EVVVVA   +  L++
Subjt:  KKGCEVVVVAFGQLNCLKL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAGCCCCACTTGCCGGAACAGAACCAGAGCTTTGGCCACCGACATGCTCTCCATCACTGTATACGGCGTGGTGTTGGTGAGAGGATGAAGATCAACGTACATCT
CCATCTCCTCCTTCGTCACTGCCAGCTCTTCAATCTTCCCTTCCCTCTCCGCCAATTCCACCCACGTGAATTTCTCTCTCACTTCCCACTCCTCCGTCCTCCTCCTCTCC
CTCAAGAACCATTTCTTCTTCAGCACCTGAAGCAGGTGAGCTCTCAGCACCAGCCCGTGAAGCTCCGTCGCTCCGATCGCCATCCCCACCGGCGGCACCACCGCATCGGC
GTCCACCACGGGGAACCCATTGTGCGTGGTGTTCTTAAGAACCTCCACAATGCGGGCCACCTTCTCCAGTCCCCGCAGCGTCACCACCGCCGGCTTGGCGTCCGCGAGCT
CTCCGACCGTGATGTTTCTCATCCACGGCTCCGGATTTGCGTCCAGGAACGGCAGGCCTTTCAAGTGCAGAATAATCTCGTAAATACTCGGGTTGAAACTGTCTCCGACG
GTTTTCGCAATCAGAAGGACGATCATCGTAATCGGAAGCAGGAGAAGATTATTGGTGTAAGGGCGCATGAGAAGGCCGATGAGGCGGCCATAGCCAGAGCCCATCAAGAT
GATGGGAAGGAAAAGACCAGACGGAACGGCGATGCCGAAGGTGATCAGGCCCAAAACGCAGTAGAGAAGGAAGAAGATGACGAGGGAGAGCGGTTGGTACTCGGCTGGAG
TGTTGATGGAGAAGATGTTTCGGACGGCGTCGTCGTTGGTTGTGAGAAGAAGGGTTGCGAGGTCGTTGTAGTGGCCTTTGGGCAGTTGAATTGCTTGAAGTTGCCGGAGC
GGCCATTGGTGGGGCAGGCGGAGTCTGGAAGAGATGGGTTGCAGGAAACTCCACTGACATCAAACATTATAAGCCCTCCTTCTCCAAACAGCCCACATTCACCTGCTTTG
CATATTTCAATGAAAGTTCTTAGCACCACCACAACAATGGCTGTGCTGAAAAAGGTTCTCCACAGCAGGGCACTCCTCCACCAAGTAGCCACCTCTTCTAGAGCAAATAA
AACGCCCCCGACCGGAGCTCGGAAAGCTGCACAAACTCCCGAAGACGACCCGCAGATCAGTCTAATTTATCTGTTGATTTGTTCAATGATTGACATTGACATTCATGACA
TTACCGAAACTATTTTAAATCTTAAGCCTATCCAATCAACCAAAGCAAATGATCCAAAGATGATTCCTGAATTACATTCAATCAGTGTTCGTTGCGCCGAGATTCAACAA
GAAAGCGCACGGCGCACGCACCTTAACGATCAATGTCGTGGCACCGAACATATTAGGAGTATCTATGCCATTGAGGTAGGCTTTGATTTCTGGTATGCCAGGCCCAGCTG
CGGTGGGAGCAAAGCATACACAGAGAAAAGCTGCGACAAAAGTCAGAAGGAAATTTGCAGTGTTGGGATTTCAAAGGCATGGTCATGGATGTATTATGTATACTTACCTT
TCTTCTTCTATGTAGCCGACCACTTTCAAAAGCTTGTAGCCAGCGATGTTTTCAATGGCGAGATTATGAGAGTGGCAATGATGCCTGTAAGAACTTCTGCTTCCATGGCG
TTGCTGAGAGTTGATTCTGCGAGAAGGCTTGAGTTTTCCTCCATTGGCGCAGTGGGGGAGAACCAGAAACAGCCTACAGAGCTTATGAAGAGACAAGAGAGACGATGGTC
AAGCTTGGGTTTGGTTGGTTGTGGATTTGGAAGAAAGCGCAAAGCTCTCTCTTCCCCGTTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAGCCCCACTTGCCGGAACAGAACCAGAGCTTTGGCCACCGACATGCTCTCCATCACTGTATACGGCGTGGTGTTGGTGAGAGGATGAAGATCAACGTACATCT
CCATCTCCTCCTTCGTCACTGCCAGCTCTTCAATCTTCCCTTCCCTCTCCGCCAATTCCACCCACGTGAATTTCTCTCTCACTTCCCACTCCTCCGTCCTCCTCCTCTCC
CTCAAGAACCATTTCTTCTTCAGCACCTGAAGCAGGTGAGCTCTCAGCACCAGCCCGTGAAGCTCCGTCGCTCCGATCGCCATCCCCACCGGCGGCACCACCGCATCGGC
GTCCACCACGGGGAACCCATTGTGCGTGGTGTTCTTAAGAACCTCCACAATGCGGGCCACCTTCTCCAGTCCCCGCAGCGTCACCACCGCCGGCTTGGCGTCCGCGAGCT
CTCCGACCGTGATGTTTCTCATCCACGGCTCCGGATTTGCGTCCAGGAACGGCAGGCCTTTCAAGTGCAGAATAATCTCGTAAATACTCGGGTTGAAACTGTCTCCGACG
GTTTTCGCAATCAGAAGGACGATCATCGTAATCGGAAGCAGGAGAAGATTATTGGTGTAAGGGCGCATGAGAAGGCCGATGAGGCGGCCATAGCCAGAGCCCATCAAGAT
GATGGGAAGGAAAAGACCAGACGGAACGGCGATGCCGAAGGTGATCAGGCCCAAAACGCAGTAGAGAAGGAAGAAGATGACGAGGGAGAGCGGTTGGTACTCGGCTGGAG
TGTTGATGGAGAAGATGTTTCGGACGGCGTCGTCGTTGGTTGTGAGAAGAAGGGTTGCGAGGTCGTTGTAGTGGCCTTTGGGCAGTTGAATTGCTTGAAGTTGCCGGAGC
GGCCATTGGTGGGGCAGGCGGAGTCTGGAAGAGATGGGTTGCAGGAAACTCCACTGACATCAAACATTATAAGCCCTCCTTCTCCAAACAGCCCACATTCACCTGCTTTG
CATATTTCAATGAAAGTTCTTAGCACCACCACAACAATGGCTGTGCTGAAAAAGGTTCTCCACAGCAGGGCACTCCTCCACCAAGTAGCCACCTCTTCTAGAGCAAATAA
AACGCCCCCGACCGGAGCTCGGAAAGCTGCACAAACTCCCGAAGACGACCCGCAGATCAGTCTAATTTATCTGTTGATTTGTTCAATGATTGACATTGACATTCATGACA
TTACCGAAACTATTTTAAATCTTAAGCCTATCCAATCAACCAAAGCAAATGATCCAAAGATGATTCCTGAATTACATTCAATCAGTGTTCGTTGCGCCGAGATTCAACAA
GAAAGCGCACGGCGCACGCACCTTAACGATCAATGTCGTGGCACCGAACATATTAGGAGTATCTATGCCATTGAGGTAGGCTTTGATTTCTGGTATGCCAGGCCCAGCTG
CGGTGGGAGCAAAGCATACACAGAGAAAAGCTGCGACAAAAGTCAGAAGGAAATTTGCAGTGTTGGGATTTCAAAGGCATGGTCATGGATGTATTATGTATACTTACCTT
TCTTCTTCTATGTAGCCGACCACTTTCAAAAGCTTGTAGCCAGCGATGTTTTCAATGGCGAGATTATGAGAGTGGCAATGATGCCTGTAAGAACTTCTGCTTCCATGGCG
TTGCTGAGAGTTGATTCTGCGAGAAGGCTTGAGTTTTCCTCCATTGGCGCAGTGGGGGAGAACCAGAAACAGCCTACAGAGCTTATGAAGAGACAAGAGAGACGATGGTC
AAGCTTGGGTTTGGTTGGTTGTGGATTTGGAAGAAAGCGCAAAGCTCTCTCTTCCCCGTTCTAG
Protein sequenceShow/hide protein sequence
MAEPHLPEQNQSFGHRHALHHCIRRGVGERMKINVHLHLLLRHCQLFNLPFPLRQFHPREFLSHFPLLRPPPLPQEPFLLQHLKQVSSQHQPVKLRRSDRHPHRRHHRIG
VHHGEPIVRGVLKNLHNAGHLLQSPQRHHRRLGVRELSDRDVSHPRLRICVQERQAFQVQNNLVNTRVETVSDGFRNQKDDHRNRKQEKIIGVRAHEKADEAAIARAHQD
DGKEKTRRNGDAEGDQAQNAVEKEEDDEGERLVLGWSVDGEDVSDGVVVGCEKKGCEVVVVAFGQLNCLKLPERPLVGQAESGRDGLQETPLTSNIISPPSPNSPHSPAL
HISMKVLSTTTTMAVLKKVLHSRALLHQVATSSRANKTPPTGARKAAQTPEDDPQISLIYLLICSMIDIDIHDITETILNLKPIQSTKANDPKMIPELHSISVRCAEIQQ
ESARRTHLNDQCRGTEHIRSIYAIEVGFDFWYARPSCGGSKAYTEKSCDKSQKEICSVGISKAWSWMYYVYLPFFFYVADHFQKLVASDVFNGEIMRVAMMPVRTSASMA
LLRVDSARRLEFSSIGAVGENQKQPTELMKRQERRWSSLGLVGCGFGRKRKALSSPF