; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sed0012610 (gene) of Chayote v1 genome

Gene IDSed0012610
OrganismSechium edule (Chayote v1)
Descriptionzinc finger homeobox protein 4-like isoform X1
Genome locationLG03:10927605..10929104
RNA-Seq ExpressionSed0012610
SyntenySed0012610
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585540.1 hypothetical protein SDJN03_18273, partial [Cucurbita argyrosperma subsp. sororia]2.8e-6860.29Show/hide
Query:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK
        FS  +  VAQILLEF    +KS   LG  P W+ RRKRS L+SPP       S+A+ PSPPSKKVK SSP SPLVLNSLPLSRSESD++  AK S+  KK
Subjt:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK

Query:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ
        PSLDKKSQ+VEAID+LTKQNQGLKGE EA++QH+NHLK++N ELK+ KQE+IL +         PEIGTSSSA+      +   +  Q A          
Subjt:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ

Query:  MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK
        M EQSN  SQNFQIPIG +PFYDP S+ P GIPDLNI+ EE +QRNY+R +AA+AR+NRIQICKNKNNG  K
Subjt:  MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK

KAG7020453.1 hypothetical protein SDJN02_17137, partial [Cucurbita argyrosperma subsp. argyrosperma]3.7e-6860.29Show/hide
Query:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK
        FS  +  VAQILLEF    +KS   LG  P W+ RRKRS L+SPP       S+A+ PSPPSKKVK SSP SPLVLNSLPLSRSESD++  AK S+  KK
Subjt:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK

Query:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ
        PSLDKKSQ+VEAID+LTKQNQGLKGE EA++QH+NHLK++N ELK+ KQE+IL +         PEIGTSSSA+      +   +  Q A          
Subjt:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ

Query:  MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK
        M EQSN  SQNFQIPIG +PFYDP S+ P GIPDLNI+ EE +QRNY+R +AA+AR+NRIQICKNKNNG  K
Subjt:  MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK

XP_022951578.1 uncharacterized protein LOC111454352 [Cucurbita moschata]6.3e-6859.64Show/hide
Query:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK
        FS  +  VAQILLEF    +KS   LG  P W+ RRKRS L+SPP       S+A+ PSPPSKKVK SSP SPLVLNSLPLSRSESD++  AK S+  KK
Subjt:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK

Query:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ
        PSLDKKSQ+VEAID+LTKQNQGLKGE EA++QH+NHLK++N ELK+ KQE+IL +         PEIGTSSSA+                ++ +    HQ
Subjt:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ

Query:  ---MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK
           M EQSN  SQNFQIPIG +PFYDP S+ P GIPDLNI+ EE +QRNY+R +AA+AR+NRIQICKNKNNG  K
Subjt:  ---MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK

XP_023002465.1 uncharacterized protein LOC111496295 [Cucurbita maxima]3.1e-6759.06Show/hide
Query:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK
        F+  +  VAQILLEF    +KS   LG  P W+ RRKRS L+SPP       S+A+ PSPPSKKVK SSP SPLVLNSLPLSRSESD++  AK S+  KK
Subjt:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK

Query:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ
         SLDKKSQ+VEAID+LTKQNQGLKGE EA++QH+NHLK++N ELK+ KQE+IL +         PEIGTSSSA+                ++ +   NHQ
Subjt:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ

Query:  ----MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK
            M EQSN  SQNFQIPIG +PFYDP S+ P GIPDLNI+ EE +QRNY+R +AA+AR+NRIQICKNKNNG  K
Subjt:  ----MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK

XP_023537124.1 uncharacterized protein LOC111798295 [Cucurbita pepo subsp. pepo]9.1e-6759.19Show/hide
Query:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK
        F+  +  VAQILLEF    +KS   LG  P W+ RRKRS L+SPP       S+A+ PSPPSKKVK SSP SPLVLNSLPLSRSESD++  AK ++  KK
Subjt:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK

Query:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ
        PS DKKSQ+VEAID+LTKQNQGLKGE EA++QH+NHLK++N ELK+ KQE+IL +         PEIGTSSSA+      +   +  Q A          
Subjt:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ

Query:  MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK
        M EQSN  SQNFQIPIG +PFYDP S+ P GIPDLNI+ EE +QRNY+R +AA+AR+NRIQICKNKNNG  K
Subjt:  MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK

TrEMBL top hitse value%identityAlignment
A0A0A0LRP1 Uncharacterized protein3.0e-5551.27Show/hide
Query:  DFSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPS---PPSKKVKVSSPDSPLVLNSLPLSRSESDDN-PIAKRS
        DFS ++  VAQIL + PLL+Q+S FSLGL+P+W  RRKRS + SPP  SS  T     P    P S++ K SSP +PL L+SLPLSRSESD+N  IAK S
Subjt:  DFSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPS---PPSKKVKVSSPDSPLVLNSLPLSRSESDDN-PIAKRS

Query:  RSLKKPSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEII-----LRNKPEIGTSSSAIVNF------SGDSSNRNEFQQAMM
        +  KK  +DKKSQY+E I+ LT Q Q L+G+IEA+++HF +LK++N ELK+ KQEI+     L   P+ GTS+S  +        S DS+  N   +   
Subjt:  RSLKKPSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEII-----LRNKPEIGTSSSAIVNF------SGDSSNRNEFQQAMM

Query:  MMMMMNHQMGEQSNMNSQNFQIPIGAVPFYDPSIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNN
         M      + EQSN + QN+QIPIG +P YDPS+GP GIPDLN++ E+   +NY + LAA+ARQNRIQI KNKNN
Subjt:  MMMMMNHQMGEQSNMNSQNFQIPIGAVPFYDPSIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNN

A0A1S3BAR4 uncharacterized protein LOC1034880493.0e-5551.64Show/hide
Query:  DFSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPS---PPSKKVKVSSPDSPLVLNSLPLSRSESDDN-PIAKRS
        DFS ++  VAQIL + PLL+QKS FSLGL+P+W  RRKRS + SPP   S  T     P    P S++ K SSP +PL LNSLPLSRSESD+N  IAK S
Subjt:  DFSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPS---PPSKKVKVSSPDSPLVLNSLPLSRSESDDN-PIAKRS

Query:  RSLKKPSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEII-----LRNKPEIGTSSSAIVNF------SGDSSNRNEFQQAMM
        +  KK  +DKKSQY+E ID LT Q Q L+G+IEA+++HF +LK++N ELK+ KQEI+     +   PEIGTSSS  +        S  S+  N   +   
Subjt:  RSLKKPSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEII-----LRNKPEIGTSSSAIVNF------SGDSSNRNEFQQAMM

Query:  MMMMMNHQMGEQSNMNSQNFQIPIGAVPFYDPSIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNN
         M        EQ N N +N+QIPIG +P YDPS+GP GIPDLN++ E+   ++Y + LAA+ARQNRIQI KNKNN
Subjt:  MMMMMNHQMGEQSNMNSQNFQIPIGAVPFYDPSIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNN

A0A5A7VHE1 Uncharacterized protein3.0e-5551.64Show/hide
Query:  DFSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPS---PPSKKVKVSSPDSPLVLNSLPLSRSESDDN-PIAKRS
        DFS ++  VAQIL + PLL+QKS FSLGL+P+W  RRKRS + SPP   S  T     P    P S++ K SSP +PL LNSLPLSRSESD+N  IAK S
Subjt:  DFSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPS---PPSKKVKVSSPDSPLVLNSLPLSRSESDDN-PIAKRS

Query:  RSLKKPSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEII-----LRNKPEIGTSSSAIVNF------SGDSSNRNEFQQAMM
        +  KK  +DKKSQY+E ID LT Q Q L+G+IEA+++HF +LK++N ELK+ KQEI+     +   PEIGTSSS  +        S  S+  N   +   
Subjt:  RSLKKPSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEII-----LRNKPEIGTSSSAIVNF------SGDSSNRNEFQQAMM

Query:  MMMMMNHQMGEQSNMNSQNFQIPIGAVPFYDPSIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNN
         M        EQ N N +N+QIPIG +P YDPS+GP GIPDLN++ E+   ++Y + LAA+ARQNRIQI KNKNN
Subjt:  MMMMMNHQMGEQSNMNSQNFQIPIGAVPFYDPSIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNN

A0A6J1GI34 uncharacterized protein LOC1114543523.0e-6859.64Show/hide
Query:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK
        FS  +  VAQILLEF    +KS   LG  P W+ RRKRS L+SPP       S+A+ PSPPSKKVK SSP SPLVLNSLPLSRSESD++  AK S+  KK
Subjt:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK

Query:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ
        PSLDKKSQ+VEAID+LTKQNQGLKGE EA++QH+NHLK++N ELK+ KQE+IL +         PEIGTSSSA+                ++ +    HQ
Subjt:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ

Query:  ---MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK
           M EQSN  SQNFQIPIG +PFYDP S+ P GIPDLNI+ EE +QRNY+R +AA+AR+NRIQICKNKNNG  K
Subjt:  ---MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK

A0A6J1KP15 uncharacterized protein LOC1114962951.5e-6759.06Show/hide
Query:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK
        F+  +  VAQILLEF    +KS   LG  P W+ RRKRS L+SPP       S+A+ PSPPSKKVK SSP SPLVLNSLPLSRSESD++  AK S+  KK
Subjt:  FSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKK

Query:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ
         SLDKKSQ+VEAID+LTKQNQGLKGE EA++QH+NHLK++N ELK+ KQE+IL +         PEIGTSSSA+                ++ +   NHQ
Subjt:  PSLDKKSQYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNK--------PEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQ

Query:  ----MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK
            M EQSN  SQNFQIPIG +PFYDP S+ P GIPDLNI+ EE +QRNY+R +AA+AR+NRIQICKNKNNG  K
Subjt:  ----MGEQSNMNSQNFQIPIGAVPFYDP-SIGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGATTTCTCCTCCGACGATTTCCTCGTCGCCCAAATCCTCCTCGAATTCCCTCTCCTCGTTCAGAAATCGCAGTTCTCCCTCGGCTTAACCCCCGCCTGGTCCCA
CCGTCGCAAGAGATCCGTCCTTCTATCGCCGCCGCCACCATCCTCCTCTGCCACTTCCACCGCCGCCGCCCCCTCGCCGCCGTCCAAGAAGGTCAAGGTCTCCAGCCCTG
ATTCCCCGCTCGTCCTCAACTCCCTGCCATTGTCGCGGAGTGAATCGGACGATAATCCCATCGCCAAACGCTCCCGCTCCCTCAAGAAACCCTCTCTCGATAAGAAATCT
CAATATGTGGAAGCCATTGACGATTTGACCAAACAGAATCAAGGTTTGAAAGGGGAAATTGAGGCTTTGGAGCAACATTTTAACCATCTGAAATCTGTCAATTATGAGTT
AAAGTCTCTCAAGCAAGAGATAATTCTGCGTAATAAGCCTGAAATTGGAACCTCAAGTTCAGCCATCGTTAATTTCAGCGGCGACTCCTCGAATCGCAACGAATTTCAAC
AGGCAATGATGATGATGATGATGATGAACCATCAAATGGGAGAACAGAGCAATATGAACAGTCAAAATTTTCAAATCCCAATTGGGGCAGTTCCTTTTTACGATCCTTCA
ATCGGTCCAAAAGGGATTCCTGATTTGAACATTGCTTTCGAAGAAACTCATCAGAGGAATTACGCAAGAATCTTGGCGGCTCAAGCGAGACAGAATAGGATTCAGATCTG
CAAGAACAAGAACAATGGAGCCGCCAAATAG
mRNA sequenceShow/hide mRNA sequence
AATTCTTCCTCCTTCCTCCAATCTTCACTCTCCCCTCAAATTCCGATTTCTGCGATTCCGTTTTGCCCTAATTTCTCCAACCAACCAACCAAACTCCATTGATTTCCATG
GCGGATTTCTCCTCCGACGATTTCCTCGTCGCCCAAATCCTCCTCGAATTCCCTCTCCTCGTTCAGAAATCGCAGTTCTCCCTCGGCTTAACCCCCGCCTGGTCCCACCG
TCGCAAGAGATCCGTCCTTCTATCGCCGCCGCCACCATCCTCCTCTGCCACTTCCACCGCCGCCGCCCCCTCGCCGCCGTCCAAGAAGGTCAAGGTCTCCAGCCCTGATT
CCCCGCTCGTCCTCAACTCCCTGCCATTGTCGCGGAGTGAATCGGACGATAATCCCATCGCCAAACGCTCCCGCTCCCTCAAGAAACCCTCTCTCGATAAGAAATCTCAA
TATGTGGAAGCCATTGACGATTTGACCAAACAGAATCAAGGTTTGAAAGGGGAAATTGAGGCTTTGGAGCAACATTTTAACCATCTGAAATCTGTCAATTATGAGTTAAA
GTCTCTCAAGCAAGAGATAATTCTGCGTAATAAGCCTGAAATTGGAACCTCAAGTTCAGCCATCGTTAATTTCAGCGGCGACTCCTCGAATCGCAACGAATTTCAACAGG
CAATGATGATGATGATGATGATGAACCATCAAATGGGAGAACAGAGCAATATGAACAGTCAAAATTTTCAAATCCCAATTGGGGCAGTTCCTTTTTACGATCCTTCAATC
GGTCCAAAAGGGATTCCTGATTTGAACATTGCTTTCGAAGAAACTCATCAGAGGAATTACGCAAGAATCTTGGCGGCTCAAGCGAGACAGAATAGGATTCAGATCTGCAA
GAACAAGAACAATGGAGCCGCCAAATAGCACCAGAGTTTTCTTCTGATCCCTGTATGTGATTGATACACACTGAGTCAGTTTTCAACCAAACAAATTCGAGGATTTCACA
TTTTTACAACATTCTTTTTTTGATCTTGGGATTCTGTTCATCAATTTGATGATGTTGGGGTGCCTATTTTGATTCTGGAGATTAGGGTAGATTTTTATTTTTCATTTTTG
TAGATTTTTGAATTGGGGATTAGTACTCCAATTGTAAAGTTAATAGAAGAATCCTAAGAGCTGCC
Protein sequenceShow/hide protein sequence
MADFSSDDFLVAQILLEFPLLVQKSQFSLGLTPAWSHRRKRSVLLSPPPPSSSATSTAAAPSPPSKKVKVSSPDSPLVLNSLPLSRSESDDNPIAKRSRSLKKPSLDKKS
QYVEAIDDLTKQNQGLKGEIEALEQHFNHLKSVNYELKSLKQEIILRNKPEIGTSSSAIVNFSGDSSNRNEFQQAMMMMMMMNHQMGEQSNMNSQNFQIPIGAVPFYDPS
IGPKGIPDLNIAFEETHQRNYARILAAQARQNRIQICKNKNNGAAK