; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0021239 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0021239
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionNUFIP1 domain-containing protein
Genome locationchr10:5107178..5112861
RNA-Seq ExpressionPI0021239
SyntenyPI0021239
Gene Ontology termsGO:0000492 - box C/D snoRNP assembly (biological process)
GO:0005634 - nucleus (cellular component)
InterPro domainsIPR019496 - Nuclear fragile X mental retardation-interacting protein 1, conserved domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004140922.1 uncharacterized protein LOC101213190 [Cucumis sativus]3.1e-8082.87Show/hide
Query:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ
        +  +GNSSISDGGNGSNSI NNSAHRNFMRNSKKGFQKNQTHH+KNEKKKF FPGGQK K     HNERRNKF G+N TDQVKEQKRSL LVYTDQEIRQ
Subjt:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ

Query:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK
        WREARRKNYPSS NIQ     KQTN TLVDKEA+LLRQELKEILAKQAELGVEVA+IP EYLSYSEKHDNRKQR   STLG+EA+ ASI KE SQN+LNK
Subjt:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK

Query:  RGRFKKKNRPRKKGKF
        RGR KKKNRPRKKGKF
Subjt:  RGRFKKKNRPRKKGKF

XP_008456637.1 PREDICTED: uncharacterized protein LOC103496534 isoform X2 [Cucumis melo]4.3e-8283.8Show/hide
Query:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ
        +  +GNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGGQK K     HNERRNKF G+NSTDQVKEQKRSL LVYTDQEIRQ
Subjt:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ

Query:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK
        WREARRKNYPSS NIQ     KQTN TLV++EAQLLRQELKEILAKQAELGVEVA+IP EYLSYSEKHDNRKQR  PSTLG+EA GASI KEKSQN+LNK
Subjt:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK

Query:  RGRFKKKNRPRKKGKF
        RGR KKKNRPRKKGKF
Subjt:  RGRFKKKNRPRKKGKF

XP_016902016.1 PREDICTED: uncharacterized protein LOC103496534 isoform X1 [Cucumis melo]1.6e-8181.9Show/hide
Query:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ
        +  +GNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGGQK K     HNERRNKF G+NSTDQVKEQKRSL LVYTDQEIRQ
Subjt:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ

Query:  WREARRKNYPSSINIQ----------KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQ
        WREARRKNYPSS NIQ          KQTN TLV++EAQLLRQELKEILAKQAELGVEVA+IP EYLSYSEKHDNRKQR  PSTLG+EA GASI KEKSQ
Subjt:  WREARRKNYPSSINIQ----------KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQ

Query:  NKLNKRGRFKKKNRPRKKGKF
        N+LNKRGR KKKNRPRKKGKF
Subjt:  NKLNKRGRFKKKNRPRKKGKF

XP_022992649.1 uncharacterized protein LOC111488934 [Cucurbita maxima]6.4e-7075.12Show/hide
Query:  AEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWR
        ++GNSSISDGGNGSNS  NN AHRNF RNS KGFQK+Q HHMKNEKKKF  PGG KGK     HNERRNKFGG NST+ VK+QKRSL LVYTDQEI QWR
Subjt:  AEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWR

Query:  EARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRG
        EARRKN+PSS NIQ     KQT+ TLVDKEAQLLRQELKEILAKQAELGVEVA+IP EYLSYSEK D+ K+  D ST+G+EA+GAS GKEK++N+ NKR 
Subjt:  EARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRG

Query:  RFKKKNRPRKKGK
        R +KKNR RKKGK
Subjt:  RFKKKNRPRKKGK

XP_038885674.1 uncharacterized protein LOC120075982 [Benincasa hispida]1.2e-7981.22Show/hide
Query:  AEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWR
        ++GNSSISDGGNGSNS  NN AHRNF RNSKKGFQKNQ HHMKNEKKKF FPGGQKGK     HNERRNKFG +NSTDQVK+QKRSL LVYTDQEIRQWR
Subjt:  AEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWR

Query:  EARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRG
        EARRKNYPSS N+Q     KQT+ TLVDKEAQLLR+ELKEILAKQAELGVEVA+IP EYLSYSEKH+NRK RRDPSTLG+E KGAS+GKEKS+N+ NKRG
Subjt:  EARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRG

Query:  RFKKKNRPRKKGK
        R +KKNR RKKGK
Subjt:  RFKKKNRPRKKGK

TrEMBL top hitse value%identityAlignment
A0A0A0KE87 NUFIP1 domain-containing protein1.5e-8082.87Show/hide
Query:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ
        +  +GNSSISDGGNGSNSI NNSAHRNFMRNSKKGFQKNQTHH+KNEKKKF FPGGQK K     HNERRNKF G+N TDQVKEQKRSL LVYTDQEIRQ
Subjt:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ

Query:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK
        WREARRKNYPSS NIQ     KQTN TLVDKEA+LLRQELKEILAKQAELGVEVA+IP EYLSYSEKHDNRKQR   STLG+EA+ ASI KE SQN+LNK
Subjt:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK

Query:  RGRFKKKNRPRKKGKF
        RGR KKKNRPRKKGKF
Subjt:  RGRFKKKNRPRKKGKF

A0A1S3C3B2 uncharacterized protein LOC103496534 isoform X22.1e-8283.8Show/hide
Query:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ
        +  +GNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGGQK K     HNERRNKF G+NSTDQVKEQKRSL LVYTDQEIRQ
Subjt:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ

Query:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK
        WREARRKNYPSS NIQ     KQTN TLV++EAQLLRQELKEILAKQAELGVEVA+IP EYLSYSEKHDNRKQR  PSTLG+EA GASI KEKSQN+LNK
Subjt:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK

Query:  RGRFKKKNRPRKKGKF
        RGR KKKNRPRKKGKF
Subjt:  RGRFKKKNRPRKKGKF

A0A1S4E1B3 uncharacterized protein LOC103496534 isoform X17.9e-8281.9Show/hide
Query:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ
        +  +GNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGGQK K     HNERRNKF G+NSTDQVKEQKRSL LVYTDQEIRQ
Subjt:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ

Query:  WREARRKNYPSSINIQ----------KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQ
        WREARRKNYPSS NIQ          KQTN TLV++EAQLLRQELKEILAKQAELGVEVA+IP EYLSYSEKHDNRKQR  PSTLG+EA GASI KEKSQ
Subjt:  WREARRKNYPSSINIQ----------KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQ

Query:  NKLNKRGRFKKKNRPRKKGKF
        N+LNKRGR KKKNRPRKKGKF
Subjt:  NKLNKRGRFKKKNRPRKKGKF

A0A5A7SKM0 Putative basic-leucine zipper transcription factor F isoform X12.1e-8283.8Show/hide
Query:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ
        +  +GNSSISDGGNG NSI NNSAHRNFMRNSKKGFQKNQTHHMKNEKK+F FPGGQK K     HNERRNKF G+NSTDQVKEQKRSL LVYTDQEIRQ
Subjt:  SSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQ

Query:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK
        WREARRKNYPSS NIQ     KQTN TLV++EAQLLRQELKEILAKQAELGVEVA+IP EYLSYSEKHDNRKQR  PSTLG+EA GASI KEKSQN+LNK
Subjt:  WREARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNK

Query:  RGRFKKKNRPRKKGKF
        RGR KKKNRPRKKGKF
Subjt:  RGRFKKKNRPRKKGKF

A0A6J1JY42 uncharacterized protein LOC1114889343.1e-7075.12Show/hide
Query:  AEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWR
        ++GNSSISDGGNGSNS  NN AHRNF RNS KGFQK+Q HHMKNEKKKF  PGG KGK     HNERRNKFGG NST+ VK+QKRSL LVYTDQEI QWR
Subjt:  AEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWR

Query:  EARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRG
        EARRKN+PSS NIQ     KQT+ TLVDKEAQLLRQELKEILAKQAELGVEVA+IP EYLSYSEK D+ K+  D ST+G+EA+GAS GKEK++N+ NKR 
Subjt:  EARRKNYPSSINIQ-----KQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRG

Query:  RFKKKNRPRKKGK
        R +KKNR RKKGK
Subjt:  RFKKKNRPRKKGK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G18440.1 CONTAINS InterPro DOMAIN/s: Nuclear fragile X mental retardation-interacting protein 1, conserved region (InterPro:IPR019496); Has 1333 Blast hits to 1211 proteins in 205 species: Archae - 0; Bacteria - 137; Metazoa - 339; Fungi - 162; Plants - 70; Viruses - 6; Other Eukaryotes - 619 (source: NCBI BLink).9.3e-1935.47Show/hide
Query:  GGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWREARRKNYPS
        G NG++       H+NF +   +GFQ+ Q H   N K+K  F    +GK     +N+ +    GS++ +  KE+KRS  L+YT +E++QWREARRKNYP+
Subjt:  GGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWREARRKNYPS

Query:  SI----NIQKQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRGRFKKKNRPRK
               ++K  + +++D+EA++ RQ+L+E+LAKQAELGVEVA++P  YLS +++  N  +  +        KG      +++ + +++ +F  K +PR 
Subjt:  SI----NIQKQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRGRFKKKNRPRK

Query:  KGK
        + K
Subjt:  KGK

AT5G18440.2 FUNCTIONS IN: molecular_function unknown; INVOLVED IN: biological_process unknown; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Nuclear fragile X mental retardation-interacting protein 1, conserved region (InterPro:IPR019496); Has 1333 Blast hits to 1211 proteins in 205 species: Archae - 0; Bacteria - 137; Metazoa - 339; Fungi - 162; Plants - 70; Viruses - 6; Other Eukaryotes - 619 (source: NCBI BLink).9.3e-1935.47Show/hide
Query:  GGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWREARRKNYPS
        G NG++       H+NF +   +GFQ+ Q H   N K+K  F    +GK     +N+ +    GS++ +  KE+KRS  L+YT +E++QWREARRKNYP+
Subjt:  GGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNERRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWREARRKNYPS

Query:  SI----NIQKQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRGRFKKKNRPRK
               ++K  + +++D+EA++ RQ+L+E+LAKQAELGVEVA++P  YLS +++  N  +  +        KG      +++ + +++ +F  K +PR 
Subjt:  SI----NIQKQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKEAKGASIGKEKSQNKLNKRGRFKKKNRPRK

Query:  KGK
        + K
Subjt:  KGK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTAATTTTCCCGAAGATATTCAAAAATTCCCTTCGCGGAAAAACTGCTTCAGAACGCGTAGCTCATCGCAGGCTGCCCTCTTCGCGAGGCAAGTCCTTCACAACCAG
AGCTTCGCGAGATACTTCATCTGCAGAAGGTAATTCTTCAATAAGTGATGGTGGAAATGGATCAAATTCAATTTTGAATAATTCAGCTCACAGGAATTTCATGAGGAATT
CAAAAAAAGGATTTCAGAAGAATCAAACTCATCATATGAAAAATGAGAAGAAAAAGTTTGAGTTTCCTGGCGGACAGAAAGGGAAAGACTGTAAACAGGTTCACAACGAG
AGGAGGAACAAATTTGGTGGCTCTAACTCCACAGATCAAGTGAAAGAACAGAAGAGATCTCTCTATTTGGTCTATACGGATCAAGAAATCCGGCAATGGCGCGAAGCACG
CCGGAAGAATTACCCGTCATCGATCAACATACAGAAGCAAACCAACTACACATTGGTGGATAAGGAGGCTCAGCTTTTGCGACAAGAACTCAAAGAGATTTTAGCAAAGC
AGGCTGAATTAGGAGTGGAAGTTGCAAAAATACCACTAGAGTATCTCTCATATTCAGAGAAACATGACAATCGAAAACAACGTAGAGATCCATCGACATTAGGAAAGGAA
GCCAAAGGAGCCTCAATAGGGAAAGAAAAATCTCAGAACAAGTTAAACAAGAGGGGAAGATTCAAGAAAAAGAATCGCCCGAGAAAGAAGGGAAAATTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGTAATTTTCCCGAAGATATTCAAAAATTCCCTTCGCGGAAAAACTGCTTCAGAACGCGTAGCTCATCGCAGGCTGCCCTCTTCGCGAGGCAAGTCCTTCACAACCAG
AGCTTCGCGAGATACTTCATCTGCAGAAGGTAATTCTTCAATAAGTGATGGTGGAAATGGATCAAATTCAATTTTGAATAATTCAGCTCACAGGAATTTCATGAGGAATT
CAAAAAAAGGATTTCAGAAGAATCAAACTCATCATATGAAAAATGAGAAGAAAAAGTTTGAGTTTCCTGGCGGACAGAAAGGGAAAGACTGTAAACAGGTTCACAACGAG
AGGAGGAACAAATTTGGTGGCTCTAACTCCACAGATCAAGTGAAAGAACAGAAGAGATCTCTCTATTTGGTCTATACGGATCAAGAAATCCGGCAATGGCGCGAAGCACG
CCGGAAGAATTACCCGTCATCGATCAACATACAGAAGCAAACCAACTACACATTGGTGGATAAGGAGGCTCAGCTTTTGCGACAAGAACTCAAAGAGATTTTAGCAAAGC
AGGCTGAATTAGGAGTGGAAGTTGCAAAAATACCACTAGAGTATCTCTCATATTCAGAGAAACATGACAATCGAAAACAACGTAGAGATCCATCGACATTAGGAAAGGAA
GCCAAAGGAGCCTCAATAGGGAAAGAAAAATCTCAGAACAAGTTAAACAAGAGGGGAAGATTCAAGAAAAAGAATCGCCCGAGAAAGAAGGGAAAATTTTGA
Protein sequenceShow/hide protein sequence
MVIFPKIFKNSLRGKTASERVAHRRLPSSRGKSFTTRASRDTSSAEGNSSISDGGNGSNSILNNSAHRNFMRNSKKGFQKNQTHHMKNEKKKFEFPGGQKGKDCKQVHNE
RRNKFGGSNSTDQVKEQKRSLYLVYTDQEIRQWREARRKNYPSSINIQKQTNYTLVDKEAQLLRQELKEILAKQAELGVEVAKIPLEYLSYSEKHDNRKQRRDPSTLGKE
AKGASIGKEKSQNKLNKRGRFKKKNRPRKKGKF