CuGenDBv2

IVF0021789 (gene) of Melon (IVF77) v1 genome

Gene ID: IVF0021789
Organism: Cucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
Description: Gag/pol protein
Genome location: chr01:18454504..18462568
RNA-Seq Expression: IVF0021789
Synteny: IVF0021789
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
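
The genome location field can be split into a sequence name and a coordinate pair for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record does not state its convention); `Locus` and `parse_location` are hypothetical names introduced here for illustration.

```python
import re
from typing import NamedTuple


class Locus(NamedTuple):
    """A genomic interval; coordinates assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points count.
        return self.end - self.start + 1


def parse_location(text: str) -> Locus:
    """Parse a 'seqid:start..end' string such as 'chr01:18454504..18462568'."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start is greater than end in {text!r}")
    return Locus(seqid, start, end)


locus = parse_location("chr01:18454504..18462568")
print(locus.seqid, locus.start, locus.end, locus.length)
```

Under the inclusive assumption the locus spans 8065 bp, which is longer than the 1275 nt CDS listed for this gene.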


Homology
GenBank top hits    e value    % identity    Alignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]    1.07e-144    88.19
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRF LVEECPQVPAANATRTVRE YERWAKANEKARAYILASLSE                    E
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E

Query:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGAS+REHVLNMMVHFN+AEMN AVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        QKGEANVATSTRKF+RGS SGTKSMP SS NKK KKKKGGQGNKANLAAAK +KKAKAAKGICFHCNQEGH
Subjt:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa]    2.09e-144    88.19
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRF LVEECPQVPAANATRTVRE YERWAKANEKARAYILASLSE                    E
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E

Query:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGAS+REHVLNMMVHFN+AEMN AVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        QKGEANVATSTRKF+RGS SGTKSMP SS NKK KKKKGGQGNKANLAAAK +KKAKAAKGICFHCNQEGH
Subjt:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

KAA0054432.1 gag/pol protein [Cucumis melo var. makuwa]    3.81e-145    85.24
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRF LVEECPQVPAANATRTVRE YERWAKAN+KARAYILASLSE                    E
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E

Query:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MF QASYQIKHDALKYIYNARMNEGAS+REHVLNMMVHFN+ EMN AVIDEASQVSFILESLPESFLQFR+NAVMNKIAYTLT LLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        QKGEANVATSTRKF+RGS SGTKSMP SS NKK KKKK GQGNKANLAAAK +KKAKAAK ICFH NQEGH
Subjt:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

KAA0062798.1 gag/pol protein [Cucumis melo var. makuwa]    7.41e-162    98.01
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE     ASYQIKHDALKYIYNA
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA

Query:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
        RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
Subjt:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS

Query:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
Subjt:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

TYJ96755.1 gag/pol protein [Cucumis melo var. makuwa]    4.47e-146    97.94
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE     ASYQIKHDALKYIYNA
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA

Query:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
        RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
Subjt:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS

Query:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGIC
        GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGIC
Subjt:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGIC

TrEMBL top hits    e value    % identity    Alignment
A0A5A7SMH8 Gag/pol protein    1.4e-120    88.19
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRF LVEECPQVPAANATRTVRE YERWAKANEKARAYILASLSE                    E
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E

Query:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGAS+REHVLNMMVHFN+AEMN AVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        QKGEANVATSTRKF+RGS SGTKSMP SS NKK KKKKGGQGNKANLAAAK +KKAKAAKGICFHCNQEGH
Subjt:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

A0A5A7TU93 Gag/pol protein    1.4e-120    88.19
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRF LVEECPQVPAANATRTVRE YERWAKANEKARAYILASLSE                    E
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE--------------------E

Query:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
        MFGQASYQIKHDALKYIYNARMNEGAS+REHVLNMMVHFN+AEMN AVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG
Subjt:  MFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKG

Query:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        QKGEANVATSTRKF+RGS SGTKSMP SS NKK KKKKGGQGNKANLAAAK +KKAKAAKGICFHCNQEGH
Subjt:  QKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

A0A5A7V4M1 Gag/pol protein    1.3e-121    87.96
Query:  NLIMTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE------------------
        NLIMTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRF LV+ECPQVPAANATRTVRE YERWAKANEKARAYILASLSE                  
Subjt:  NLIMTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE------------------

Query:  --EMFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMK
          EMFGQASYQIKHDALKYIYNARMNEGAS+REHVLNMMVHFN+AEMN AVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMK
Subjt:  --EMFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMK

Query:  IKGQKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        IKGQKGEANVATSTRKF+RGS SGTKSMP SS NKK KKKKGGQGNKANLAAAK +KKAKAAKGICFHCNQEGH
Subjt:  IKGQKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

A0A5A7V5A8 Gag/pol protein    6.0e-127    98.01
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE     ASYQIKHDALKYIYNA
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA

Query:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
        RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
Subjt:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS

Query:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
        GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
Subjt:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH

A0A5D3BE74 Gag/pol protein    1.1e-120    97.94
Query:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA
        MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSE     ASYQIKHDALKYIYNA
Subjt:  MTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVPAANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNA

Query:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
        RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS
Subjt:  RMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNKIAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSIS

Query:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGIC
        GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGIC
Subjt:  GTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGIC

SwissProt top hits    e value    % identity    Alignment
No hits found

Arabidopsis top hits    e value    % identity    Alignment
No hits found
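
The unlabelled middle line of each alignment block follows the usual BLAST convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below tallies those three classes for a single displayed segment. `midline_stats` is a hypothetical helper, and the result is not expected to reproduce the % identity column, which is presumably computed over the full alignment rather than the excerpt shown on this page.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identical, conservative and other columns in one alignment segment.

    `query` is the residue string from a 'Query:' line and `midline` is the
    residue-aligned string printed beneath it (leading label padding removed).
    """
    # Trailing spaces are often stripped from the midline; pad it back out.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    identical = sum(ch.isalpha() for ch in midline)
    positive = midline.count("+")
    other = len(midline) - identical - positive
    return {
        "columns": len(query),
        "identical": identical,
        "positive": positive,
        "other": other,
        "percent_identity": 100.0 * identical / len(query) if query else 0.0,
    }


# A short stretch taken from the first GenBank hit above.
print(midline_stats("RKFYRGSISGT", "RKF+RGS SGT"))
```

Summing the counts over every segment of a hit gives per-hit totals for the displayed region.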

Sequences
CDS sequence
ATGAAGTCTGATCAACCAGTTGAAACAAAAGCTGTTAGCTCTTCCACAAAAACAAATAACTGGATCAGACGCATTGCTGATGAAGTGAATGTATCAGATGAACATCACCC
ATGTTTGAGTGTCATGGATTTTTTGGAATCATTGTGGTGTCTTCTGGTTCATCTAAATGTTGATATCCTAACCCTAAATGATACTCCTGGTCCTAACCCAAAGACATTAT
CATTAAGCTACAAGCTATTTCAAGGAAGTCATGTTCCTGATATAGAACATGATATGAGACCTTCTAAGAACCCCCGTATGTTTGACATTGATGATGTAGATGAGAATGCT
GAAGGGTTCTTTGTTCATCATGGCTTAGCCTCCAGAATAATTAATACGCTTACAATTGAGTCTCGAGCTCTTTCTACATCTATAAATTTGCCGTCTGATAGAAGATTGGA
GGTTGATTCGCTTATTCGTCACCTGAAGACACTTATTCCCTCCTCTAGTATCGATGCTTCAGATAAAGACAATTTGATAATGACGAGTGCTACTTTGAATATGCTGGCTG
CTGATAAACTTAATGGGAATAATTATGCATCTTGGAAAAATACTATCAACACTGTGCTAATCATCGATGACCTTAGATTTGACCTAGTTGAGGAGTGTCCTCAAGTCCCA
GCTGCTAATGCAACTCGAACTGTTCGAGAAGCATATGAGCGCTGGGCCAAGGCAAATGAAAAAGCCCGAGCATACATCTTGGCAAGCTTATCTGAAGAGATGTTTGGTCA
GGCCTCTTATCAGATCAAGCATGATGCTCTGAAATACATTTATAATGCCCGTATGAATGAGGGAGCCTCAATACGAGAACATGTTCTCAATATGATGGTTCATTTTAACA
TGGCAGAAATGAATGAGGCTGTCATTGATGAAGCCAGTCAGGTTAGCTTTATTTTGGAATCTCTGCCAGAGAGTTTCCTGCAATTTAGAAGCAATGCTGTTATGAATAAG
ATTGCTTATACCCTTACCACCCTTCTCAACGAGTTACAAACTTTTGAGTCTCTTATGAAAATCAAGGGACAAAAGGGAGAGGCAAATGTTGCTACTTCCACAAGAAAGTT
CTATAGAGGTTCGATCTCTGGAACTAAGTCTATGCCTTATTCATCTGACAATAAGAAGTCGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCTAACCTCGCTGCTGCTA
AAATAAGCAAGAAAGCCAAGGCTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTAG
mRNA sequence
ATGAAGTCTGATCAACCAGTTGAAACAAAAGCTGTTAGCTCTTCCACAAAAACAAATAACTGGATCAGACGCATTGCTGATGAAGTGAATGTATCAGATGAACATCACCC
ATGTTTGAGTGTCATGGATTTTTTGGAATCATTGTGGTGTCTTCTGGTTCATCTAAATGTTGATATCCTAACCCTAAATGATACTCCTGGTCCTAACCCAAAGACATTAT
CATTAAGCTACAAGCTATTTCAAGGAAGTCATGTTCCTGATATAGAACATGATATGAGACCTTCTAAGAACCCCCGTATGTTTGACATTGATGATGTAGATGAGAATGCT
GAAGGGTTCTTTGTTCATCATGGCTTAGCCTCCAGAATAATTAATACGCTTACAATTGAGTCTCGAGCTCTTTCTACATCTATAAATTTGCCGTCTGATAGAAGATTGGA
GGTTGATTCGCTTATTCGTCACCTGAAGACACTTATTCCCTCCTCTAGTATCGATGCTTCAGATAAAGACAATTTGATAATGACGAGTGCTACTTTGAATATGCTGGCTG
CTGATAAACTTAATGGGAATAATTATGCATCTTGGAAAAATACTATCAACACTGTGCTAATCATCGATGACCTTAGATTTGACCTAGTTGAGGAGTGTCCTCAAGTCCCA
GCTGCTAATGCAACTCGAACTGTTCGAGAAGCATATGAGCGCTGGGCCAAGGCAAATGAAAAAGCCCGAGCATACATCTTGGCAAGCTTATCTGAAGAGATGTTTGGTCA
GGCCTCTTATCAGATCAAGCATGATGCTCTGAAATACATTTATAATGCCCGTATGAATGAGGGAGCCTCAATACGAGAACATGTTCTCAATATGATGGTTCATTTTAACA
TGGCAGAAATGAATGAGGCTGTCATTGATGAAGCCAGTCAGGTTAGCTTTATTTTGGAATCTCTGCCAGAGAGTTTCCTGCAATTTAGAAGCAATGCTGTTATGAATAAG
ATTGCTTATACCCTTACCACCCTTCTCAACGAGTTACAAACTTTTGAGTCTCTTATGAAAATCAAGGGACAAAAGGGAGAGGCAAATGTTGCTACTTCCACAAGAAAGTT
CTATAGAGGTTCGATCTCTGGAACTAAGTCTATGCCTTATTCATCTGACAATAAGAAGTCGAAGAAGAAGAAGGGTGGCCAAGGAAATAAAGCTAACCTCGCTGCTGCTA
AAATAAGCAAGAAAGCCAAGGCTGCAAAGGGAATATGTTTCCATTGCAACCAAGAGGGACATTAG
Protein sequence
MKSDQPVETKAVSSSTKTNNWIRRIADEVNVSDEHHPCLSVMDFLESLWCLLVHLNVDILTLNDTPGPNPKTLSLSYKLFQGSHVPDIEHDMRPSKNPRMFDIDDVDENA
EGFFVHHGLASRIINTLTIESRALSTSINLPSDRRLEVDSLIRHLKTLIPSSSIDASDKDNLIMTSATLNMLAADKLNGNNYASWKNTINTVLIIDDLRFDLVEECPQVP
AANATRTVREAYERWAKANEKARAYILASLSEEMFGQASYQIKHDALKYIYNARMNEGASIREHVLNMMVHFNMAEMNEAVIDEASQVSFILESLPESFLQFRSNAVMNK
IAYTLTTLLNELQTFESLMKIKGQKGEANVATSTRKFYRGSISGTKSMPYSSDNKKSKKKKGGQGNKANLAAAKISKKAKAAKGICFHCNQEGH
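
The protein sequence corresponds to a codon-by-codon translation of the CDS. The sketch below illustrates that relationship, assuming the standard nuclear genetic code and an in-frame CDS; `translate` is a hypothetical helper written for this example, not the pipeline used to build the record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    residues = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        if codon not in CODON_TABLE:
            raise ValueError(f"unexpected codon {codon!r} at position {i}")
        aa = CODON_TABLE[codon]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)


# First nine codons and last five codons of the CDS listed above.
print(translate("ATGAAGTCTGATCAACCAGTTGAAACA"))
print(translate("CAAGAGGGACATTAG"))
```

Passing the full CDS, with the line breaks left in, should yield a string that can be compared directly against the protein sequence above.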