; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmaCh09G012960 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene IDCmaCh09G012960
OrganismCucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
DescriptionRetrovirus-related Pol polyprotein from transposon TNT 1-94
Genome locationCma_Chr09:8988039..8989347
RNA-Seq ExpressionCmaCh09G012960
SyntenyCmaCh09G012960
Gene Ontology termsGO:0006810 - transport (biological process)
GO:0044237 - cellular metabolic process (biological process)
GO:0110165 - cellular anatomical structure (cellular component)
GO:0000166 - nucleotide binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
GO:0043167 - ion binding (molecular function)
InterPro domainsIPR025724 - GAG-pre-integrase domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF3637944.1 putative cytochrome 82A3-like [Capsicum annuum]4.0e-6058.8Show/hide
Query:  EGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRD-------------------------------VVL
        E  SV DHINEFNMIVSQL  ++INFEDEIKALILMSSL ESWDTV+A ISSSRGS+KLKFDEIRD                               +V+
Subjt:  EGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRD-------------------------------VVL

Query:  SEK----------NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNI
          K          NQWTL+DVRYIP LKKNLIS+GQLDSTG+  +F K SWKI+KGAMVVARGTKSGTLYT+   +N A V E  S   +WHNRL HM+ 
Subjt:  SEK----------NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNI

Query:  KGMKMLAAKEVLERLESTDMSHCVNCVMSKQKR
        KGM+MLAAK  L+ ++S DM  C +CVM KQKR
Subjt:  KGMKMLAAKEVLERLESTDMSHCVNCVMSKQKR

KAF3643966.1 Pleiotropic drug resistance protein 1 [Capsicum annuum]7.3e-6243.54Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK-----
        MY+KPSA NKVYLM RLFNLQ+ E GSV DHINEFNMIVSQL SV+INFEDEIKALILMSSL E   T++  ISSS GS+KLKFD+IRDVV S+      
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKS
                                                       NQWTL+DVRYIP LKKNLISIGQLDSTGYAT+F K SWKI+KGAMVVARGTKS
Subjt:  -----------------------------------------------NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKS

Query:  GTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELKK
        GTL+T+A C+N A V E  S+  +WHNRL HM+ K MKMLAAK  LE ++  DM  C +CVM KQKRVSFTKT +  KK
Subjt:  GTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELKK

KAF3680274.1 putative 50S ribosomal protein L18-like [Capsicum annuum]5.6e-5449.31Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKL------------------
        MY+ PSA+NKVYLM RLFNLQ+ E GSV DHINEFNMIVSQL SV+INFEDE K+LILMSSL ESWDTV+  ISSS   D L                  
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKL------------------

Query:  ----------------KFDEI-------------RDVVLSEK--NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTL
                         F ++             RDV +     NQWTL+DVRYIP LKKNLI +GQLDSTGYA +F K                     
Subjt:  ----------------KFDEI-------------RDVVLSEK--NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTL

Query:  YTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELKKSTGTTKQVEVEV
            GC+N  +V ES S S +W+NRL HM+ KGMKMLAAK  L+ L+S DM  C +CVM KQKRVSF KT RE       TKQV +E+
Subjt:  YTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELKKSTGTTKQVEVEV

KAG7011443.1 hypothetical protein SDJN02_26349, partial [Cucurbita argyrosperma subsp. argyrosperma]9.9e-5959.07Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSE------
        MY+K SAMNKVYLM RLFNLQ+SEGGS+ D+INEFNMIVS+LS VEINF+DEIKALILMSSL ESWDTV+A I+SSRGSDKLKFDEIRD+VL E      
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSE------

Query:  -----------------KNQ-----------WTLKDVRYIPSLK-KNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGE
                         K Q           +T +D      L   +    G     GYA +F KSSWKIVKGAMVVARGTKSGTLYT+A C+N  A   
Subjt:  -----------------KNQ-----------WTLKDVRYIPSLK-KNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGE

Query:  SVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDM
        S SNSS+WHNRL H+++KGMKML AK  LE L+S D+
Subjt:  SVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDM

PHT34103.1 IAA-amino acid hydrolase ILR1-like 1 [Capsicum baccatum]1.2e-5163.83Show/hide
Query:  LSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKL--------KFDEIRDVVLSEK------------------
        + E GSV  HINEFNMIVSQL SV+INF+ +IKALILMSSL ESWDTV+A ISSSRG +KL        K    R V L++                   
Subjt:  LSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKL--------KFDEIRDVVLSEK------------------

Query:  NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIK
        NQWTLKDVRYIP LKKNLISIGQLDSTGYAT+F K SWKIVKGA+VVARGTKS TLYT+AGC+N AAV E  S S +WHNRL HM+ K
Subjt:  NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIK

TrEMBL top hitse value%identityAlignment
A0A2G2VMA4 IAA-amino acid hydrolase ILR1-like 15.7e-5263.83Show/hide
Query:  LSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKL--------KFDEIRDVVLSEK------------------
        + E GSV  HINEFNMIVSQL SV+INF+ +IKALILMSSL ESWDTV+A ISSSRG +KL        K    R V L++                   
Subjt:  LSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKL--------KFDEIRDVVLSEK------------------

Query:  NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIK
        NQWTLKDVRYIP LKKNLISIGQLDSTGYAT+F K SWKIVKGA+VVARGTKS TLYT+AGC+N AAV E  S S +WHNRL HM+ K
Subjt:  NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIK

A0A438E1E8 Retrovirus-related Pol polyprotein from transposon TNT 1-941.3e-5141.55Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK-----
        MY+KPSA NKV+LM +LFNL+++E  SV  H+NEFN I +QLSSVEI+F+DEI+ALI+++SL  SW+ +   +S+S G +KLK+++IRD++L+E+     
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK-----

Query:  -----------------------------------------------------NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVV
                                                             + W L+ VR+IP L++NLIS+GQLD  G+A  F   +WK+ KGA V+
Subjt:  -----------------------------------------------------NQWTLKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVV

Query:  ARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELK
        ARG K+GTLY ++   +  AV ++ +++S+WH RL HM+ KGMKML +K  L  L+S D   C +C++ KQK+VSF KT R LK
Subjt:  ARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELK

A0A6A2WFX9 ABC transporter B family member 68.2e-5147.28Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDV-----VLSEK
        MY+KPSA NKV+LM RLFNL+++E  SV  H+NE N I +QLSSVEI F DE++ALIL+SSL +SW+  +  +SSS  + KLKFD++R++         K
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDV-----VLSEK

Query:  NQWTL--------KDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKM
          W +         +VR+I  LK+NLIS+GQLD  GY+T FS   WKI K ++V+ARG K+GTLY ++   N     ES   S++WH RL++M+ K MKM
Subjt:  NQWTL--------KDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKM

Query:  LAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELK
        L +K  L  L++ D+  C +C+  KQ++VSF K  + LK
Subjt:  LAAKEVLERLESTDMSHCVNCVMSKQKRVSFTKTVRELK

A0A6A3APX2 Endomembrane protein 70 protein family isoform 16.9e-5039.09Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRD-----------
        MY+KPSA NKV+LM R+FNL+++EG S+  ++NE N I +QLSSVEI F+DE++ALIL+SSL +SW+  I  +SSS G++KLKFD++RD           
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRD-----------

Query:  ----------------------------------------------------------------------------VVLSEKNQWTLKDVRYIPSLKKNL
                                                                                    + LS +  WTLK VR+IP LK+NL
Subjt:  ----------------------------------------------------------------------------VVLSEKNQWTLKDVRYIPSLKKNL

Query:  ISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQ
        IS+GQLD  GY T F    WKI KGA+V+ARG K+GTLY ++   N  A  ++   S++WH RL HM+ KGMK L +K  L  L++ D+  C +C+ SKQ
Subjt:  ISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQ

Query:  KRVSFTK
        K+VSF K
Subjt:  KRVSFTK

A0A6A3BFY1 Uncharacterized protein6.9e-5039.37Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK-----
        MY+KPSA NKV+LM RLFNL+++EG SV  H+NE N I +QLSSVEI F+DE++ALIL+ SL +SW+  +  +SSS G+ KLKFD++RD+VLSE+     
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK-----

Query:  ----------------------------------------------------------------------------------------NQ--WTLKDVRY
                                                                                                NQ  W L  VR+
Subjt:  ----------------------------------------------------------------------------------------NQ--WTLKDVRY

Query:  IPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHC
        IP LK+NLIS+GQLD  GY+T FS   WKI KGA+V+ARG K+GTLY ++   N  AV +    S++WH RL HM+ KGMK+L +K  L  L++ D+  C
Subjt:  IPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHC

Query:  VNCVMSKQKRVSFTK
         +C+  KQK+VSF K
Subjt:  VNCVMSKQKRVSFTK

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.3e-1837.98Show/hide
Query:  LKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAG--CMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERL
        LKDVR++P L+ NLIS   LD  GY + F+   W++ KG++V+A+G   GTLY +    C       +   +  +WH R+ HM+ KG+++LA K ++   
Subjt:  LKDVRYIPSLKKNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAG--CMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERL

Query:  ESTDMSHCVNCVMSKQKRVSF-TKTVREL
        + T +  C  C+  KQ RVSF T + R+L
Subjt:  ESTDMSHCVNCVMSKQKRVSF-TKTVREL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-944.8e-0833.68Show/hide
Query:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK
        +Y   +  NK+YL  +L+ L +SEG + + H+N FN +++QL+++ +  E+E KA++L++SL  S+D +   I   + + +LK D    ++L+EK
Subjt:  MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEK

P93293 Uncharacterized mitochondrial protein AtMg003002.3e-0527.66Show/hide
Query:  SKSSWKIVKGAMVVARGTKSGTLYTSAGCM--NRAAVGESVSNSS-IWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFT
        S+   K++KG   + +G +  +LY   G +    + + E+  + + +WH+RL HM+ +GM++L  K  L+  + + +  C +C+  K  RV+F+
Subjt:  SKSSWKIVKGAMVVARGTKSGTLYTSAGCM--NRAAVGESVSNSS-IWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFT

Arabidopsis top hitse value%identityAlignment
ATMG00300.1 Gag-Pol-related retrotransposon family protein1.6e-0627.66Show/hide
Query:  SKSSWKIVKGAMVVARGTKSGTLYTSAGCM--NRAAVGESVSNSS-IWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFT
        S+   K++KG   + +G +  +LY   G +    + + E+  + + +WH+RL HM+ +GM++L  K  L+  + + +  C +C+  K  RV+F+
Subjt:  SKSSWKIVKGAMVVARGTKSGTLYTSAGCM--NRAAVGESVSNSS-IWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACAAAAAACCGTCGGCTATGAACAAGGTGTATTTGATGCTGAGATTGTTCAATCTACAATTGTCTGAAGGTGGATCTGTTGTGGACCATATAAATGAATTCAATAT
GATCGTAAGTCAACTGAGTTCGGTGGAAATTAATTTCGAGGATGAAATTAAAGCATTGATTTTGATGTCATCTTTACGCGAGTCGTGGGATACTGTTATTGCCGTAATCA
GCAGTTCTCGAGGATCTGATAAACTGAAGTTTGATGAAATTCGAGATGTAGTTCTGAGCGAAAAAAATCAGTGGACATTAAAGGATGTCAGATATATTCCTAGTCTCAAG
AAGAATCTGATCTCTATTGGTCAGTTGGATAGCACAGGTTATGCAACAAAGTTTAGTAAGAGTTCGTGGAAGATTGTGAAGGGTGCTATGGTGGTAGCACGTGGCACAAA
ATCTGGAACTTTATACACCTCTGCAGGGTGTATGAACAGAGCTGCGGTTGGTGAGAGTGTTTCAAATTCAAGTATATGGCACAATAGACTTGAACATATGAACATTAAAG
GAATGAAGATGTTGGCTGCAAAAGAAGTTTTAGAACGTCTGGAATCTACTGATATGAGTCATTGTGTGAACTGCGTTATGAGCAAACAGAAACGAGTTAGCTTCACAAAG
ACTGTTAGAGAATTGAAGAAAAGTACTGGGACAACGAAGCAAGTAGAAGTTGAGGTTGAGTTGCAGAACAATTCACAGAGTGATGTTGTAGCAGATACTCAAGAAACTCC
TGAGACTCTTGCTGAGGAATCAGAGATGAAGCAAGTTGGAGTTGAGGTTGAGGTTGAGTTGCTAAAAGATTCACCTAGTGATGTTGTAGCTGATACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTACAAAAAACCGTCGGCTATGAACAAGGTGTATTTGATGCTGAGATTGTTCAATCTACAATTGTCTGAAGGTGGATCTGTTGTGGACCATATAAATGAATTCAATAT
GATCGTAAGTCAACTGAGTTCGGTGGAAATTAATTTCGAGGATGAAATTAAAGCATTGATTTTGATGTCATCTTTACGCGAGTCGTGGGATACTGTTATTGCCGTAATCA
GCAGTTCTCGAGGATCTGATAAACTGAAGTTTGATGAAATTCGAGATGTAGTTCTGAGCGAAAAAAATCAGTGGACATTAAAGGATGTCAGATATATTCCTAGTCTCAAG
AAGAATCTGATCTCTATTGGTCAGTTGGATAGCACAGGTTATGCAACAAAGTTTAGTAAGAGTTCGTGGAAGATTGTGAAGGGTGCTATGGTGGTAGCACGTGGCACAAA
ATCTGGAACTTTATACACCTCTGCAGGGTGTATGAACAGAGCTGCGGTTGGTGAGAGTGTTTCAAATTCAAGTATATGGCACAATAGACTTGAACATATGAACATTAAAG
GAATGAAGATGTTGGCTGCAAAAGAAGTTTTAGAACGTCTGGAATCTACTGATATGAGTCATTGTGTGAACTGCGTTATGAGCAAACAGAAACGAGTTAGCTTCACAAAG
ACTGTTAGAGAATTGAAGAAAAGTACTGGGACAACGAAGCAAGTAGAAGTTGAGGTTGAGTTGCAGAACAATTCACAGAGTGATGTTGTAGCAGATACTCAAGAAACTCC
TGAGACTCTTGCTGAGGAATCAGAGATGAAGCAAGTTGGAGTTGAGGTTGAGGTTGAGTTGCTAAAAGATTCACCTAGTGATGTTGTAGCTGATACTTAA
Protein sequenceShow/hide protein sequence
MYKKPSAMNKVYLMLRLFNLQLSEGGSVVDHINEFNMIVSQLSSVEINFEDEIKALILMSSLRESWDTVIAVISSSRGSDKLKFDEIRDVVLSEKNQWTLKDVRYIPSLK
KNLISIGQLDSTGYATKFSKSSWKIVKGAMVVARGTKSGTLYTSAGCMNRAAVGESVSNSSIWHNRLEHMNIKGMKMLAAKEVLERLESTDMSHCVNCVMSKQKRVSFTK
TVRELKKSTGTTKQVEVEVELQNNSQSDVVADTQETPETLAEESEMKQVGVEVEVELLKDSPSDVVADT