CuGenDBv2

Lsi02G011490 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi02G011490
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Pentatricopeptide repeat
Genome location: chr02:13626468..13626935
RNA-Seq Expression: Lsi02G011490
Synteny: Lsi02G011490
Gene Ontology terms:
    GO:0009451 - RNA modification (biological process)
    GO:0043231 - intracellular membrane-bounded organelle (cellular component)
    GO:0003723 - RNA binding (molecular function)
    GO:0005515 - protein binding (molecular function)
InterPro domains:
    IPR002885 - Pentatricopeptide repeat
    IPR011990 - Tetratricopeptide-like helical domain superfamily
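
The genome location field can be split into a chromosome name and a coordinate pair, from which the length of the locus follows. The sketch below assumes the coordinates are 1-based and inclusive, which is a common convention for genome browsers but is not stated on this page; the function name `parse_location` is hypothetical.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, length).

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record itself).
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # With inclusive coordinates, both end points count toward the length.
    length = end - start + 1
    return chrom, start, end, length


print(parse_location("chr02:13626468..13626935"))
# ('chr02', 13626468, 13626935, 468)
```

Under that assumption the locus spans 468 bp.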


Homology
GenBank top hits | e value | % identity | Alignment
KAG6588590.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.7e-60 | % identity: 81.69
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PK PSIW TN+IKSYFD+GL+K ARNLFDEMPERDVVAWT MIV FTSCN Y Q+WA+FCEMLRS I PNAFT+SSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH IDGSMY+  ALL MYATCCATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

KAG7022386.1 putative pentatricopeptide repeat-containing protein, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 1.7e-60 | % identity: 81.69
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PK PSIW TN+IKSYFD+GL+K ARNLFDEMPERDVVAWT MIV FTSCN Y Q+WA+FCEMLRS I PNAFT+SSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH IDGSMY+  ALL MYATCCATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

XP_022931578.1 putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita moschata] | e value: 1.7e-60 | % identity: 81.69
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PK PSIW TN+IKSYFD+GL+K ARNLFDEMPERDVVAWT MIV FTSCN Y Q+WA+FCEMLRS I PNAFT+SSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH IDGSMY+  ALL MYATCCATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

XP_022989366.1 putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita maxima] | e value: 3.7e-60 | % identity: 81.69
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PK PSIW TN+IKSYFD GL+K ARNLFDEMPERDVVAWT MIV FTSCN Y Q+WA+FCEMLRS I PNAFT+SSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH IDGSMY+  ALL MYATCCATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

XP_023520790.1 putative pentatricopeptide repeat-containing protein At1g56570 [Cucurbita pepo subsp. pepo] | e value: 1.7e-60 | % identity: 81.69
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PK PSIW TN+IKSYFD+GL+K ARNLFDEMPERDVVAWT MIV FTSCN Y Q+WA+FCEMLRS I PNAFT+SSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH IDGSMY+  ALL MYATCCATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LW37 Uncharacterized protein | e value: 3.5e-56 | % identity: 76.06
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F P+GPS+W TN+IKSYFD+GLT+EA NLF+E+PERDVV WT MIV FTSCNHY QAW MF EMLRS++ PNAFTMSSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH ID S+Y+  ALL MYA  CATM DAL+VFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

A0A1S3BQ30 putative pentatricopeptide repeat-containing protein At1g56570 | e value: 4.5e-56 | % identity: 76.76
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PKGPS W TN+IKSYFD+GLT+EA NLF+E+PERDVV WT MIV FTSCNHYPQAW MF EMLRS++ PNAFTMSSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        K  ID S+Y+  ALL MYA  CATM DAL+VFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

A0A6J1DTG1 putative pentatricopeptide repeat-containing protein At1g56570 | e value: 2.2e-58 | % identity: 80.99
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F+PKGPSIW TN+IKSYFD+GLTKEARNLFDEMPERDVVAWTT+IV FTSCNHY QAWA+FCEM+RS+I+PNAFTMSSVLKA  G +ALSCG LAH LAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        K  ID SMY+  ALL MYAT CATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

A0A6J1EZ33 putative pentatricopeptide repeat-containing protein At1g56570 | e value: 8.0e-61 | % identity: 81.69
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PK PSIW TN+IKSYFD+GL+K ARNLFDEMPERDVVAWT MIV FTSCN Y Q+WA+FCEMLRS I PNAFT+SSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH IDGSMY+  ALL MYATCCATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

A0A6J1JJV4 putative pentatricopeptide repeat-containing protein At1g56570 | e value: 1.8e-60 | % identity: 81.69
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        F PK PSIW TN+IKSYFD GL+K ARNLFDEMPERDVVAWT MIV FTSCN Y Q+WA+FCEMLRS I PNAFT+SSVLKAC G KALSCG LAHSLAT
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        KH IDGSMY+  ALL MYATCCATM DALTVFNDIPLKT VS
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

SwissProt top hits | e value | % identity | Alignment
Q0WNP3 Pentatricopeptide repeat-containing protein At4g18520, chloroplastic | e value: 9.5e-19 | % identity: 36.36
Query:  TNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYI
        + ++  Y   G +++A N+  ++P RDVV+WT MI   +S  H  +A     EM++  ++PN FT SS LKAC  +++L  G   HS+A K+    ++++
Subjt:  TNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYI

Query:  WKALLHMYATCCATMGDALTVFNDIPLKTIVS
          AL+HMYA  C  + +A  VF+ +P K +VS
Subjt:  WKALLHMYATCCATMGDALTVFNDIPLKTIVS

Q9FXA9 Putative pentatricopeptide repeat-containing protein At1g56570 | e value: 3.6e-34 | % identity: 48.59
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        ++PK   I  TN+I SYF++GL +EAR+LFDEMP+RDVVAWT MI  + S N+  +AW  F EM++    PN FT+SSVLK+C   K L+ G L H +  
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        K  ++GS+Y+  A+++MYATC  TM  A  +F DI +K  V+
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

Q9LIC3 Putative pentatricopeptide repeat-containing protein At3g13770, mitochondrial | e value: 8.9e-17 | % identity: 36.97
Query:  KEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWKALLHMYATCCA
        ++AR + DEMPE++VV+WT MI  ++   H  +A  +F EM+RS   PN FT ++VL +CI    L  G   H L  K + D  +++  +LL MYA    
Subjt:  KEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWKALLHMYATCCA

Query:  TMGDALTVFNDIPLKTIVS
         + +A  +F  +P + +VS
Subjt:  TMGDALTVFNDIPLKTIVS

Q9LT48 Pentatricopeptide repeat-containing protein At3g20730 | e value: 8.9e-17 | % identity: 37.69
Query:  IIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWK
        +I  Y  +G  K AR LFD + +RDVV+WT MI  F+ C ++P A  +F EM R  +  N FT  SVLK+C     L  G   H    K +  G++ +  
Subjt:  IIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWK

Query:  ALLHMYATCCATMGDALTVFNDIPLKTIVS
        ALL +YA  C  M +A   F+ +  + +VS
Subjt:  ALLHMYATCCATMGDALTVFNDIPLKTIVS

Q9LU94 Putative pentatricopeptide repeat-containing protein At3g25970 | e value: 1.5e-16 | % identity: 36.09
Query:  TTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMY
        +  I+ SY   G    A  LFDEMP+RD V+W TMI  +TSC     AW +F  M RS  D + ++ S +LK     K    G+  H L  K   + ++Y
Subjt:  TTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMY

Query:  IWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        +  +L+ MYA  C  + DA   F +I     VS
Subjt:  IWKALLHMYATCCATMGDALTVFNDIPLKTIVS

Arabidopsis top hits | e value | % identity | Alignment
AT1G56570.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 2.6e-35 | % identity: 48.59
Query:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT
        ++PK   I  TN+I SYF++GL +EAR+LFDEMP+RDVVAWT MI  + S N+  +AW  F EM++    PN FT+SSVLK+C   K L+ G L H +  
Subjt:  FRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLAT

Query:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        K  ++GS+Y+  A+++MYATC  TM  A  +F DI +K  V+
Subjt:  KHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS

AT3G13770.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 6.3e-18 | % identity: 36.97
Query:  KEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWKALLHMYATCCA
        ++AR + DEMPE++VV+WT MI  ++   H  +A  +F EM+RS   PN FT ++VL +CI    L  G   H L  K + D  +++  +LL MYA    
Subjt:  KEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWKALLHMYATCCA

Query:  TMGDALTVFNDIPLKTIVS
         + +A  +F  +P + +VS
Subjt:  TMGDALTVFNDIPLKTIVS

AT3G20730.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.3e-18 | % identity: 37.69
Query:  IIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWK
        +I  Y  +G  K AR LFD + +RDVV+WT MI  F+ C ++P A  +F EM R  +  N FT  SVLK+C     L  G   H    K +  G++ +  
Subjt:  IIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYIWK

Query:  ALLHMYATCCATMGDALTVFNDIPLKTIVS
        ALL +YA  C  M +A   F+ +  + +VS
Subjt:  ALLHMYATCCATMGDALTVFNDIPLKTIVS

AT3G25970.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 1.1e-17 | % identity: 36.09
Query:  TTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMY
        +  I+ SY   G    A  LFDEMP+RD V+W TMI  +TSC     AW +F  M RS  D + ++ S +LK     K    G+  H L  K   + ++Y
Subjt:  TTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMY

Query:  IWKALLHMYATCCATMGDALTVFNDIPLKTIVS
        +  +L+ MYA  C  + DA   F +I     VS
Subjt:  IWKALLHMYATCCATMGDALTVFNDIPLKTIVS

AT4G18520.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 6.8e-20 | % identity: 36.36
Query:  TNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYI
        + ++  Y   G +++A N+  ++P RDVV+WT MI   +S  H  +A     EM++  ++PN FT SS LKAC  +++L  G   HS+A K+    ++++
Subjt:  TNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHSLATKHSIDGSMYI

Query:  WKALLHMYATCCATMGDALTVFNDIPLKTIVS
          AL+HMYA  C  + +A  VF+ +P K +VS
Subjt:  WKALLHMYATCCATMGDALTVFNDIPLKTIVS
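
The % identity reported for each hit can be approximated from the alignment itself. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and assumes identity is the number of identical columns divided by the alignment length; the page does not state how its values were computed, and `percent_identity` is a hypothetical helper.

```python
def percent_identity(query, midline):
    """Percentage of alignment columns whose midline shows an identical residue.

    `query` is the aligned query string (gap characters included) and
    `midline` is the match line printed beneath it.
    """
    if not query:
        raise ValueError("empty alignment")
    # Trailing spaces of the midline are often lost when text is copied,
    # so pad it back to the width of the query line.
    midline = midline.ljust(len(query))
    identical = sum(1 for column in midline[:len(query)] if column.isalpha())
    return 100.0 * identical / len(query)


# First nine columns of the first GenBank hit alignment above.
print(round(percent_identity("FRPKGPSIW", "F PK PSIW"), 2))
# 77.78
```

For a full hit, the query and midline segments of every alignment block would be concatenated before calling the function.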


Sequences
CDS sequence
ATGGATAAGCAACGCCACCTTACAATAAAATCCATCTTTTTTAGACCAAAAGGACCATCCATTTGGACCACAAACATCATCAAATCATACTTCGACGAAGGCTTAACCAA
AGAAGCTCGTAACCTGTTTGATGAAATGCCTGAAAGGGATGTGGTTGCCTGGACTACTATGATTGTTATCTTTACTTCTTGCAATCACTACCCTCAAGCGTGGGCTATGT
TCTGTGAGATGTTAAGGAGTCAAATTGACCCAAATGCCTTCACTATGTCTAGTGTTCTCAAGGCTTGCATAGGCACGAAGGCTCTTTCATGTGGGGATTTGGCTCATAGT
TTGGCCACAAAGCACAGTATTGACGGGTCGATGTACATCTGGAAAGCACTCTTGCACATGTATGCTACTTGCTGTGCTACCATGGGTGATGCATTGACTGTGTTTAATGA
TATACCTCTGAAGACTATTGTGTCATAG
mRNA sequence
ATGGATAAGCAACGCCACCTTACAATAAAATCCATCTTTTTTAGACCAAAAGGACCATCCATTTGGACCACAAACATCATCAAATCATACTTCGACGAAGGCTTAACCAA
AGAAGCTCGTAACCTGTTTGATGAAATGCCTGAAAGGGATGTGGTTGCCTGGACTACTATGATTGTTATCTTTACTTCTTGCAATCACTACCCTCAAGCGTGGGCTATGT
TCTGTGAGATGTTAAGGAGTCAAATTGACCCAAATGCCTTCACTATGTCTAGTGTTCTCAAGGCTTGCATAGGCACGAAGGCTCTTTCATGTGGGGATTTGGCTCATAGT
TTGGCCACAAAGCACAGTATTGACGGGTCGATGTACATCTGGAAAGCACTCTTGCACATGTATGCTACTTGCTGTGCTACCATGGGTGATGCATTGACTGTGTTTAATGA
TATACCTCTGAAGACTATTGTGTCATAG
Protein sequence
MDKQRHLTIKSIFFRPKGPSIWTTNIIKSYFDEGLTKEARNLFDEMPERDVVAWTTMIVIFTSCNHYPQAWAMFCEMLRSQIDPNAFTMSSVLKACIGTKALSCGDLAHS
LATKHSIDGSMYIWKALLHMYATCCATMGDALTVFNDIPLKTIVS
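
The relationship between the CDS and the protein sequence can be checked by translating the CDS codon by codon. The sketch below assumes the standard genetic code and reading frame 1 starting at the first base; the function name `translate` is hypothetical, and only the first 24 bases of the CDS plus a stop codon are used in the example.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order for each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "")
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First eight codons of the CDS above, followed by a TAG stop codon.
print(translate("ATGGATAAGCAACGCCACCTTACA" + "TAG"))
# MDKQRHLT
```

The output matches the first eight residues of the protein sequence; passing the full CDS, with its line breaks joined, would be the way to check the whole record.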