CuGenDBv2

Carg14468 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg14468
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: Protein of unknown function (DUF1645)
Genome location: Carg_Chr18:8434006..8434620
RNA-Seq Expression: Carg14468
Synteny: Carg14468
Gene Ontology terms: NA
InterPro domains: IPR012442 - Protein of unknown function DUF1645, plant
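
As a quick consistency check on the genome location, the span can be parsed and its length computed. The sketch below is illustrative only: the `parse_location` and `span_length` helpers and the `seqid:start..end` pattern are assumptions based on how the coordinate is printed in this record, not part of any CuGenDB interface. Assuming 1-based inclusive coordinates, the span is 615 bp, which is consistent with the 204-residue protein listed under Sequences plus a stop codon (3 × 205 = 615).

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive interval (assumed convention)."""
    return end - start + 1


seqid, start, end = parse_location("Carg_Chr18:8434006..8434620")
print(seqid, span_length(start, end))  # Carg_Chr18 615
```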


Homology
GenBank top hits (e value; % identity; alignment)
KAG6573741.1 hypothetical protein SDJN03_27628, partial [Cucurbita argyrosperma subsp. sororia]; e value: 6.9e-107; % identity: 100
Query:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG
        MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG
Subjt:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG

Query:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS
        KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS
Subjt:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS

Query:  NRNF
        NRNF
Subjt:  NRNF

XP_011656255.1 uncharacterized protein LOC105435701 [Cucumis sativus]; e value: 1.0e-46; % identity: 58.25
Query:  QELQEQTLTDRIEDKE-LDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIE-ATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTL
        Q   E +   + +D E LDDFSF+ LNPD SPI AEDAFLNGQIR VFPL  +   + +       LKNLF+EE     + R+       S + AP   L
Subjt:  QELQEQTLTDRIEDKE-LDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIE-ATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTL

Query:  GKKSHSTGFSKLWRFGEKIRRCSSDGK-EAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNA
        GKKS+STGFSKLWRFG+KIRR SS+GK EAF+FLR+ SSGS GEKA +  K  K+ + ETAS YHER Y+RNRAE E+NKRKS+LPYRSNLMG F+ PN 
Subjt:  GKKSHSTGFSKLWRFGEKIRRCSSDGK-EAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNA

Query:  GSNRNF
        G NRNF
Subjt:  GSNRNF

XP_022945749.1 uncharacterized protein LOC111449897 [Cucurbita moschata]; e value: 9.6e-101; % identity: 96.08
Query:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG
        MQEL+EQTLTDRIED ELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEA TKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAP TTLG
Subjt:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG

Query:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS
        KKSHSTGFSKLWRFGEKIRR SSDGKEAFVFLR+DSSGSGGEKAAENRKGGKRTKGETASCYHERLY+RNRAEKEINKRKSFLPYRSNLMG FSGPNAGS
Subjt:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS

Query:  NRNF
        NRNF
Subjt:  NRNF

XP_022966688.1 uncharacterized protein LOC111466316 [Cucurbita maxima]; e value: 1.9e-96; % identity: 92.65
Query:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG
        MQELQE+TLTDRIED ELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFP+AIEA TKEP AVRPPLKNLFMEELISPTENRSNVSPDE SASA P TTLG
Subjt:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG

Query:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS
        KKSHSTGFSKLWRFGEKIRR SSDGKEAFVFLR+DSSGSGGEKAAENRKGG RTKGETASCYHERLY+RNRAEKEINKRKSFLPYRSN+MG FSGPNAGS
Subjt:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS

Query:  NRNF
        NR F
Subjt:  NRNF

XP_023542069.1 uncharacterized protein LOC111802044 [Cucurbita pepo subsp. pepo]; e value: 1.7e-97; % identity: 93.63
Query:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG
        MQELQ+QTLTDRI+D ELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEA TKEP AVRPPLKNLFMEELISPTENRSNVSPDE SASAAP TTLG
Subjt:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG

Query:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS
        KKSHSTGFSKLWRFGEKIRR SSDGKEAFVFLR+DSS SGGEKAAENRKGGKRTKGETASCYHERLY+RNRAEKEINKRKSFLPYRSNLMG F GPNAGS
Subjt:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS

Query:  NRNF
        NRNF
Subjt:  NRNF
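
The % identity values reported for these hits appear to be the number of identical columns divided by the alignment length. A minimal sketch of that calculation follows, assuming the usual BLAST midline convention (a letter marks an identity, `+` a conservative substitution, a space a mismatch or gap). The `percent_identity` helper is hypothetical and is shown on only the first 20 columns of the Cucurbita moschata alignment. Applying the same count to that full alignment gives 196 identities over 204 columns, which rounds to the 96.08 listed for it.

```python
def percent_identity(midline: str) -> float:
    """Share of alignment columns whose midline character is a letter.

    Assumes BLAST-style midlines: a letter marks an identical residue,
    '+' a conservative substitution, and a space a mismatch or gap.
    """
    if not midline:
        raise ValueError("empty midline")
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / len(midline)


# First 20 columns of the XP_022945749.1 (Cucurbita moschata) midline.
midline = "MQEL+EQTLTDRIED ELDD"
print(round(percent_identity(midline), 2))  # 90.0
```

When an alignment is printed in several blocks, the midline segments would need to be concatenated, trailing spaces included, before calling the function.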

TrEMBL top hits (e value; % identity; alignment)
A0A0A0KQ75 Uncharacterized protein; e value: 5.1e-47; % identity: 58.25
Query:  QELQEQTLTDRIEDKE-LDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIE-ATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTL
        Q   E +   + +D E LDDFSF+ LNPD SPI AEDAFLNGQIR VFPL  +   + +       LKNLF+EE     + R+       S + AP   L
Subjt:  QELQEQTLTDRIEDKE-LDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIE-ATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTL

Query:  GKKSHSTGFSKLWRFGEKIRRCSSDGK-EAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNA
        GKKS+STGFSKLWRFG+KIRR SS+GK EAF+FLR+ SSGS GEKA +  K  K+ + ETAS YHER Y+RNRAE E+NKRKS+LPYRSNLMG F+ PN 
Subjt:  GKKSHSTGFSKLWRFGEKIRRCSSDGK-EAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNA

Query:  GSNRNF
        G NRNF
Subjt:  GSNRNF

A0A5A7SYN7 Uncharacterized protein; e value: 6.2e-45; % identity: 60.91
Query:  EDKE-LDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIE-ATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLGKKSHSTGFSKL
        ED E LDDFSF+ LNPD SPI AEDAFLNGQIR VFPL  +   + +       LKNLF+EE     + R          +AAP   LGKKS+STGFSKL
Subjt:  EDKE-LDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIE-ATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLGKKSHSTGFSKL

Query:  WRFGEKIRRCSSDGK-EAFVFLRTDSSGSG--GEKAAENRK-GGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGSNRNF
        WRFG+KIRR SS+GK EAF+FLR+ SSGSG   EK  E  K   K+ K ETAS YHER Y+RNRAE E+NKRKS+LPYRSNLMG F+ PN G NRNF
Subjt:  WRFGEKIRRCSSDGK-EAFVFLRTDSSGSG--GEKAAENRK-GGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGSNRNF

A0A6J1DCT2 uncharacterized protein LOC111019169; e value: 7.3e-46; % identity: 55.71
Query:  ELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPL----------AIEATTKEPPAVR----PPLKNLFM--EELISPTENRSNVSPD-------ELSASA
        + DDFSF+  NPD SPISAEDAF+NGQIR VFP+            E TT  P  VR      LK LFM  EEL S T++ + V+P+            A
Subjt:  ELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPL----------AIEATTKEPPAVR----PPLKNLFM--EELISPTENRSNVSPD-------ELSASA

Query:  APTTTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKG-------GKRTKG--ETASCYHERLYSRNRAEKEINKRKSFLP
        AP   LGKKS+STGFSKLWRFG++IRR SSDGKEAFVFLR+ SSG GGE   +   G       G+RTKG  ETASCYHER Y+RNRAE E+NKRKS+LP
Subjt:  APTTTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKG-------GKRTKG--ETASCYHERLYSRNRAEKEINKRKSFLP

Query:  YRSNLMGLFSGPNAGSNRN
        YRSNL+G F+  N  +N N
Subjt:  YRSNLMGLFSGPNAGSNRN

A0A6J1G1U6 uncharacterized protein LOC111449897; e value: 4.7e-101; % identity: 96.08
Query:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG
        MQEL+EQTLTDRIED ELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEA TKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAP TTLG
Subjt:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG

Query:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS
        KKSHSTGFSKLWRFGEKIRR SSDGKEAFVFLR+DSSGSGGEKAAENRKGGKRTKGETASCYHERLY+RNRAEKEINKRKSFLPYRSNLMG FSGPNAGS
Subjt:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS

Query:  NRNF
        NRNF
Subjt:  NRNF

A0A6J1HSU3 uncharacterized protein LOC111466316; e value: 9.1e-97; % identity: 92.65
Query:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG
        MQELQE+TLTDRIED ELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFP+AIEA TKEP AVRPPLKNLFMEELISPTENRSNVSPDE SASA P TTLG
Subjt:  MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLG

Query:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS
        KKSHSTGFSKLWRFGEKIRR SSDGKEAFVFLR+DSSGSGGEKAAENRKGG RTKGETASCYHERLY+RNRAEKEINKRKSFLPYRSN+MG FSGPNAGS
Subjt:  KKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGS

Query:  NRNF
        NR F
Subjt:  NRNF

SwissProt top hits (e value; % identity; alignment)
No hits found

Arabidopsis top hits (e value; % identity; alignment)
AT1G23710.1 Protein of unknown function (DUF1645); e value: 5.6e-22; % identity: 35.2
Query:  ELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPL---------AIEATTKEPPAV----RPPLKNLFMEELISPTENRSNVSPDE-
        +L+E    +  +++E ++FSF  +N + SPI+A++AF +GQIR VFPL           E    +  +V    RP L+ LF+E+     +       ++ 
Subjt:  ELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPL---------AIEATTKEPPAV----RPPLKNLFMEELISPTENRSNVSPDE-

Query:  -----------LSASAAPTTTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFL-------RTDSSGSGGEKAA---------ENRKGGKRT-------
                     A A+P T   +KS+STGFSKLWRF + + R +SDG++AFVFL       RT SS S    AA         E +KG ++T       
Subjt:  -----------LSASAAPTTTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFL-------RTDSSGSGGEKAA---------ENRKGGKRT-------

Query:  -KGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGSNRN
         K  T    HE+LY RNRA KE  K +S+LPY+   +G F+  N G +RN
Subjt:  -KGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGSNRN

AT1G70420.1 Protein of unknown function (DUF1645); e value: 2.6e-27; % identity: 39.64
Query:  QTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPL---AIEATTKEPPAVRPPLKNLFMEELISPTENRSN--VSP-----DELSASAAPT
        Q  TD+ ++   +DFSF S+N D SPI+A++AF +GQIR V+PL    I     E   +R PLK LF+E   +  E   +  V P     +     A+P 
Subjt:  QTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPL---AIEATTKEPPAVRPPLKNLFMEELISPTENRSN--VSP-----DELSASAAPT

Query:  TTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGS---------------GGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKS
        T   +KS+STGFSKLWRF + + R +SDGK+AFVFL   SS +                 EK  E  K  K+         HE+LY RNRA +E  KR+S
Subjt:  TTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGS---------------GGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKS

Query:  FLPYRSNLMGLFSGPNAGSNRN
        +LPY+   +G F+  N G  RN
Subjt:  FLPYRSNLMGLFSGPNAGSNRN

AT3G27880.1 Protein of unknown function (DUF1645); e value: 7.6e-11; % identity: 34.45
Query:  SPTENRSNVSPDELSASAAPTTTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKE
        +P  + +++SP   S     + + G  S ST  +K WR  + ++R  SDGK++  FL   +     E +    K  K+         HE+ Y RN+A KE
Subjt:  SPTENRSNVSPDELSASAAPTTTLGKKSHSTGFSKLWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKE

Query:  INKRKSFLPYRSNLMGLFS
         +KRKS+LPY+ +L+GLFS
Subjt:  INKRKSFLPYRSNLMGLFS

AT5G62770.1 Protein of unknown function (DUF1645); e value: 3.6e-05; % identity: 26.41
Query:  EQTLTDRIEDKELD---DFSF-LSLNPDRSPI-SAEDAFLNGQIRSVFPLAIEA----------TTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELS
        +QT +D    ++ D   DF+F    N    P+ +A++ F NGQIR + P    A          TT  P   RP L+ L  E+   P  N S+ + ++L+
Subjt:  EQTLTDRIEDKELD---DFSF-LSLNPDRSPI-SAEDAFLNGQIRSVFPLAIEA----------TTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELS

Query:  ASAAPTTTLGK-----------------------KSHSTGFSKLWRFGEKIR-RCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERL
             T  + K                       KSHS GFSK W+    +  R SS+G +  VF           K  +     +R + E  S      
Subjt:  ASAAPTTTLGK-----------------------KSHSTGFSKLWRFGEKIR-RCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERL

Query:  YSRNRAEKEINKRKSFLPYRSNLMGLFSGPN
          R   E+E  KR++++PYR +++G+    N
Subjt:  YSRNRAEKEINKRKSFLPYRSNLMGLFSGPN


Sequences
CDS sequence
ATGCAAGAACTCCAAGAACAAACCCTAACAGACCGAATCGAAGACAAAGAACTCGATGATTTCTCTTTTCTTTCTTTGAATCCAGATCGCTCTCCGATCAGCGCCGAAGA
CGCCTTCCTTAACGGCCAGATTCGCTCTGTTTTCCCCCTGGCGATTGAGGCGACGACGAAGGAACCGCCGGCCGTACGTCCGCCATTGAAGAACCTTTTCATGGAGGAAT
TGATTTCTCCGACTGAGAATCGGAGTAATGTTTCGCCGGATGAGTTGTCGGCTTCGGCGGCGCCGACGACGACGCTGGGGAAGAAGAGCCACTCGACGGGATTCTCGAAG
CTGTGGAGGTTTGGGGAGAAGATTCGGCGTTGTAGCAGCGATGGGAAGGAGGCGTTTGTGTTTCTGAGAACCGATTCTTCGGGAAGCGGCGGAGAGAAGGCGGCGGAGAA
TCGGAAGGGAGGGAAGCGGACGAAAGGGGAAACGGCGTCGTGTTATCACGAGCGGTTGTACTCGAGAAATAGAGCGGAGAAAGAAATTAATAAACGGAAATCGTTTCTGC
CGTATAGAAGCAATCTCATGGGGCTTTTTAGCGGTCCAAACGCCGGTTCGAATAGAAATTTTTAG
mRNA sequence
ATGCAAGAACTCCAAGAACAAACCCTAACAGACCGAATCGAAGACAAAGAACTCGATGATTTCTCTTTTCTTTCTTTGAATCCAGATCGCTCTCCGATCAGCGCCGAAGA
CGCCTTCCTTAACGGCCAGATTCGCTCTGTTTTCCCCCTGGCGATTGAGGCGACGACGAAGGAACCGCCGGCCGTACGTCCGCCATTGAAGAACCTTTTCATGGAGGAAT
TGATTTCTCCGACTGAGAATCGGAGTAATGTTTCGCCGGATGAGTTGTCGGCTTCGGCGGCGCCGACGACGACGCTGGGGAAGAAGAGCCACTCGACGGGATTCTCGAAG
CTGTGGAGGTTTGGGGAGAAGATTCGGCGTTGTAGCAGCGATGGGAAGGAGGCGTTTGTGTTTCTGAGAACCGATTCTTCGGGAAGCGGCGGAGAGAAGGCGGCGGAGAA
TCGGAAGGGAGGGAAGCGGACGAAAGGGGAAACGGCGTCGTGTTATCACGAGCGGTTGTACTCGAGAAATAGAGCGGAGAAAGAAATTAATAAACGGAAATCGTTTCTGC
CGTATAGAAGCAATCTCATGGGGCTTTTTAGCGGTCCAAACGCCGGTTCGAATAGAAATTTTTAG
Protein sequence
MQELQEQTLTDRIEDKELDDFSFLSLNPDRSPISAEDAFLNGQIRSVFPLAIEATTKEPPAVRPPLKNLFMEELISPTENRSNVSPDELSASAAPTTTLGKKSHSTGFSK
LWRFGEKIRRCSSDGKEAFVFLRTDSSGSGGEKAAENRKGGKRTKGETASCYHERLYSRNRAEKEINKRKSFLPYRSNLMGLFSGPNAGSNRNF
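
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch rather than the database's own pipeline: the `translate` helper is hypothetical, the standard nuclear code is assumed, and only the first 30 and last 15 nucleotides of the CDS are used as input. It stops at the first stop codon and raises `KeyError` on ambiguous bases such as `N`.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("U", "T")
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGCAAGAACTCCAAGAACAAACCCTAACA"))  # MQELQEQTLT
print(translate("AATAGAAATTTTTAG"))                 # NRNF
```

Joining the six CDS lines into one string and passing it to `translate` should allow the full product to be compared against the protein sequence listed here.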