; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CcUC02G018600 (gene) of Watermelon (PI 537277) v1 genome

Gene IDCcUC02G018600
OrganismCitrullus colocynthis (Watermelon (PI 537277) v1)
DescriptionWound-responsive family protein
Genome locationCicolChr02:1440420..1442694
RNA-Seq ExpressionCcUC02G018600
SyntenyCcUC02G018600
Gene Ontology termsNA
InterPro domainsIPR022251 - Protein of unknown function wound-induced


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6738966.1 hypothetical protein POTOM_058601 [Populus tomentosa]4.2e-4847.06Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPS--ANHL-----QQSEESLRTVMYLSCWESNASGLEKAVWASE
        MSS+++A +VAA VG VEALKDQG CRWN+ LRS H +A+NHVRS SQAKKLSS+  +  +N +     +QSEESLR VMYLSCWE  +S      W   
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPS--ANHL-----QQSEESLRTVMYLSCWESNASGLEKAVWASE

Query:  FHVHATEKPRAKAYLKTQ--LYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVG
          +   E  + + + +    L   H     +      + K    SS    +  F+  K  Q      S +S   + + S    +KMSS+++AW+VAA+VG
Subjt:  FHVHATEKPRAKAYLKTQ--LYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVG

Query:  VVEALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGKQQQ---KQSEESFRTVMYLSCWGP
         +EALKDQG CRWN T+RS HQ+AKNHVRS  QA +LS SS+A++S K ++   KQSEES R VMYLSCWGP
Subjt:  VVEALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGKQQQ---KQSEESFRTVMYLSCWGP

KAG6738967.1 hypothetical protein POTOM_058602 [Populus tomentosa]5.1e-4640.5Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSA-------VPSANHLQQSEESLRTVMYLSCWESNASGLEKAVW---
        MSS+++A +VAA +G VEALKDQG CRWN+ LRS HH+A+ HVRS SQAKKLSS+       +      +QSEESLR VMYLSCW           W   
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSA-------VPSANHLQQSEESLRTVMYLSCWESNASGLEKAVW---

Query:  --------------------ASEFHVHATEKPRAKAYLKTQLYPSHSAINTRF------PGAISSRKGKNTSSPKKTTGSF---------EIYKQPQPDP
                                H HA    R+ +  K     S + I+ +           S RK  + S  KK + S          E+  +   + 
Subjt:  --------------------ASEFHVHATEKPRAKAYLKTQLYPSHSAINTRF------PGAISSRKGKNTSSPKKTTGSF---------EIYKQPQPDP

Query:  IHHSIKSKNYITFESKKK-------------RKKMSSSTRAWVVAASVGVVEALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGKQQQ
        +   +    ++  ++KK+              +KMSS+++AW+VAA+VG VEALKDQG CRWN T+RS HQ+AKNHVRS  QA +LS SS+A++S K ++
Subjt:  IHHSIKSKNYITFESKKK-------------RKKMSSSTRAWVVAASVGVVEALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGKQQQ

Query:  ---KQSEESFRTVMYLSCWGP
           KQSEES R VMYLSCWGP
Subjt:  ---KQSEESFRTVMYLSCWGP

MBA0649120.1 hypothetical protein [Gossypium klotzschianum]1.8e-4345.35Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCW--ESNASGLEKAVWASE
        M+SS RA +VAA++G VEALKDQGICRWN+  ++   +A+N+VRS SQAK LSS   +A        +QSEESLRTVMYLSCW   +  S +    WA+ 
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCW--ESNASGLEKAVWASE

Query:  FHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVV
                                       GA+ + K +                         +IK++ +          KMSS++RAWVVAAS+G V
Subjt:  FHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVV

Query:  EALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN
        EALKDQGICRWN T+RS  Q+AKNHVRS  QA  LS  SSAA+  G  + KQSEES RTVMYLSCWGPN
Subjt:  EALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN

MBA0754482.1 hypothetical protein [Gossypium gossypioides]1.1e-4046.27Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCWESNASGLEKAVWASEFH
        MSS++RA VVAA++G VEALKDQGICRWN+ +RS   +A+NHVRS SQAK LSS   +A     N  +QSEESLRTV+          GL  +   S   
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCWESNASGLEKAVWASEFH

Query:  VHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSI-KSKNYITFESKKKRKKMSSSTRAWVVAASVGVVE
                          P+    +            KN S   +   +  +  +P+      SI  S+  IT  +     +MSS++RAW+VAAS+G VE
Subjt:  VHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSI-KSKNYITFESKKKRKKMSSSTRAWVVAASVGVVE

Query:  ALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN
        ALKDQGICRWN T+RS  Q+AKNHVRS  QA  LS  SSAA+  G  + KQSEES RTVMYLSCWGPN
Subjt:  ALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN

XP_038703866.1 uncharacterized protein LOC120000071 [Tripterygium wilfordii]1.4e-4345.19Show/hide
Query:  SSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA--NHL-----QQSEESLRTVMYLSCWESNASGLEKAVWASEF
        SS++RA +VAA++G VEALKDQGICRWN+ LRS H + +N+V+SIS   KLSS   +A  N L     +QSEESLRTVMYLSCW  N+   + +      
Subjt:  SSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA--NHL-----QQSEESLRTVMYLSCWESNASGLEKAVWASEF

Query:  HVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVVE
         V    KP A            + I +     +S+R   +    K TTG                                KMSS++RAWV+AASVG VE
Subjt:  HVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVVE

Query:  ALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGK---QQQKQSEESFRTVMYLSCWGPN
        ALKDQGICRWN T+RS HQ+A+  VR + QA RL   S+A+VS K   ++ ++SEES R VMYLSCWGPN
Subjt:  ALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGK---QQQKQSEESFRTVMYLSCWGPN

TrEMBL top hitse value%identityAlignment
A0A5J5AAH5 Uncharacterized protein6.5e-3942.07Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLS-------SAVPSANHLQQSEESLRTVMYLSCWESNASGLEKAVWASE
        MSS+++A +VAA+VG+VEALKDQG CRWN+ +RS H +A+N++RS+SQAKKLS       S        +QSEESLR V+ L   E+    LE+ V    
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLS-------SAVPSANHLQQSEESLRTVMYLSCWESNASGLEKAVWASE

Query:  FHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVV
                                                                            +KN      +K++KKM+S++RAW+VA SVG+V
Subjt:  FHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVV

Query:  EALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGK---QQQKQSEESFRTVMYLSCWGPN
        EALKDQG CRWN TIRS HQ+AKN++RS+ QA +LS SS+A+VS K   ++ KQSEES R VMYLSCWGPN
Subjt:  EALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGK---QQQKQSEESFRTVMYLSCWGPN

A0A5N5J079 Uncharacterized protein5.0e-3944.2Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSA-------VPSANHLQQSEESLRTV------MYLSCWESNASGLEK
        MSS+++A +VAA +G VEALKDQG CRWN+ LRS H +A+NHVRS+SQAKKLSS+       + +    +QSEESLR V      M    +   A  LE 
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSA-------VPSANHLQQSEESLRTV------MYLSCWESNASGLEK

Query:  AVWASEFHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVA
           ASE  + A      +A         + A+       ISS     +   +    +F +Y                YIT        KMSS+++AW+VA
Subjt:  AVWASEFHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVA

Query:  ASVGVVEALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGK---QQQKQSEESFRTVMYLSCWGP
        A++G VEALKDQG CRWN T+RS HQ+AKNHVRS  QA  LS SS+A++S K    + KQSEES R VMY SCWGP
Subjt:  ASVGVVEALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGK---QQQKQSEESFRTVMYLSCWGP

A0A7J8UFF6 Uncharacterized protein8.8e-4445.35Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCW--ESNASGLEKAVWASE
        M+SS RA +VAA++G VEALKDQGICRWN+  ++   +A+N+VRS SQAK LSS   +A        +QSEESLRTVMYLSCW   +  S +    WA+ 
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCW--ESNASGLEKAVWASE

Query:  FHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVV
                                       GA+ + K +                         +IK++ +          KMSS++RAWVVAAS+G V
Subjt:  FHVHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVV

Query:  EALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN
        EALKDQGICRWN T+RS  Q+AKNHVRS  QA  LS  SSAA+  G  + KQSEES RTVMYLSCWGPN
Subjt:  EALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN

A0A7J9D187 Uncharacterized protein5.3e-4146.27Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCWESNASGLEKAVWASEFH
        MSS++RA VVAA++G VEALKDQGICRWN+ +RS   +A+NHVRS SQAK LSS   +A     N  +QSEESLRTV+          GL  +   S   
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCWESNASGLEKAVWASEFH

Query:  VHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSI-KSKNYITFESKKKRKKMSSSTRAWVVAASVGVVE
                          P+    +            KN S   +   +  +  +P+      SI  S+  IT  +     +MSS++RAW+VAAS+G VE
Subjt:  VHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSI-KSKNYITFESKKKRKKMSSSTRAWVVAASVGVVE

Query:  ALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN
        ALKDQGICRWN T+RS  Q+AKNHVRS  QA  LS  SSAA+  G  + KQSEES RTVMYLSCWGPN
Subjt:  ALKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN

A0A7J9FRS2 Uncharacterized protein7.2e-3842.7Show/hide
Query:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCWESNASGLEKAVWASEFH
        MSS++RA VVAA++G VEALKDQGICRWN+ +RS   +A+NHVRS SQAK LSS   +A     N  +QSEESLRTV+           ++  V+A    
Subjt:  MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSA-----NHLQQSEESLRTVMYLSCWESNASGLEKAVWASEFH

Query:  VHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVVEA
                                                                                        +MSSS+RAW+VAAS+G VEA
Subjt:  VHATEKPRAKAYLKTQLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVVEA

Query:  LKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN
        LKDQGICRWN T+RS  Q+AKNHVRS  QA  LS  SSAA+  G  + KQSEES RTVMYLSCWGPN
Subjt:  LKDQGICRWNQTIRSAHQYAKNHVRSMPQATRLSG-SSAAVVSGKQQQKQSEESFRTVMYLSCWGPN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G10265.1 Wound-responsive family protein1.1e-2260.47Show/hide
Query:  MSSSTRAWVVAASVGVVEALKDQ-GICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGKQQQKQSEESFRTVMYLSCWGPN
        MSS+++ W+VAAS+G VEALKDQ G+CRWN  IRSA+QY +N++RS+ QA +LS SS   +    + KQ+EES RTVMYLSCWGP+
Subjt:  MSSSTRAWVVAASVGVVEALKDQ-GICRWNQTIRSAHQYAKNHVRSMPQATRLSGSSAAVVSGKQQQKQSEESFRTVMYLSCWGPN

AT4G10270.1 Wound-responsive family protein9.3e-2258.89Show/hide
Query:  MSSSTRAWVVAASVGVVEALKDQ-GICRWNQTIRSAHQYAKNHVRSMPQATRLSGS--SAAVVSG--KQQQKQSEESFRTVMYLSCWGPN
        MSS+++AW VA S+G VEALKDQ G+CRWN  +RS +Q+ +N+VRS+ Q  R S S  SAAV S    ++ K++EES RTVMYLSCWGPN
Subjt:  MSSSTRAWVVAASVGVVEALKDQ-GICRWNQTIRSAHQYAKNHVRSMPQATRLSGS--SAAVVSG--KQQQKQSEESFRTVMYLSCWGPN

AT4G33560.1 Wound-responsive family protein3.4e-0838.55Show/hide
Query:  AWVVAASVGVVEALKDQGICRWNQTIRSAHQYAKNHVR-----SMPQATRLSGSSAAVVSGKQQQKQSEESFRTVMYLSCWGP
        +W VA ++  VE LKDQG+ RWN   R  H+ A   VR     S P     S S+ ++ S        E SF   M LSC+GP
Subjt:  AWVVAASVGVVEALKDQGICRWNQTIRSAHQYAKNHVR-----SMPQATRLSGSSAAVVSGKQQQKQSEESFRTVMYLSCWGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCATCAACAAGAGCTTTGGTCGTAGCAGCTACCGTCGGCGTCGTGGAGGCTCTCAAGGATCAAGGAATCTGCCGGTGGAATCATGTCTTAAGATCCGCCCATCA
CTACGCCAGAAACCATGTCCGGTCTATTTCTCAGGCCAAGAAGCTTTCCTCCGCCGTGCCTTCCGCCAACCACCTCCAACAGTCCGAGGAATCTCTGAGAACGGTGATGT
ACCTTAGCTGTTGGGAGTCCAACGCCTCCGGTTTAGAAAAGGCGGTTTGGGCCTCAGAATTCCACGTTCACGCCACGGAGAAACCACGTGCAAAAGCTTACCTAAAAACC
CAACTCTACCCCTCCCACTCCGCTATAAACACCCGGTTTCCAGGGGCAATATCGTCCAGGAAAGGAAAAAACACATCCAGCCCAAAGAAAACCACGGGCAGTTTCGAAAT
ATATAAACAGCCACAGCCAGACCCCATCCACCACTCAATCAAATCAAAAAACTACATAACTTTCGAATCGAAGAAGAAAAGGAAAAAAATGAGTTCATCAACGAGAGCTT
GGGTTGTAGCCGCCAGTGTGGGCGTCGTGGAAGCCTTAAAAGATCAAGGAATTTGCCGATGGAATCAAACCATTAGATCGGCGCATCAATACGCCAAAAATCATGTCAGA
TCTATGCCTCAGGCCACCAGATTGAGTGGCTCTTCCGCCGCCGTGGTTTCCGGCAAGCAACAGCAGAAGCAATCGGAAGAATCATTCAGAACCGTCATGTATTTGAGCTG
TTGGGGTCCCAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCATCAACAAGAGCTTTGGTCGTAGCAGCTACCGTCGGCGTCGTGGAGGCTCTCAAGGATCAAGGAATCTGCCGGTGGAATCATGTCTTAAGATCCGCCCATCA
CTACGCCAGAAACCATGTCCGGTCTATTTCTCAGGCCAAGAAGCTTTCCTCCGCCGTGCCTTCCGCCAACCACCTCCAACAGTCCGAGGAATCTCTGAGAACGGTGATGT
ACCTTAGCTGTTGGGAGTCCAACGCCTCCGGTTTAGAAAAGGCGGTTTGGGCCTCAGAATTCCACGTTCACGCCACGGAGAAACCACGTGCAAAAGCTTACCTAAAAACC
CAACTCTACCCCTCCCACTCCGCTATAAACACCCGGTTTCCAGGGGCAATATCGTCCAGGAAAGGAAAAAACACATCCAGCCCAAAGAAAACCACGGGCAGTTTCGAAAT
ATATAAACAGCCACAGCCAGACCCCATCCACCACTCAATCAAATCAAAAAACTACATAACTTTCGAATCGAAGAAGAAAAGGAAAAAAATGAGTTCATCAACGAGAGCTT
GGGTTGTAGCCGCCAGTGTGGGCGTCGTGGAAGCCTTAAAAGATCAAGGAATTTGCCGATGGAATCAAACCATTAGATCGGCGCATCAATACGCCAAAAATCATGTCAGA
TCTATGCCTCAGGCCACCAGATTGAGTGGCTCTTCCGCCGCCGTGGTTTCCGGCAAGCAACAGCAGAAGCAATCGGAAGAATCATTCAGAACCGTCATGTATTTGAGCTG
TTGGGGTCCCAATTAA
Protein sequenceShow/hide protein sequence
MSSSTRALVVAATVGVVEALKDQGICRWNHVLRSAHHYARNHVRSISQAKKLSSAVPSANHLQQSEESLRTVMYLSCWESNASGLEKAVWASEFHVHATEKPRAKAYLKT
QLYPSHSAINTRFPGAISSRKGKNTSSPKKTTGSFEIYKQPQPDPIHHSIKSKNYITFESKKKRKKMSSSTRAWVVAASVGVVEALKDQGICRWNQTIRSAHQYAKNHVR
SMPQATRLSGSSAAVVSGKQQQKQSEESFRTVMYLSCWGPN