; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg17319 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg17319
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUsp domain-containing protein
Genome locationCarg_Chr18:10548340..10549817
RNA-Seq ExpressionCarg17319
SyntenyCarg17319
Gene Ontology termsNA
InterPro domainsIPR006016 - UspA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574035.1 hypothetical protein SDJN03_27922, partial [Cucurbita argyrosperma subsp. sororia]2.0e-8978.95Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------

Query:  ----------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCM
                                    VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW GGGDGGGTVEYCIQNASCM
Subjt:  ----------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCM

Query:  AIAVRRKSKKLGGYLITTKRQKDFWLLA
        AIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  AIAVRRKSKKLGGYLITTKRQKDFWLLA

KAG7013093.1 hypothetical protein SDJN02_25849, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-97100Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVEIEVAVVEGKEKGPVIVE
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVEIEVAVVEGKEKGPVIVE
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVEIEVAVVEGKEKGPVIVE

Query:  EARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
        EARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  EARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA

XP_022945318.1 uncharacterized protein LOC111449594 [Cucurbita moschata]1.0e-9079.74Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------

Query:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
                                   VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
Subjt:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA

Query:  IAVRRKSKKLGGYLITTKRQKDFWLLA
        IAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  IAVRRKSKKLGGYLITTKRQKDFWLLA

XP_022968363.1 universal stress protein PHOS32-like [Cucurbita maxima]5.6e-8476.42Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------
        MGKTGGKLPSF LNRIRSHVRVPIQSKPDSVSVKTGER GGEF ESNGG KPALGIGRKIMIVVDSTIEAEGALQWALSNT                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------

Query:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC
                                   VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLM+WAGHR GGGGD  GGGTVEYCIQNASC
Subjt:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC

Query:  MAIAVRRKSKKLGGYLITTKRQKDFWLLA
        MAIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  MAIAVRRKSKKLGGYLITTKRQKDFWLLA

XP_023542429.1 uncharacterized protein LOC111802335 [Cucurbita pepo subsp. pepo]1.3e-8878.51Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------

Query:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD-GGGTVEYCIQNASCM
                                   VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKR TTWRLLM+WAGHRWGGGGD GGGTVEYCIQNASCM
Subjt:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD-GGGTVEYCIQNASCM

Query:  AIAVRRKSKKLGGYLITTKRQKDFWLLA
        AIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  AIAVRRKSKKLGGYLITTKRQKDFWLLA

TrEMBL top hitse value%identityAlignment
A0A0A0KQW0 Usp domain-containing protein1.4e-6463.87Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGE--FDESNG------GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVEI--------
        MGKTG KLPSFCLNRIR HVRVPIQSKPD VSVKTG  K  +   DE N         K  +GIGRKIMIVVDSTIEAEGAL WALS+TV+I        
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGE--FDESNG------GGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVEI--------

Query:  ---------------------------------------EVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW-GGGGDGGGTV
                                               EV VVE GKEKG VIVEEARK+ ASLLVLGQ KKRSTTWRLLM+WAG RW GGGG  GG V
Subjt:  ---------------------------------------EVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW-GGGGDGGGTV

Query:  EYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
        EYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  EYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA

A0A6J1G0H0 uncharacterized protein LOC1114495945.1e-9179.74Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------
        MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------

Query:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
                                   VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
Subjt:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA

Query:  IAVRRKSKKLGGYLITTKRQKDFWLLA
        IAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  IAVRRKSKKLGGYLITTKRQKDFWLLA

A0A6J1GXX9 uncharacterized protein LOC1114582167.1e-6967.38Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNG-GGKPALGIGRKIMIVVDSTIEAEGALQWALSNT------------------
        MGKTG KLPSFCLNRIR HVRVPIQSK DSVSVKTG +K  +  E    G KP LG GRKIMIVVDSTIEAEGALQWALS+T                  
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNG-GGKPALGIGRKIMIVVDSTIEAEGALQWALSNT------------------

Query:  -----------------------------VEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW---GGGGDGGGTVEYCIQ
                                     VE+EVAVVE GKEKG VIVEEARKQ ASLLVLGQ KKRSTTWRLLM+WAGHRW   GGGG GGG VEYCIQ
Subjt:  -----------------------------VEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRW---GGGGDGGGTVEYCIQ

Query:  NASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
        NASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  NASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA

A0A6J1HXT3 universal stress protein PHOS32-like2.7e-8476.42Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------
        MGKTGGKLPSF LNRIRSHVRVPIQSKPDSVSVKTGER GGEF ESNGG KPALGIGRKIMIVVDSTIEAEGALQWALSNT                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------------

Query:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC
                                   VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLM+WAGHR GGGGD  GGGTVEYCIQNASC
Subjt:  ---------------------------VEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGD--GGGTVEYCIQNASC

Query:  MAIAVRRKSKKLGGYLITTKRQKDFWLLA
        MAIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  MAIAVRRKSKKLGGYLITTKRQKDFWLLA

A0A6J1IT49 uncharacterized protein LOC1114784231.6e-6866.67Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERK------GGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------
        MGKTG KLPSFCLNRIR HVRVPIQSK DSVSVKTG +K      GGE      G KP LG GRKIMIV+DSTIEAEGALQWALS+T             
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERK------GGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNT-------------

Query:  ----------------------------------VEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGG--DGGGTVE
                                          VE+EVAVVE GKEKG VIVEEARKQ ASLLVLGQ KKRSTTWRLLM+WAGHRWGGGG   GGG VE
Subjt:  ----------------------------------VEIEVAVVE-GKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGG--DGGGTVE

Query:  YCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
        YCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
Subjt:  YCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G69080.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.9e-2737.34Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSN-------------------
        MGK G     F ++R+R++VRV    +P + + + G         S+     ++ I GR+I++VVDS  EA+ AL W LS+                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSN-------------------

Query:  ----------------------------------------TVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGG
                                                 V+ EV  V+G EKGP IV+EAR++ ASLLVLGQKK+ + TWRLLM+WA           
Subjt:  ----------------------------------------TVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGG

Query:  GTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
          VEYCI N+ CMAIAVR++ KKLGGY +TTKR KDFWLLA
Subjt:  GTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA

AT1G69080.2 Adenine nucleotide alpha hydrolases-like superfamily protein9.3e-2939.65Show/hide
Query:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSN-------------------
        MGK G     F ++R+R++VRV    +P + + + G         S+     ++ I GR+I++VVDS  EA+ AL W LS+                   
Subjt:  MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGI-GRKIMIVVDSTIEAEGALQWALSN-------------------

Query:  --------------------------TVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA
                                   V+ EV  V+G EKGP IV+EAR++ ASLLVLGQKK+ + TWRLLM+WA             VEYCI N+ CMA
Subjt:  --------------------------TVEIEVAVVEGKEKGPVIVEEARKQGASLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMA

Query:  IAVRRKSKKLGGYLITTKRQKDFWLLA
        IAVR++ KKLGGY +TTKR KDFWLLA
Subjt:  IAVRRKSKKLGGYLITTKRQKDFWLLA

AT2G03720.1 Adenine nucleotide alpha hydrolases-like superfamily protein3.3e-2642.69Show/hide
Query:  MIVVDSTIEAEGALQWALSN-------------------------------------------------TVEIEVAVVE-GKEKGPVIVEEARKQGASLL
        M+VVD+T + + ALQWAL++                                                  V+ E+ VVE  +EKG  IVEE++KQGA +L
Subjt:  MIVVDSTIEAEGALQWALSN-------------------------------------------------TVEIEVAVVE-GKEKGPVIVEEARKQGASLL

Query:  VLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
        VLGQ +KR++ WR++  W       GG GGG VEYCI N+ CMAIAVR+KS   GGYLITTKR KDFWLLA
Subjt:  VLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA

AT3G03290.1 Adenine nucleotide alpha hydrolases-like superfamily protein4.8e-2539.08Show/hide
Query:  GRKIMIVVDSTIEAEGALQWALSNT-----------------------------------------------VEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++M+VVD  I + GAL+WAL +T                                               +E+E+  ++G  KEKG  IVEEA++Q  
Subjt:  GRKIMIVVDSTIEAEGALQWALSNT-----------------------------------------------VEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
        SLLV+G K+K+   WRLL  W    W       GT++YC++ ASCM IAV+ K++KLGGYLITTKR K+FWLLA
Subjt:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA

AT5G17390.1 Adenine nucleotide alpha hydrolases-like superfamily protein9.6e-2639.66Show/hide
Query:  GRKIMIVVDSTIEAEGALQWALSNT-----------------------------------------------VEIEVAVVEG--KEKGPVIVEEARKQGA
        G ++M+VVD  + + GAL+WA+++T                                               +E+E+  +EG  K+KG  IVEE++KQ  
Subjt:  GRKIMIVVDSTIEAEGALQWALSNT-----------------------------------------------VEIEVAVVEG--KEKGPVIVEEARKQGA

Query:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA
        SLLV+GQ+KK    WRLL  WA  R  G     G ++YC++NASCM IAV+ K++KLGGYLITTKR K+FWLLA
Subjt:  SLLVLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGAAAACCGGCGGAAAGCTGCCGAGTTTTTGCCTGAACCGGATCCGGTCTCATGTTCGTGTGCCGATTCAGTCCAAACCGGACTCTGTTTCTGTGAAAACAGGGGA
GAGGAAGGGCGGGGAGTTTGATGAGAGTAATGGTGGAGGTAAGCCGGCGTTGGGAATTGGAAGGAAGATAATGATTGTGGTTGATTCCACCATTGAAGCTGAAGGAGCTC
TTCAATGGGCGCTCTCAAATACGGTGGAAATTGAAGTAGCAGTGGTGGAAGGGAAGGAGAAAGGGCCAGTGATTGTGGAAGAAGCAAGAAAGCAAGGGGCATCGTTGCTG
GTTTTGGGGCAGAAGAAGAAACGGTCGACGACATGGCGGCTTCTGATGATCTGGGCCGGCCACCGGTGGGGTGGCGGCGGAGACGGCGGCGGAACGGTGGAGTATTGTAT
TCAGAATGCGAGCTGCATGGCGATTGCAGTGAGAAGGAAGAGCAAGAAATTGGGTGGTTATTTGATCACAACAAAACGCCAAAAGGATTTCTGGCTTTTAGCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGGGAAAACCGGCGGAAAGCTGCCGAGTTTTTGCCTGAACCGGATCCGGTCTCATGTTCGTGTGCCGATTCAGTCCAAACCGGACTCTGTTTCTGTGAAAACAGGGGA
GAGGAAGGGCGGGGAGTTTGATGAGAGTAATGGTGGAGGTAAGCCGGCGTTGGGAATTGGAAGGAAGATAATGATTGTGGTTGATTCCACCATTGAAGCTGAAGGAGCTC
TTCAATGGGCGCTCTCAAATACGGTGGAAATTGAAGTAGCAGTGGTGGAAGGGAAGGAGAAAGGGCCAGTGATTGTGGAAGAAGCAAGAAAGCAAGGGGCATCGTTGCTG
GTTTTGGGGCAGAAGAAGAAACGGTCGACGACATGGCGGCTTCTGATGATCTGGGCCGGCCACCGGTGGGGTGGCGGCGGAGACGGCGGCGGAACGGTGGAGTATTGTAT
TCAGAATGCGAGCTGCATGGCGATTGCAGTGAGAAGGAAGAGCAAGAAATTGGGTGGTTATTTGATCACAACAAAACGCCAAAAGGATTTCTGGCTTTTAGCTTGA
Protein sequenceShow/hide protein sequence
MGKTGGKLPSFCLNRIRSHVRVPIQSKPDSVSVKTGERKGGEFDESNGGGKPALGIGRKIMIVVDSTIEAEGALQWALSNTVEIEVAVVEGKEKGPVIVEEARKQGASLL
VLGQKKKRSTTWRLLMIWAGHRWGGGGDGGGTVEYCIQNASCMAIAVRRKSKKLGGYLITTKRQKDFWLLA