; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg24185 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg24185
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionUvrABC system protein C
Genome locationCarg_Chr06:6161402..6162288
RNA-Seq ExpressionCarg24185
SyntenyCarg24185
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597014.1 hypothetical protein SDJN03_10194, partial [Cucurbita argyrosperma subsp. sororia]6.8e-106100Show/hide
Query:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
        MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
Subjt:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS

Query:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQ
        TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQ
Subjt:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQ

KAG7028490.1 hypothetical protein SDJN02_09671, partial [Cucurbita argyrosperma subsp. argyrosperma]8.8e-130100Show/hide
Query:  MKQIESVKDKKRRPSGRGAIAIIIIIISWTAHCPYMRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPL
        MKQIESVKDKKRRPSGRGAIAIIIIIISWTAHCPYMRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPL
Subjt:  MKQIESVKDKKRRPSGRGAIAIIIIIISWTAHCPYMRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPL

Query:  LHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVR
        LHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVR
Subjt:  LHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVR

Query:  TVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYGLK
        TVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYGLK
Subjt:  TVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYGLK

XP_022962561.1 uncharacterized protein LOC111462928 [Cucurbita moschata]1.6e-10799.01Show/hide
Query:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
        MRIQELAGMGNCVLKVAGERGD+VKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
Subjt:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS

Query:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
        TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKL YG
Subjt:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG

Query:  LK
        LK
Subjt:  LK

XP_022975258.1 uncharacterized protein LOC111474379 [Cucurbita maxima]1.5e-10598.02Show/hide
Query:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
        MRIQEL  MGNCVLKVAGER DVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELH GQVYYLLPLNPYKPSAKIEADAPSVVS
Subjt:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS

Query:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
        TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
Subjt:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG

Query:  LK
        LK
Subjt:  LK

XP_023540170.1 uncharacterized protein LOC111800623 [Cucurbita pepo subsp. pepo]2.0e-10598.02Show/hide
Query:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
        MRIQEL GMGNCVLKVAGER DVVKVVTVDGGIMELYAPITAECITGEYPGHAIF SRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
Subjt:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS

Query:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
        TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDK QYG
Subjt:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG

Query:  LK
        LK
Subjt:  LK

TrEMBL top hitse value%identityAlignment
A0A5D3DB01 Uncharacterized protein2.7e-6065.5Show/hide
Query:  MGNCVLKVAGERG------DVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTP
        MGNC+LK AG  G       +VKVVTV GGIMELY PITAECITGEYPGHAIFKSRSIFSE L HK+EL  GQVYYLLPLN Y P+        SV+STP
Subjt:  MGNCVLKVAGERG------DVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTP

Query:  YRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYGLK
        YRMST ESQ         + +  +FPKY++ GVWKV LVICP+QLS+IL Q NRT+ELI+NVRTVAKCGN + SA++SDHSSVAGSWK    DK  YG K
Subjt:  YRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYGLK

A0A6J1CTQ7 uncharacterized protein LOC1110142471.7e-7074.37Show/hide
Query:  MGNCVLKVAGE--RGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTPYRMS
        MGNCVLK      R DVVKVVT DGGIMEL+AP+TAECITGEYPGHAIFKSRSIFSEPLLHK+ELHAG+VYYLLPLNPYKPSA   ADA SV+STPYRMS
Subjt:  MGNCVLKVAGE--RGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTPYRMS

Query:  TCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGS---AMDKLQYGLK
        T            +E EAEVFPKYSS+GVWKVKLVI PEQLS+IL   +RTEELIENVRTVAKCGNGVGSA +SDHSSVA SWKGS   +  +  YG K
Subjt:  TCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGS---AMDKLQYGLK

A0A6J1HDL7 uncharacterized protein LOC1114629287.8e-10899.01Show/hide
Query:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
        MRIQELAGMGNCVLKVAGERGD+VKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
Subjt:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS

Query:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
        TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKL YG
Subjt:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG

Query:  LK
        LK
Subjt:  LK

A0A6J1IAU7 uncharacterized protein LOC1114728893.6e-10597.52Show/hide
Query:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
        MRIQEL  MGNCVLKVAGER DVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELH GQVYYLLPLNPYKPSAKIEADAPSVVS
Subjt:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS

Query:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
        TPYRMSTCESQNAVAKKRAAE EAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
Subjt:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG

Query:  LK
        LK
Subjt:  LK

A0A6J1IIP8 uncharacterized protein LOC1114743797.3e-10698.02Show/hide
Query:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS
        MRIQEL  MGNCVLKVAGER DVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELH GQVYYLLPLNPYKPSAKIEADAPSVVS
Subjt:  MRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVS

Query:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
        TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG
Subjt:  TPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSVAGSWKGSAMDKLQYG

Query:  LK
        LK
Subjt:  LK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G64700.1 unknown protein2.5e-2133.16Show/hide
Query:  MGNCVLKVAGERGD--VVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLP------LNPYKPSAKIEADAPSVVS
        MGNC+    G+  +  ++KV+  DGG++E Y+P+TA  ++  + GHA+F +  +  +PL H   L  GQ YYL P      L  +  S  + +++ S+ +
Subjt:  MGNCVLKVAGERGD--VVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQVYYLLP------LNPYKPSAKIEADAPSVVS

Query:  -TPYRMS-------------TCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDH
         TPYRMS                S+N+  + R  E +       S   +WKV L+I  E+L +IL +  RT ELIE+VR VAK      +++SS++
Subjt:  -TPYRMS-------------TCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDH

AT2G01340.1 Encodes a protein whose expression is responsive to nematode infection.1.2e-0424.85Show/hide
Query:  KVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSI-----FSEPLLHKDELHAGQVYYLL-PLNPYKPS-------------------AKIEADAPSVV
        KV+ +DG   +L  P+TAE +  ++PGH +  S S+      ++PL  K  L A ++Y+++ P+    P                    A+  +   S++
Subjt:  KVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSI-----FSEPLLHKDELHAGQVYYLL-PLNPYKPS-------------------AKIEADAPSVV

Query:  STPYRMSTCESQNAV--AKKRAAEGEAEVFPKYSST---GVWKVKLVICPEQLSEILKQGNRTEE
          P   +T E + AV   K R  + E E   K  +T      K+  +   +Q  E   Q  R +E
Subjt:  STPYRMSTCESQNAV--AKKRAAEGEAEVFPKYSST---GVWKVKLVICPEQLSEILKQGNRTEE

AT3G61920.1 unknown protein3.3e-3446.23Show/hide
Query:  MGNCVLKVAG-------ERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSI--FSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVV
        MGNCV K  G       +   ++KVVT +GG+MEL+ PI AE IT E+PGH I  S S+   S PLLH +EL  G +YYLLPL+    +A  + D+   +
Subjt:  MGNCVLKVAG-------ERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSI--FSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVV

Query:  STPYRMSTCESQNAVAKKRAAEGEAEVFPKYS--STGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAK-----CGNGVGSAASSDHSSVAGSWKG
        STPYRMS               G+  +    S    GVWKV+LVI PEQL+EIL +   TE L+E+VRTVAK     CG GV S A+SD  SV  S+KG
Subjt:  STPYRMSTCESQNAVAKKRAAEGEAEVFPKYS--STGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAK-----CGNGVGSAASSDHSSVAGSWKG

AT5G37840.1 BEST Arabidopsis thaliana protein match is: plastid movement impaired 2 (TAIR:AT1G66480.1)7.2e-0522.99Show/hide
Query:  MGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRS-----IFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTPY
        MGN ++     R + VKV+ +DG I  L  P+TA   T EYPG  +  S +     + ++PL     L     Y+L+ L P     K+          PY
Subjt:  MGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRS-----IFSEPLLHKDELHAGQVYYLLPLNPYKPSAKIEADAPSVVSTPY

Query:  RMSTCESQNAVAKKR-------------AAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENV
        R     + +  AK+R                  ++V       G  +V+L +   Q+++++++ +   E+   +
Subjt:  RMSTCESQNAVAKKR-------------AAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACAAATAGAGTCTGTAAAAGACAAAAAGCGAAGGCCATCGGGCCGCGGGGCCATCGCCATCATCATCATCATCATCTCCTGGACTGCTCACTGCCCATATATGAG
GATTCAGGAGCTTGCGGGTATGGGAAATTGCGTGTTGAAAGTCGCCGGAGAAAGGGGGGATGTGGTGAAAGTGGTGACTGTCGACGGCGGAATTATGGAGCTTTATGCGC
CGATTACGGCGGAGTGCATAACAGGGGAATATCCCGGCCACGCCATTTTTAAGAGCCGAAGTATATTCTCGGAGCCCCTTCTTCATAAGGATGAGCTTCACGCGGGTCAA
GTGTATTACCTTCTCCCGCTTAACCCTTACAAGCCCTCTGCTAAAATCGAGGCGGACGCGCCCAGCGTTGTTTCCACGCCGTATCGGATGTCGACGTGCGAGTCGCAGAA
TGCGGTTGCGAAGAAGCGGGCGGCGGAGGGGGAGGCGGAGGTGTTTCCGAAGTATAGTAGTACGGGCGTTTGGAAGGTGAAGCTGGTGATTTGCCCGGAACAGCTGTCGG
AGATTTTGAAGCAGGGTAATCGTACGGAGGAGTTGATTGAGAACGTGAGGACGGTGGCTAAGTGTGGAAACGGCGTCGGATCGGCTGCTAGCTCCGATCATTCCAGTGTG
GCAGGGAGCTGGAAGGGATCCGCCATGGATAAGCTCCAGTATGGACTGAAATAA
mRNA sequenceShow/hide mRNA sequence
ATGAAACAAATAGAGTCTGTAAAAGACAAAAAGCGAAGGCCATCGGGCCGCGGGGCCATCGCCATCATCATCATCATCATCTCCTGGACTGCTCACTGCCCATATATGAG
GATTCAGGAGCTTGCGGGTATGGGAAATTGCGTGTTGAAAGTCGCCGGAGAAAGGGGGGATGTGGTGAAAGTGGTGACTGTCGACGGCGGAATTATGGAGCTTTATGCGC
CGATTACGGCGGAGTGCATAACAGGGGAATATCCCGGCCACGCCATTTTTAAGAGCCGAAGTATATTCTCGGAGCCCCTTCTTCATAAGGATGAGCTTCACGCGGGTCAA
GTGTATTACCTTCTCCCGCTTAACCCTTACAAGCCCTCTGCTAAAATCGAGGCGGACGCGCCCAGCGTTGTTTCCACGCCGTATCGGATGTCGACGTGCGAGTCGCAGAA
TGCGGTTGCGAAGAAGCGGGCGGCGGAGGGGGAGGCGGAGGTGTTTCCGAAGTATAGTAGTACGGGCGTTTGGAAGGTGAAGCTGGTGATTTGCCCGGAACAGCTGTCGG
AGATTTTGAAGCAGGGTAATCGTACGGAGGAGTTGATTGAGAACGTGAGGACGGTGGCTAAGTGTGGAAACGGCGTCGGATCGGCTGCTAGCTCCGATCATTCCAGTGTG
GCAGGGAGCTGGAAGGGATCCGCCATGGATAAGCTCCAGTATGGACTGAAATAA
Protein sequenceShow/hide protein sequence
MKQIESVKDKKRRPSGRGAIAIIIIIISWTAHCPYMRIQELAGMGNCVLKVAGERGDVVKVVTVDGGIMELYAPITAECITGEYPGHAIFKSRSIFSEPLLHKDELHAGQ
VYYLLPLNPYKPSAKIEADAPSVVSTPYRMSTCESQNAVAKKRAAEGEAEVFPKYSSTGVWKVKLVICPEQLSEILKQGNRTEELIENVRTVAKCGNGVGSAASSDHSSV
AGSWKGSAMDKLQYGLK