; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021916 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021916
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionkinesin-related protein 6
Genome locationtig00153841:1174637..1175454
RNA-Seq ExpressionSgr021916
SyntenySgr021916
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576126.1 hypothetical protein SDJN03_26765, partial [Cucurbita argyrosperma subsp. sororia]1.2e-4149.63Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSSSSSSSSSS    SSS GDF +V N LP A+LAL+S+L ++DREVLAFMMRRSMETSS SS  S  K SKRA KK+   RASS S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPAFNC-------------GLRRTTGHRR-EVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPL--IDEGSVVPILPSNDVAPATSLS
          +H+PP+F C                    H+  E  EE+   GE+  +N    +   IG           +PPL  + E   +PI   +  AP TS +
Subjt:  VSIHAPPAFNC-------------GLRRTTGHRR-EVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPL--IDEGSVVPILPSNDVAPATSLS

Query:  GDAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
         D           +  +G G+P    ++ A +I+PSP PSN KGLARKVWPDVLGLFNSRLWSLWGP+
Subjt:  GDAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

KAG6592803.1 hypothetical protein SDJN03_12279, partial [Cucurbita argyrosperma subsp. sororia]2.2e-4350.55Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSSSSSSS           GDF DV N LPAAILAL++VL ++DREVLAFMMRRSMETS+PSS+ S+ K SKR SKK+G  RA+S S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSA--AAGTSPPLIDEGSVVPI-LPSNDVAPATSL
        V +HAPP+ +C         R  +    E+               E+ G+  +  R   IG          +  SPP + E  V+P+ +    VAPATS+
Subjt:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSA--AAGTSPPLIDEGSVVPI-LPSNDVAPATSL

Query:  SG--DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        SG  +A +GS           G  PR +A   A V++PS  PSNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  SG--DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

KAG7020289.1 hypothetical protein SDJN02_16972, partial [Cucurbita argyrosperma subsp. argyrosperma]2.5e-4249.44Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSSSSSSS           GDF DV N LPAAILAL++VL ++DREVLAFMMRRSMETS+PSS+ S+ K SKR SKK+G  RA+S S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSAAAGTSPPLIDEGSVVPI-LPSNDVAPATSLSG
        V +HAPP+ +C         R  +    E+               E+ G+  +  R   IG          +    +    V+P+ +    VAPATS+SG
Subjt:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSAAAGTSPPLIDEGSVVPI-LPSNDVAPATSLSG

Query:  --DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
          +A +GS           G  PR +A   A V++PS  PSNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  --DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

KGN46456.2 hypothetical protein Csa_005246 [Cucumis sativus]2.9e-4350.74Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSY-GDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-
        MKIKNKGKVHPSPSSSSSSSSSSSS S+SSS  G+F DVLN LP AI AL+SVL ++DREVLAFMMRRSMETSSPSS+ S KK SKR SKK+   RA S 
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSY-GDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-

Query:  SVSIHAPPA---FNCGLR-----------RTTGHRREVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSVVPILPSNDVAPATSLSG
        S  +HAPP+   F+C +                   E  EE+  +GE+  +N    R   I       +    SPPL+      P+      + A + SG
Subjt:  SVSIHAPPA---FNCGLR-----------RTTGHRREVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSVVPILPSNDVAPATSLSG

Query:  DAVDGSNLKEADKPVDGGGDPRLKAEDD-----AAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
              +++  +K  DG  DP    ED      A V+IPSP P++HKG ARKVWPDVLGLFNSRLWSLW PN
Subjt:  DAVDGSNLKEADKPVDGGGDPRLKAEDD-----AAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

XP_022960393.1 uncharacterized protein LOC111461129 [Cucurbita moschata]3.8e-4350.55Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSS SSSS           GDF DV N LPAAILAL++VL ++DREVLAFMMRRSMETS+PSS+ S+ K SKR SKK+G  RA+S S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSA--AAGTSPPLIDEGSVVPI-LPSNDVAPATSL
        V +HAPP+ +C         R  +    E+               E+ G+  +  R   IG          A  SPP + E  V+P+ +    VAPATS+
Subjt:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSA--AAGTSPPLIDEGSVVPI-LPSNDVAPATSL

Query:  SG--DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        SG  +A +GS           G  PR +A   A V++PS  PSNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  SG--DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

TrEMBL top hitse value%identityAlignment
A0A1S3CC21 uncharacterized protein LOC1034990741.6e-3948.87Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSSSSSSSS          G+F DVLN LP AI ALVSVL ++DREVLAFMMRRSMETSSPSS+ S KK SKR SKK+   RA S S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPA---FNCGLR-----------RTTGHRREVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSVVPILPSNDVAPATSLSGD
          +HAPP+   F+C +                   E  EE+  +GE+  +N    R   IG      +    SPPL+      P+      + A + SG 
Subjt:  VSIHAPPA---FNCGLR-----------RTTGHRREVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSVVPILPSNDVAPATSLSGD

Query:  AVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        A      K  + P     +        A V+IPSP P+ HKGLARKVWPDVLGLFNSRLWSLW PN
Subjt:  AVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A5D3DM01 Uncharacterized protein1.1e-4050Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSSSSSSSSSS        G+F DVLN LP AI ALVSVL ++DREVLAFMMRRSMETSSPSS+ S K+ SKR SKK+   RA S S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPA---FNCGLR-----------RTTGHRREVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPL--IDEGSVVPILPSNDVAPATSLS
          +HAPP+   F+C +                   E  EE+  +GE+  +N    R   IG      +    SPPL  + E   + I   +  APATS S
Subjt:  VSIHAPPA---FNCGLR-----------RTTGHRREVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPL--IDEGSVVPILPSNDVAPATSLS

Query:  G----DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
             +   G   +  ++ V+ GG         A V+IPSP P+ HKGLARKVWPDVLGLFNSRLWSLW PN
Subjt:  G----DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A6J1GQW4 uncharacterized protein LOC1114562855.5e-4048.13Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSSSSSSSS          GDF +V N LP A+LAL+S+L ++DREVLAFMMRRSMETSS SS  S  K SKRA KK+   RASS S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPAFNC-------------GLRRTTGHRR-EVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSV--VPILPSNDVAPATSLS
          +H+PP+F C                    H+  E  EE+   GE+  +N    +   IG           +PPL     V  +PI   +  AP TS S
Subjt:  VSIHAPPAFNC-------------GLRRTTGHRR-EVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSV--VPILPSNDVAPATSLS

Query:  GDAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
         D           +  +G G+P    ++ A +I+PSP PSN KGLARKVWPDVLGLFNSRLWSLWGP+
Subjt:  GDAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A6J1H8S8 uncharacterized protein LOC1114611291.8e-4350.55Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSS SSSS           GDF DV N LPAAILAL++VL ++DREVLAFMMRRSMETS+PSS+ S+ K SKR SKK+G  RA+S S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSA--AAGTSPPLIDEGSVVPI-LPSNDVAPATSL
        V +HAPP+ +C         R  +    E+               E+ G+  +  R   IG          A  SPP + E  V+P+ +    VAPATS+
Subjt:  VSIHAPPAFNC-------GLRRTTGHRREV-------------REERQGEEERENRPPVIGEACEYSA--AAGTSPPLIDEGSVVPI-LPSNDVAPATSL

Query:  SG--DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN
        SG  +A +GS           G  PR +A   A V++PS  PSNHKGLARKVWPDVLGLFNSRLWSLWGPN
Subjt:  SG--DAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGPN

A0A6J1JUQ5 uncharacterized protein LOC1114880302.7e-3949.06Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S
        MKIKNKGKVHPSPSSSSS              GDF +V N LP AILAL+SVL  +DREVLAFMMRRSMETSS SS  S  K SKRA KK+   RASS S
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASS-S

Query:  VSIHAPPAFNC-------------GLRRTTGHRR-EVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSVVPILPSNDVAPATSLSGD
          +H+PP+F C                    H+  E  EE+   GE+  +N    +   IG           +PPL  E   +PI   +  AP TS    
Subjt:  VSIHAPPAFNC-------------GLRRTTGHRR-EVREER--QGEEEREN----RPPVIGEACEYSAAAGTSPPLIDEGSVVPILPSNDVAPATSLSGD

Query:  AVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGP
           GS+  EA    +G G+P    ++ A VI+PSP PSN KGLARKVWPDVLGLFNSRLWSLWGP
Subjt:  AVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHKGLARKVWPDVLGLFNSRLWSLWGP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G24270.1 unknown protein3.0e-0647.13Show/hide
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRA
        MK+  KGKVHPSP   SSSSS+           D L V   L +AIL LVSVL  ED EVLA+++ RS+ T++  S +  KK S +A
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRA

AT5G13090.1 unknown protein4.5e-1833.22Show/hide
Query:  MKIKNKGKVHPS-PSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASSS
        MK+K KGKV+PS P    SSSSSSSS   +    D L VL  LPA IL LVSVL  E+REVLA+++ R    S   ++ S  K  K+++K         S
Subjt:  MKIKNKGKVHPS-PSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASSS

Query:  VSIHAPPAFNC---------GLRRTTGHRREV-------REERQGEEERENRPPV------------IGEACEYSAAAGT------SPPLIDEGSVVPIL
           H PP F+C           R  +   RE+        E   GEE   +R               + ++    A   T      S P+++  +   + 
Subjt:  VSIHAPPAFNC---------GLRRTTGHRREV-------REERQGEEERENRPPV------------IGEACEYSAAAGT------SPPLIDEGSVVPIL

Query:  PSNDVAPATSLS-GDAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTP--SNHKGLARKVWPDVLGLFNSRLWSLWGPN
         S+ V+    LS  +  +G    E     DG      + E+   V+ P+     + HKGLARKV PDVLGLF+S  W LW PN
Subjt:  PSNDVAPATSLS-GDAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTP--SNHKGLARKVWPDVLGLFNSRLWSLWGPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGATAAAGAACAAAGGTAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTGTTTCTGCTTCTTCTTCTTATGGGGACTTTCTCGA
CGTCTTGAACTGTCTGCCGGCGGCGATTCTGGCGTTGGTCTCCGTTCTGGGGCTCGAGGATCGGGAGGTTTTGGCTTTCATGATGAGGAGGTCGATGGAGACTTCATCGC
CGTCGTCTGCTGAATCTGATAAGAAAGCTTCGAAGAGGGCTTCCAAGAAAGCCGGCGGTGGACGTGCCAGCAGTAGTGTGAGTATTCACGCGCCGCCGGCTTTTAACTGT
GGCCTTCGAAGAACAACTGGCCACCGGCGAGAAGTCCGGGAAGAACGCCAAGGGGAAGAGGAGAGAGAAAATCGGCCGCCGGTCATCGGAGAAGCCTGTGAATACTCCGC
CGCCGCCGGAACTTCCCCTCCACTGATTGATGAAGGTTCTGTCGTTCCGATTCTTCCGAGTAATGACGTGGCTCCGGCGACATCTCTGAGCGGCGACGCCGTTGACGGAA
GCAACTTGAAAGAGGCGGATAAGCCGGTGGATGGAGGTGGCGACCCGCGGCTGAAGGCTGAGGATGACGCGGCGGTGATTATTCCGTCGCCGACGCCGAGCAACCACAAG
GGTTTGGCCCGGAAGGTATGGCCGGATGTGTTAGGGTTATTCAATTCCCGTTTATGGAGTCTTTGGGGTCCAAATTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAGATAAAGAACAAAGGTAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTGTTTCTGCTTCTTCTTCTTATGGGGACTTTCTCGA
CGTCTTGAACTGTCTGCCGGCGGCGATTCTGGCGTTGGTCTCCGTTCTGGGGCTCGAGGATCGGGAGGTTTTGGCTTTCATGATGAGGAGGTCGATGGAGACTTCATCGC
CGTCGTCTGCTGAATCTGATAAGAAAGCTTCGAAGAGGGCTTCCAAGAAAGCCGGCGGTGGACGTGCCAGCAGTAGTGTGAGTATTCACGCGCCGCCGGCTTTTAACTGT
GGCCTTCGAAGAACAACTGGCCACCGGCGAGAAGTCCGGGAAGAACGCCAAGGGGAAGAGGAGAGAGAAAATCGGCCGCCGGTCATCGGAGAAGCCTGTGAATACTCCGC
CGCCGCCGGAACTTCCCCTCCACTGATTGATGAAGGTTCTGTCGTTCCGATTCTTCCGAGTAATGACGTGGCTCCGGCGACATCTCTGAGCGGCGACGCCGTTGACGGAA
GCAACTTGAAAGAGGCGGATAAGCCGGTGGATGGAGGTGGCGACCCGCGGCTGAAGGCTGAGGATGACGCGGCGGTGATTATTCCGTCGCCGACGCCGAGCAACCACAAG
GGTTTGGCCCGGAAGGTATGGCCGGATGTGTTAGGGTTATTCAATTCCCGTTTATGGAGTCTTTGGGGTCCAAATTAG
Protein sequenceShow/hide protein sequence
MKIKNKGKVHPSPSSSSSSSSSSSSVSASSSYGDFLDVLNCLPAAILALVSVLGLEDREVLAFMMRRSMETSSPSSAESDKKASKRASKKAGGGRASSSVSIHAPPAFNC
GLRRTTGHRREVREERQGEEERENRPPVIGEACEYSAAAGTSPPLIDEGSVVPILPSNDVAPATSLSGDAVDGSNLKEADKPVDGGGDPRLKAEDDAAVIIPSPTPSNHK
GLARKVWPDVLGLFNSRLWSLWGPN