; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G002210 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G002210
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionGPI transamidase component Gpi16 subunit family protein isoform 1
Genome locationCmo_Chr08:1330046..1332952
RNA-Seq ExpressionCmoCh08G002210
SyntenyCmoCh08G002210
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6593003.1 hypothetical protein SDJN03_12479, partial [Cucurbita argyrosperma subsp. sororia]5.1e-13399.18Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLK+MNSGSKRIDSP+LKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
        AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
Subjt:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK

XP_022959564.1 uncharacterized protein LOC111460597 isoform X1 [Cucurbita moschata]7.9e-134100Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
        AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
Subjt:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK

XP_022959565.1 uncharacterized protein LOC111460597 isoform X2 [Cucurbita moschata]2.2e-115100Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPR
        AGREINSTANGFPR
Subjt:  AGREINSTANGFPR

XP_023004810.1 uncharacterized protein LOC111498000 isoform X1 [Cucurbita maxima]5.9e-12996.72Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFC SLSTIAPFVAS DQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEED NLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPV KCSPNDY+RKQHMEEV+LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
        AGREIN+TANGFPRTA SCSYNSKNMKRMREDEALVPAFCKRTK
Subjt:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK

XP_023513937.1 uncharacterized protein LOC111778382 isoform X1 [Cucurbita pepo subsp. pepo]1.4e-13097.95Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAH PQEIANGDCCEADEED NLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
        AGREIN+TAN FPRT YSCSYNSKNMKRMREDEALVPAFCKRTK
Subjt:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK

TrEMBL top hitse value%identityAlignment
A0A6J1CVB3 uncharacterized protein LOC1110145924.7e-8473.68Show/hide
Query:  RTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVDGKLSKIGVKRAHCPQE
        R IKL CPSLS IAPF+AS    IDIG+IAT FGL+PSTVKLNGHFLSRG DL+SSVTW SLLSFFSAKRLP G SD+D LVVDGKLSKIG+KRA   QE
Subjt:  RTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVDGKLSKIGVKRAHCPQE

Query:  IANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSDAGREINSTANGFPRTA
        I +G CCEADEEDANLN        NLVKNK+LK  + GSK +DS V KCSPN YKRKQ MEEVILLKKLKLNETKSG DELSD  + ++  AN  PR  
Subjt:  IANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSDAGREINSTANGFPRTA

Query:  YSCSYNSKNMKRMREDEALVPAFCKRTK
        YSCSYNSKNMKRMREDE LV AFCKRT+
Subjt:  YSCSYNSKNMKRMREDEALVPAFCKRTK

A0A6J1H6M8 uncharacterized protein LOC111460597 isoform X21.0e-115100Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPR
        AGREINSTANGFPR
Subjt:  AGREINSTANGFPR

A0A6J1H8F0 uncharacterized protein LOC111460597 isoform X13.8e-134100Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
        AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
Subjt:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK

A0A6J1KT59 uncharacterized protein LOC111498000 isoform X27.0e-11296.73Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFC SLSTIAPFVAS DQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEED NLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPV KCSPNDY+RKQHMEEV+LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPR
        AGREIN+TANGFPR
Subjt:  AGREINSTANGFPR

A0A6J1KVM1 uncharacterized protein LOC111498000 isoform X12.8e-12996.72Show/hide
Query:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
        MEIMMEKKKKDPGQELRTIKLFC SLSTIAPFVAS DQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD
Subjt:  MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVD

Query:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD
        GKLSKIGVKRAHCPQEIANGDCCEADEED NLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPV KCSPNDY+RKQHMEEV+LLKKLKLNETKSGFDELSD
Subjt:  GKLSKIGVKRAHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSD

Query:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK
        AGREIN+TANGFPRTA SCSYNSKNMKRMREDEALVPAFCKRTK
Subjt:  AGREINSTANGFPRTAYSCSYNSKNMKRMREDEALVPAFCKRTK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G07150.1 unknown protein7.1e-3243.04Show/hide
Query:  RTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSS-VTWNSLLSFFSAKRLPTGGSDDDALVVDGKLSKIGVKRAHCPQ
        R IKLFCPS+S I  +VA  D+ +D  +IA  FGLEPSTVKLNGHF+SRG DLV++ VTW SLL+FFSA+ L TG  + DAL+V GKLSK+G KRA    
Subjt:  RTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSS-VTWNSLLSFFSAKRLPTGGSDDDALVVDGKLSKIGVKRAHCPQ

Query:  EIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSDAGREINSTANGFPRT
                    ED   N      +  L+K K+LK   S     +S +  C+    KRK   E+   LKKLKLN      D    +G           +T
Subjt:  EIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSDAGREINSTANGFPRT

Query:  AYSCSYNSKN-MKRMREDEALVPAFCKRTK
           CS+ S N +KR RED+ +  A CK+ +
Subjt:  AYSCSYNSKN-MKRMREDEALVPAFCKRTK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATTATGATGGAGAAGAAGAAGAAGGATCCAGGCCAAGAGCTCAGAACTATCAAGCTATTCTGCCCTTCACTCTCCACTATTGCCCCATTCGTTGCATCGTTGGA
CCAGTGCATCGATATCGGCTCCATAGCTACCATTTTCGGCCTCGAGCCCTCAACGGTGAAACTCAATGGTCACTTCCTCAGTCGAGGCCTCGATCTCGTCTCCTCTGTTA
CTTGGAACTCTCTTCTCTCTTTCTTCTCTGCTAAACGGCTGCCTACTGGAGGCTCCGATGATGATGCACTTGTTGTTGATGGAAAGCTCTCTAAAATTGGCGTCAAGAGA
GCTCACTGCCCTCAGGAAATTGCAAACGGGGATTGTTGCGAGGCTGATGAAGAGGATGCCAATCTTAATGGTGGAAGGCTAAAACCAGAAAGCAACCTGGTCAAGAATAA
AAGGTTGAAGCATATGAACTCAGGAAGCAAACGCATAGATTCTCCAGTATTGAAATGTAGTCCCAATGATTATAAAAGAAAACAACACATGGAAGAAGTTATCTTGCTCA
AGAAATTGAAGTTAAACGAAACTAAATCAGGTTTCGACGAATTATCCGATGCAGGCAGAGAAATAAACAGCACGGCCAATGGCTTCCCACGTACGGCATACTCGTGTAGC
TATAATAGTAAGAATATGAAAAGGATGAGAGAAGATGAGGCTCTTGTTCCTGCTTTCTGCAAGAGAACCAAATAA
mRNA sequenceShow/hide mRNA sequence
CGGGAAACTTCCAGAGACCATTTGGAAGGAAAACCCGCGAACAGTGTGTCGCCATCGCCGTTCCAAAAATGTCCATTCTCCATTTTCAGCTTCTTCTTCCCACAACAACC
AGTTCATCCAGCTTTCTGTTTCTGCTGCAATGGAGATTATGATGGAGAAGAAGAAGAAGGATCCAGGCCAAGAGCTCAGAACTATCAAGCTATTCTGCCCTTCACTCTCC
ACTATTGCCCCATTCGTTGCATCGTTGGACCAGTGCATCGATATCGGCTCCATAGCTACCATTTTCGGCCTCGAGCCCTCAACGGTGAAACTCAATGGTCACTTCCTCAG
TCGAGGCCTCGATCTCGTCTCCTCTGTTACTTGGAACTCTCTTCTCTCTTTCTTCTCTGCTAAACGGCTGCCTACTGGAGGCTCCGATGATGATGCACTTGTTGTTGATG
GAAAGCTCTCTAAAATTGGCGTCAAGAGAGCTCACTGCCCTCAGGAAATTGCAAACGGGGATTGTTGCGAGGCTGATGAAGAGGATGCCAATCTTAATGGTGGAAGGCTA
AAACCAGAAAGCAACCTGGTCAAGAATAAAAGGTTGAAGCATATGAACTCAGGAAGCAAACGCATAGATTCTCCAGTATTGAAATGTAGTCCCAATGATTATAAAAGAAA
ACAACACATGGAAGAAGTTATCTTGCTCAAGAAATTGAAGTTAAACGAAACTAAATCAGGTTTCGACGAATTATCCGATGCAGGCAGAGAAATAAACAGCACGGCCAATG
GCTTCCCACGTACGGCATACTCGTGTAGCTATAATAGTAAGAATATGAAAAGGATGAGAGAAGATGAGGCTCTTGTTCCTGCTTTCTGCAAGAGAACCAAATAAACACAT
GGCTTTCCCAAGCCAATCCCCACCTCATTTGTTCATAACTTTTATTTCTTCATAGTTCTCTTTCGCTTGTTGAGGTTTGTAAATGTGAAGAAAGAAAATGGTGGCTATGT
TCTATGGTGACTTTCAATACATAAATGTATAGGTTTGTACTGAATTGTACTTTTGTTCACCGTCTTATAAATATGCTTTGGGAAGTTACATAAAAGCTGC
Protein sequenceShow/hide protein sequence
MEIMMEKKKKDPGQELRTIKLFCPSLSTIAPFVASLDQCIDIGSIATIFGLEPSTVKLNGHFLSRGLDLVSSVTWNSLLSFFSAKRLPTGGSDDDALVVDGKLSKIGVKR
AHCPQEIANGDCCEADEEDANLNGGRLKPESNLVKNKRLKHMNSGSKRIDSPVLKCSPNDYKRKQHMEEVILLKKLKLNETKSGFDELSDAGREINSTANGFPRTAYSCS
YNSKNMKRMREDEALVPAFCKRTK