CuGenDBv2

Moc04g36670 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g36670
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr4:27514811..27515899
RNA-Seq Expression: Moc04g36670
Synteny: Moc04g36670
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR005162 - Retrotransposon gag domain
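
The genome location listed above (`chr4:27514811..27515899`) follows a `chrom:start..end` pattern. The sketch below shows one way to parse it and compute the span. It assumes the coordinates are 1-based and inclusive, which this record does not state; the function names `parse_location` and `span_length` are illustrative, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

def span_length(start: int, end: int) -> int:
    """Length of an interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr4:27514811..27515899")
print(chrom, start, end, span_length(start, end))
# chr4 27514811 27515899 1089
```

Under that assumption the locus spans 1089 bp, which is 363 codons if the whole span is coding.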


Homology
GenBank top hits (e value, % identity, alignment)
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia], e value 4.2e-107, 77.65% identity
Query:  MRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNP
        MRTQM +ME+MY+EM+ AAGA SRSEN+V   D+ EQRG  LGPV++  PE  E E +T QRGDLREHLN+KR SSLRKGQ PS S+R+SNQQAESS+NP
Subjt:  MRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNP

Query:  ATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQ
         TP GVITREEFDQL+ K DAQVEALKAKCE+KE   +DGDLGESPFTSD+ EA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+D IKCR FQ
Subjt:  ATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQ

Query:  IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
        IALTGSARLWYRRLPARSISTY+QLR+EF+ QFSSRHYD+KT THL TIRQK+GETLREYVTRF
Subjt:  IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia], e value 2.6e-93, 92.75% identity
Query:  KRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGS
        +RGSSLRKGQ PSRS+RSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEG LNDGDLGESPFTSDV EAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
        +DPKDYVEVFEGLMDFQAASD IKCR FQIALT SARLWYRRLPARSISTY+QLRREFLAQFSSRHYDK+T THLATIRQK+GETLREYVTRF
Subjt:  KDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia], e value 1.4e-86, 77.93% identity
Query:  DNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV
        + E E +T QRGDLREHLN+KR SSLRKGQ PS S+R+SNQQAESS+NP TP  VITREEFDQL+ K DAQVEALKA CE+KE   +DGDLGE PFT D+
Subjt:  DNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV

Query:  FEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQ
         EAPI  KFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+D IKCR FQIA TGSARLWYRRLPARSISTY+QLR+EF++QFSSR+YD+KT THLATIRQ
Subjt:  FEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQ

Query:  KKGETLREYVTRF
        KKGETLREYVTRF
Subjt:  KKGETLREYVTRF

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia], e value 5.9e-93, 68.91% identity
Query:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESS
        MEAMRTQMR+MEEMYN+M+  AGA SRS +QV H DV EQ      PV+EE              GDLR+HLN+KR SS R  +  +  +++SNQQAESS
Subjt:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESS

Query:  HNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCR
        +NP  P GVITREEF+QL+ K DAQVEALK +CE+KE   +DGDLGESPFTSD+ EA IP KFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+D IKCR
Subjt:  HNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCR

Query:  VFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
         FQIALTGSARLWYRRLPARSISTY+QLR+EF++QF SRHYD+KT THLATIRQK+G+TL+EY+TRF
Subjt:  VFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia], e value 3.8e-124, 67.68% identity
Query:  MVQPTNSTNTADRSTLAASDAHQREVGAAVVEGQGHNGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQP +STNT DR  L A+D HQREVGA VVEGQ H GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G APAP+ ENFDALQ+EMEAMR
Subjt:  MVQPTNSTNTADRSTLAASDAHQREVGAAVVEGQGHNGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPAT
        TQM +MEEMYNEM+ A GAGSRSE++                                +RGDLR+HL++KR SSLRKG+ PS S+++SNQQAESS+NP  
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPAT

Query:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIA
        P GVITREEFDQL+ K DAQVE LKA+CE K    +DGDLGESPFTSD+ EA IPSKFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+D IK R FQIA
Subjt:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIA

Query:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
        LT SARLWYRRLPARSISTY+QLR+EF +QFSSRHY++KT THLATIRQK+ ETLREYVT F
Subjt:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DDS5 uncharacterized protein LOC111019842, e value 1.3e-93, 92.75% identity
Query:  KRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGS
        +RGSSLRKGQ PSRS+RSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEG LNDGDLGESPFTSDV EAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
        +DPKDYVEVFEGLMDFQAASD IKCR FQIALT SARLWYRRLPARSISTY+QLRREFLAQFSSRHYDK+T THLATIRQK+GETLREYVTRF
Subjt:  KDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

A0A6J1DDW5 uncharacterized protein LOC111019634, e value 2.0e-107, 77.65% identity
Query:  MRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNP
        MRTQM +ME+MY+EM+ AAGA SRSEN+V   D+ EQRG  LGPV++  PE  E E +T QRGDLREHLN+KR SSLRKGQ PS S+R+SNQQAESS+NP
Subjt:  MRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNP

Query:  ATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQ
         TP GVITREEFDQL+ K DAQVEALKAKCE+KE   +DGDLGESPFTSD+ EA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+D IKCR FQ
Subjt:  ATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQ

Query:  IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
        IALTGSARLWYRRLPARSISTY+QLR+EF+ QFSSRHYD+KT THL TIRQK+GETLREYVTRF
Subjt:  IALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

A0A6J1DM55 uncharacterized protein LOC111022267, e value 6.8e-87, 77.93% identity
Query:  DNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV
        + E E +T QRGDLREHLN+KR SSLRKGQ PS S+R+SNQQAESS+NP TP  VITREEFDQL+ K DAQVEALKA CE+KE   +DGDLGE PFT D+
Subjt:  DNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDV

Query:  FEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQ
         EAPI  KFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+D IKCR FQIA TGSARLWYRRLPARSISTY+QLR+EF++QFSSR+YD+KT THLATIRQ
Subjt:  FEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQ

Query:  KKGETLREYVTRF
        KKGETLREYVTRF
Subjt:  KKGETLREYVTRF

A0A6J1DPN4 uncharacterized protein LOC111023060, e value 2.9e-93, 68.91% identity
Query:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESS
        MEAMRTQMR+MEEMYN+M+  AGA SRS +QV H DV EQ      PV+EE              GDLR+HLN+KR SS R  +  +  +++SNQQAESS
Subjt:  MEAMRTQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESS

Query:  HNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCR
        +NP  P GVITREEF+QL+ K DAQVEALK +CE+KE   +DGDLGESPFTSD+ EA IP KFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+D IKCR
Subjt:  HNPATPAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCR

Query:  VFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
         FQIALTGSARLWYRRLPARSISTY+QLR+EF++QF SRHYD+KT THLATIRQK+G+TL+EY+TRF
Subjt:  VFQIALTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

A0A6J1DZJ1 uncharacterized protein LOC111025738, e value 1.8e-124, 67.68% identity
Query:  MVQPTNSTNTADRSTLAASDAHQREVGAAVVEGQGHNGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR
        MVQP +STNT DR  L A+D HQREVGA VVEGQ H GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G APAP+ ENFDALQ+EMEAMR
Subjt:  MVQPTNSTNTADRSTLAASDAHQREVGAAVVEGQGHNGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMR

Query:  TQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPAT
        TQM +MEEMYNEM+ A GAGSRSE++                                +RGDLR+HL++KR SSLRKG+ PS S+++SNQQAESS+NP  
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPAT

Query:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIA
        P GVITREEFDQL+ K DAQVE LKA+CE K    +DGDLGESPFTSD+ EA IPSKFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+D IK R FQIA
Subjt:  PAGVITREEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIA

Query:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF
        LT SARLWYRRLPARSISTY+QLR+EF +QFSSRHY++KT THLATIRQK+ ETLREYVT F
Subjt:  LTGSARLWYRRLPARSISTYAQLRREFLAQFSSRHYDKKTVTHLATIRQKKGETLREYVTRF

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
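
The alignments above use the usual BLAST-style midline between the Query and Subjt rows: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The sketch below shows how identity could be tallied from such a midline. It assumes the midline string is passed at full alignment width (trailing spaces intact), and `midline_stats` is a hypothetical helper name, not a function of any BLAST library.

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives in a BLAST-style midline.

    Letters are identical residues, '+' is a conservative
    substitution, and spaces are mismatches or gaps.
    """
    length = len(midline)
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    pct = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": positives,
        "pct_identity": pct,
    }

# First 12 columns of the midline from the XP_022151719.1 alignment.
print(midline_stats("MRTQM +ME+MY"))
# {'length': 12, 'identities': 9, 'positives': 11, 'pct_identity': 75.0}
```

The % identity values reported on this page are computed over each full alignment, so a short excerpt like this one will not reproduce them.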

Sequences
CDS sequence
ATGGTTCAACCCACAAACTCGACCAATACGGCAGATCGAAGTACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACAA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCGCGAATCACCGCGCCTGTTCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCAAGTGACGCACATTGACGTACGCGAGCAAAGGGGTTCCCGCCTCGGCCCAGTCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAAAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTTACCATCCCGCT
CATACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGAGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAG
GTTGAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTCGAAGCGCCAATCCCTTCGAA
GTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGTAATCAAAT
GTCGCGTCTTTCAGATCGCGCTTACTGGCAGTGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGTCACCCATCTCGCCACCATCAGGCAAAAGAAAGGTGAGACGCTGCGGGAATATGTCACCAGGTTCTAG
mRNA sequence
ATGGTTCAACCCACAAACTCGACCAATACGGCAGATCGAAGTACCCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACAA
CGGCCTAGCAACAGAACCCCTCCGCAGGTCGGCGCGAATCACCGCGCCTGTTCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAAATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCAAGTGACGCACATTGACGTACGCGAGCAAAGGGGTTCCCGCCTCGGCCCAGTCGAGGAGGAACG
TCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAGCATCTCAACAAAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTTACCATCCCGCT
CATACAGGAGCTCCAACCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGAGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAG
GTTGAGGCCTTAAAGGCCAAGTGTGAGCAGAAAGAAGGTCCACTGAACGATGGCGACCTGGGAGAATCGCCATTCACCTCGGACGTTTTCGAAGCGCCAATCCCTTCGAA
GTTCAAAGCTCCTACTGTGAAACCTTATGATGGGTCGAAAGACCCTAAGGATTATGTTGAGGTCTTTGAGGGCCTCATGGATTTTCAAGCGGCATCAGACGTAATCAAAT
GTCGCGTCTTTCAGATCGCGCTTACTGGCAGTGCGCGTTTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACGCTCAGCTGAGAAGGGAGTTCCTTGCCCAG
TTCTCTTCTCGGCACTATGACAAAAAGACAGTCACCCATCTCGCCACCATCAGGCAAAAGAAAGGTGAGACGCTGCGGGAATATGTCACCAGGTTCTAG
Protein sequence
MVQPTNSTNTADRSTLAASDAHQREVGAAVVEGQGHNGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMRTQMRSMEEMY
NEMILAAGAGSRSENQVTHIDVREQRGSRLGPVEEERPEDNESEGHTRQRGDLREHLNKKRGSSLRKGQLPSRSYRSSNQQAESSHNPATPAGVITREEFDQLRGKLDAQ
VEALKAKCEQKEGPLNDGDLGESPFTSDVFEAPIPSKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDVIKCRVFQIALTGSARLWYRRLPARSISTYAQLRREFLAQ
FSSRHYDKKTVTHLATIRQKKGETLREYVTRF
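
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard genetic code (NCBI translation table 1); the record does not say which table the annotation pipeline used. `translate` is an illustrative helper, not a library function.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' is a stop, 'X' an unknown codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )

# First seven and last five codons of the CDS listed above.
print(translate("ATGGTTCAACCCACAAACTCG"))  # MVQPTNS
print(translate("GTCACCAGGTTCTAG"))        # VTRF*
```

Both excerpts agree with the start (`MVQPTNS`) and end (`VTRF`) of the protein sequence shown above; pasting in the full CDS would allow the whole protein to be compared the same way.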