; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC11g0008 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC11g0008
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionPoly polymerase 1, putative
Genome locationMC11:113591..114202
RNA-Seq ExpressionMC11g0008
SyntenyMC11g0008
Gene Ontology termsNA
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_011654294.1 uncharacterized protein LOC101220453 [Cucumis sativus]5.75e-6564.36Show/hide
Query:  ASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASDMAAMALKAS
        +SVPPPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSSTSDSFLCNSD L+YDDFIP +PLD QL  +QIYF+LPSS L  RL+A DMAA+A+KA+
Subjt:  ASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASDMAAMALKAS

Query:  LALQNASSKD-PLLRKKGR---ISPLL---IPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYEGT
        LALQNAS+ +  L   KGR   ISPL     PN   + H + +  L+ + + KNN            SSSV+KLQ+LTSRRAKMAVRSFKL+LSTIYEGT
Subjt:  LALQNASSKD-PLLRKKGR---ISPLL---IPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYEGT

Query:  VL
        VL
Subjt:  VL

XP_022154429.1 uncharacterized protein LOC111021675 [Momordica charantia]5.19e-133100Show/hide
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
        MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
Subjt:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE

Query:  GTVL
        GTVL
Subjt:  GTVL

XP_022940531.1 uncharacterized protein LOC111446101 [Cucurbita moschata]1.18e-7368.9Show/hide
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTS+RAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

XP_022981858.1 uncharacterized protein LOC111480876 [Cucurbita maxima]4.13e-7469.38Show/hide
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTSRRAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

XP_038896630.1 uncharacterized protein LOC120084892 [Benincasa hispida]3.41e-7366.82Show/hide
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C      +SVPPPP  PTAKVI+L+G+LREYP PISVSRVLQTE+ SSSTSDSFLCNSD LYYDDFIPP+PLD QL  ++IYFLL SSKL QR
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRK-KGRISPLLIPNPT-SDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKL
        L+ASDMAA+A+KA+LALQN S+ DP LR+ KGRISP+L+ +   SD  S  D +    +SKKN+            SSSVR+LQ+LTSRRAKMAVRSFKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRK-KGRISPLLIPNPT-SDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKL

Query:  KLSTIYEGTVL
        +LSTIYEG VL
Subjt:  KLSTIYEGTVL

TrEMBL top hitse value%identityAlignment
A0A0A0L5Z9 Uncharacterized protein2.78e-6564.36Show/hide
Query:  ASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASDMAAMALKAS
        +SVPPPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSSTSDSFLCNSD L+YDDFIP +PLD QL  +QIYF+LPSS L  RL+A DMAA+A+KA+
Subjt:  ASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASDMAAMALKAS

Query:  LALQNASSKD-PLLRKKGR---ISPLL---IPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYEGT
        LALQNAS+ +  L   KGR   ISPL     PN   + H + +  L+ + + KNN            SSSV+KLQ+LTSRRAKMAVRSFKL+LSTIYEGT
Subjt:  LALQNASSKD-PLLRKKGR---ISPLL---IPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYEGT

Query:  VL
        VL
Subjt:  VL

A0A1S3BUP5 uncharacterized protein LOC1034938641.28e-6461.75Show/hide
Query:  MGACLS-CS-----PASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ
        MG CLS C       +SVPPPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSSTSDSFLCNSD LY+DDFIP +PLD QL  +QIYF+LPSS L  
Subjt:  MGACLS-CS-----PASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ

Query:  RLSASDMAAMALKASLALQNASSKD----PLLRKKGR---ISPLL-IPNPTSDSHS-DSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAK
        RL+A DMAA+A+KA+LALQNAS+ +     L R KGR   ISPL  + +P    H  + +  L+ + + KNN            SSSV+KLQ+LTSRRAK
Subjt:  RLSASDMAAMALKASLALQNASSKD----PLLRKKGR---ISPLL-IPNPTSDSHS-DSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAK

Query:  MAVRSFKLKLSTIYEGT
        MAVRSFKL+LSTIYEGT
Subjt:  MAVRSFKLKLSTIYEGT

A0A6J1DM27 uncharacterized protein LOC1110216752.51e-133100Show/hide
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
        MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
Subjt:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE

Query:  GTVL
        GTVL
Subjt:  GTVL

A0A6J1FIQ7 uncharacterized protein LOC1114461015.71e-7468.9Show/hide
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTS+RAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

A0A6J1J381 uncharacterized protein LOC1114808762.00e-7469.38Show/hide
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTSRRAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21010.1 unknown protein1.2e-3745.07Show/hide
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS-------FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLR
        MG C+S          ++PT K++++ G+LREY  P+  S+VL+ E+ ++ +S S       F+C+SDSLYYDDFIP +  ++ L A QIYF+LP SK +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS-------FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLR

Query:  QRLSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKK--NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSF
         RL+ASDMAA+A+KAS+A+QN+  K+   RKK RISP+++   ++DS + + S       +   +   PV     I  S SVR L++ TS+RAK+AVRSF
Subjt:  QRLSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKK--NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSF

Query:  KLKLSTIYEGTVL
        +LKLSTIYEG+V+
Subjt:  KLKLSTIYEGTVL

AT1G76600.1 unknown protein7.2e-3846.05Show/hide
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS----FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRL
        MG C+S +        ++ TAK++++ G+LREY  P+  S+VL++E+ SSS+S S    FLCNSDSLYYDDFIP +  D+ L A+QIYF+LP SK + RL
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS----FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRL

Query:  SASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKK---------NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA
        SASDMAA+A+KAS+A++ A+ K    R+ GRISP++  N  +D+   + +N    ++           N   P         S SVRKL++ TS RAK+A
Subjt:  SASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKK---------NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA

Query:  VRSFKLKLSTIYEGT
        VRSF+L+LSTIYEG+
Subjt:  VRSFKLKLSTIYEGT

AT2G23690.1 unknown protein4.4e-1134.07Show/hide
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MG C S     V       TAK+I  +G + E+ +P+ V  VLQ +NP       F+CNSD + +D+ +  +  D++    Q+YF LP S L   L A +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLALQNAS---SKDPLLRKKGRISPLL
        MAA+A+KAS AL  +     +D    ++  +SP++
Subjt:  MAAMALKASLALQNAS---SKDPLLRKKGRISPLL

AT3G50800.1 unknown protein1.5e-1140.18Show/hide
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGAC S           T TAK+I  +G L+E+  P+ V ++LQ +NP+     SF+CNSD + +DD +  +P  + L   ++YF+LP + L   L A +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLAL
        MAA+A+KAS AL
Subjt:  MAAMALKASLAL

AT5G66580.1 unknown protein8.9e-1239.29Show/hide
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGAC S           + +AK+I L+G L+E+ +P+ V ++LQ +NP+     SF+CNSD + +DD +  +  +++L + Q+YF+LP + L   L A +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLAL
        MAA+A+KAS AL
Subjt:  MAAMALKASLAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCGCCTGCTTGTCGTGCTCCCCAGCTTCCGTCCCTCCTCCTCCGGCTACTCCCACCGCGAAAGTGATCTCTTTAGAGGGCAATCTCCGGGAGTACCCTGCTCCCAT
CTCCGTTTCCCGCGTTCTTCAAACCGAAAACCCCTCTTCCTCCACCTCCGATTCCTTTCTTTGCAACTCCGACAGCCTGTACTACGACGATTTCATTCCCCCGATGCCCC
TCGACGACCAGCTTCTGGCCAGTCAGATCTATTTCCTTCTTCCTTCCTCCAAGCTCCGCCAGCGATTGAGCGCCTCCGACATGGCCGCCATGGCCCTCAAAGCCAGCCTC
GCCCTCCAAAATGCTTCCTCTAAAGACCCCCTCTTGCGTAAGAAGGGTCGTATTTCTCCCCTCCTCATCCCCAACCCCACCTCCGACTCCCACTCCGACTCGGACTCCAA
CCTCGCCCCCTCCGACTCCAAGAAGAATAATGGCGTCCCCGTTCCCATCCCCATCCCCATCCCCGAATCCTCTTCCGTTAGAAAATTGCAGAAATTGACATCCAGAAGGG
CAAAAATGGCGGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCACCGTTCTC
mRNA sequenceShow/hide mRNA sequence
ATGGGCGCCTGCTTGTCGTGCTCCCCAGCTTCCGTCCCTCCTCCTCCGGCTACTCCCACCGCGAAAGTGATCTCTTTAGAGGGCAATCTCCGGGAGTACCCTGCTCCCAT
CTCCGTTTCCCGCGTTCTTCAAACCGAAAACCCCTCTTCCTCCACCTCCGATTCCTTTCTTTGCAACTCCGACAGCCTGTACTACGACGATTTCATTCCCCCGATGCCCC
TCGACGACCAGCTTCTGGCCAGTCAGATCTATTTCCTTCTTCCTTCCTCCAAGCTCCGCCAGCGATTGAGCGCCTCCGACATGGCCGCCATGGCCCTCAAAGCCAGCCTC
GCCCTCCAAAATGCTTCCTCTAAAGACCCCCTCTTGCGTAAGAAGGGTCGTATTTCTCCCCTCCTCATCCCCAACCCCACCTCCGACTCCCACTCCGACTCGGACTCCAA
CCTCGCCCCCTCCGACTCCAAGAAGAATAATGGCGTCCCCGTTCCCATCCCCATCCCCATCCCCGAATCCTCTTCCGTTAGAAAATTGCAGAAATTGACATCCAGAAGGG
CAAAAATGGCGGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCACCGTTCTC
Protein sequenceShow/hide protein sequence
MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASDMAAMALKASL
ALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLAPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYEGTVL