CuGenDBv2

MS021740 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS021740
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Poly polymerase 1, putative
Genome location: scaffold1:120385..120996
RNA-Seq Expression: MS021740
Synteny: MS021740
Gene Ontology terms: NA
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant


Homology
GenBank top hits (e value, % identity, alignment)
XP_011654294.1 uncharacterized protein LOC101220453 [Cucumis sativus] (e value: 4.5e-50; % identity: 62.67)
Query:  MGACLS-C-----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ
        MG C S C       +SVPPPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSSTSDSFLCNSD L+YDDFIP +PLD QL  +QIYF+LPSS L  
Subjt:  MGACLS-C-----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ

Query:  RLSASDMAAMALKASLALQNASSKD-PLLRKKG---RISPLL---IPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA
        RL+A DMAA+A+KA+LALQNAS+ +  L   KG   RISPL     PN   + H    +  T S+SK N             SSSV+KLQ+LTSRRAKMA
Subjt:  RLSASDMAAMALKASLALQNASSKD-PLLRKKG---RISPLL---IPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA

Query:  VRSFKLKLSTIYEGTVL
        VRSFKL+LSTIYEGTVL
Subjt:  VRSFKLKLSTIYEGTVL

XP_022154429.1 uncharacterized protein LOC111021675 [Momordica charantia] (e value: 1.9e-101; % identity: 99.51)
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
        MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNL PSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
Subjt:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE

Query:  GTVL
        GTVL
Subjt:  GTVL

XP_022940531.1 uncharacterized protein LOC111446101 [Cucurbita moschata] (e value: 1.9e-56; % identity: 68.9)
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTS+RAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

XP_022981858.1 uncharacterized protein LOC111480876 [Cucurbita maxima] (e value: 8.5e-57; % identity: 69.38)
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTSRRAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

XP_038896630.1 uncharacterized protein LOC120084892 [Benincasa hispida] (e value: 3.2e-56; % identity: 66.82)
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C      +SVPPPP  PTAKVI+L+G+LREYP PISVSRVLQTE+ SSSTSDSFLCNSD LYYDDFIPP+PLD QL  ++IYFLL SSKL QR
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRK-KGRISPLLIPNPT-SDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKL
        L+ASDMAA+A+KA+LALQN S+ DP LR+ KGRISP+L+ +   SD  S  D +    +SKKN+            SSSVR+LQ+LTSRRAKMAVRSFKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRK-KGRISPLLIPNPT-SDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKL

Query:  KLSTIYEGTVL
        +LSTIYEG VL
Subjt:  KLSTIYEGTVL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0L5Z9 Uncharacterized protein (e value: 2.2e-50; % identity: 62.67)
Query:  MGACLS-C-----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ
        MG C S C       +SVPPPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSSTSDSFLCNSD L+YDDFIP +PLD QL  +QIYF+LPSS L  
Subjt:  MGACLS-C-----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ

Query:  RLSASDMAAMALKASLALQNASSKD-PLLRKKG---RISPLL---IPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA
        RL+A DMAA+A+KA+LALQNAS+ +  L   KG   RISPL     PN   + H    +  T S+SK N             SSSV+KLQ+LTSRRAKMA
Subjt:  RLSASDMAAMALKASLALQNASSKD-PLLRKKG---RISPLL---IPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA

Query:  VRSFKLKLSTIYEGTVL
        VRSFKL+LSTIYEGTVL
Subjt:  VRSFKLKLSTIYEGTVL

A0A1S3BUP5 uncharacterized protein LOC103493864 (e value: 1.4e-49; % identity: 61.64)
Query:  MGACLS-C-----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ
        MG CLS C       +SVPPPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSSTSDSFLCNSD LY+DDFIP +PLD QL  +QIYF+LPSS L  
Subjt:  MGACLS-C-----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQ

Query:  RLSASDMAAMALKASLALQNASSKD----PLLRKKG---RISPLL-IPNPTSDSHS-DSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAK
        RL+A DMAA+A+KA+LALQNAS+ +     L R KG   RISPL  + +P    H  + +  L+ + + KNN            SSSV+KLQ+LTSRRAK
Subjt:  RLSASDMAAMALKASLALQNASSKD----PLLRKKG---RISPLL-IPNPTSDSHS-DSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAK

Query:  MAVRSFKLKLSTIYEGTVL
        MAVRSFKL+LSTIYEGT L
Subjt:  MAVRSFKLKLSTIYEGTVL

A0A6J1DM27 uncharacterized protein LOC111021675 (e value: 9.4e-102; % identity: 99.51)
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
        MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNL PSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE
Subjt:  MAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYE

Query:  GTVL
        GTVL
Subjt:  GTVL

A0A6J1FIQ7 uncharacterized protein LOC111446101 (e value: 9.2e-57; % identity: 68.9)
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTS+RAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

A0A6J1J381 uncharacterized protein LOC111480876 (e value: 4.1e-57; % identity: 69.38)
Query:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR
        MGACLS C     P+SV PPP  PTAKVISL+G+LREYP PISVSRVLQTEN SSS SDSFLCNSD LYYDDFIPP+PLD+QLL +QIYFLLPSS L  R
Subjt:  MGACLS-C----SPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQR

Query:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL
        LSAS MAA+A+KASLALQNAS  D   RKKGR+SPLL       + SDSD  ++   SKKN             S SVRKLQ+LTSRRAKMAVRSFKLKL
Subjt:  LSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKL

Query:  STIYEGTVL
        STIYEG VL
Subjt:  STIYEGTVL

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G21010.1 unknown protein (e value: 4.2e-38; % identity: 45.54)
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS-------FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLR
        MG C+S          ++PT K++++ G+LREY  P+  S+VL+ E+ ++ +S S       F+C+SDSLYYDDFIP +  ++ L A QIYF+LP SK +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS-------FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLR

Query:  QRLSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKK--NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSF
         RL+ASDMAA+A+KAS+A+QN+  K+   RKK RISP+++   ++DS + + S  T    +   +   PV     I  S SVR L++ TS+RAK+AVRSF
Subjt:  QRLSASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKK--NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSF

Query:  KLKLSTIYEGTVL
        +LKLSTIYEG+V+
Subjt:  KLKLSTIYEGTVL

AT1G76600.1 unknown protein (e value: 9.5e-38; % identity: 46.05)
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS----FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRL
        MG C+S +        ++ TAK++++ G+LREY  P+  S+VL++E+ SSS+S S    FLCNSDSLYYDDFIP +  D+ L A+QIYF+LP SK + RL
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDS----FLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRL

Query:  SASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKK---------NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA
        SASDMAA+A+KAS+A++ A+ K    R+ GRISP++  N  +D+   + +N    ++           N   P         S SVRKL++ TS RAK+A
Subjt:  SASDMAAMALKASLALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKK---------NNGVPVPIPIPIPESSSVRKLQKLTSRRAKMA

Query:  VRSFKLKLSTIYEGT
        VRSF+L+LSTIYEG+
Subjt:  VRSFKLKLSTIYEGT

AT2G23690.1 unknown protein (e value: 4.4e-11; % identity: 34.07)
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MG C S     V       TAK+I  +G + E+ +P+ V  VLQ +NP       F+CNSD + +D+ +  +  D++    Q+YF LP S L   L A +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLALQNAS---SKDPLLRKKGRISPLL
        MAA+A+KAS AL  +     +D    ++  +SP++
Subjt:  MAAMALKASLALQNAS---SKDPLLRKKGRISPLL

AT3G50800.1 unknown protein (e value: 1.5e-11; % identity: 40.18)
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGAC S           T TAK+I  +G L+E+  P+ V ++LQ +NP+     SF+CNSD + +DD +  +P  + L   ++YF+LP + L   L A +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLAL
        MAA+A+KAS AL
Subjt:  MAAMALKASLAL

AT5G66580.1 unknown protein (e value: 8.9e-12; % identity: 39.29)
Query:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD
        MGAC S           + +AK+I L+G L+E+ +P+ V ++LQ +NP+     SF+CNSD + +DD +  +  +++L + Q+YF+LP + L   L A +
Subjt:  MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASD

Query:  MAAMALKASLAL
        MAA+A+KAS AL
Subjt:  MAAMALKASLAL


Sequences
CDS sequence
ATGGGCGCCTGCTTGTCGTGCTCCCCAGCTTCCGTCCCTCCTCCTCCGGCTACTCCCACCGCGAAAGTGATCTCTTTAGAGGGCAATCTCCGGGAGTACCCTGCTCCCAT
CTCCGTTTCCCGCGTTCTTCAAACCGAAAACCCCTCTTCCTCCACCTCCGATTCCTTTCTTTGCAACTCCGACAGCCTGTACTACGACGATTTCATTCCCCCGATGCCCC
TCGACGACCAGCTTCTGGCCAGTCAGATCTATTTCCTTCTTCCTTCCTCCAAGCTCCGCCAGCGATTGAGCGCCTCCGACATGGCCGCCATGGCCCTCAAAGCCAGCCTC
GCCCTCCAAAATGCTTCCTCTAAAGACCCCCTCTTGCGTAAGAAGGGTCGTATTTCTCCCCTCCTCATCCCCAACCCCACCTCCGACTCCCACTCCGACTCGGACTCCAA
CCTCACCCCCTCCGACTCCAAGAAGAATAATGGCGTCCCCGTTCCCATCCCCATCCCCATCCCCGAATCCTCTTCCGTTAGAAAATTGCAGAAATTGACATCCAGAAGGG
CAAAAATGGCGGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCACCGTTCTC
mRNA sequence
ATGGGCGCCTGCTTGTCGTGCTCCCCAGCTTCCGTCCCTCCTCCTCCGGCTACTCCCACCGCGAAAGTGATCTCTTTAGAGGGCAATCTCCGGGAGTACCCTGCTCCCAT
CTCCGTTTCCCGCGTTCTTCAAACCGAAAACCCCTCTTCCTCCACCTCCGATTCCTTTCTTTGCAACTCCGACAGCCTGTACTACGACGATTTCATTCCCCCGATGCCCC
TCGACGACCAGCTTCTGGCCAGTCAGATCTATTTCCTTCTTCCTTCCTCCAAGCTCCGCCAGCGATTGAGCGCCTCCGACATGGCCGCCATGGCCCTCAAAGCCAGCCTC
GCCCTCCAAAATGCTTCCTCTAAAGACCCCCTCTTGCGTAAGAAGGGTCGTATTTCTCCCCTCCTCATCCCCAACCCCACCTCCGACTCCCACTCCGACTCGGACTCCAA
CCTCACCCCCTCCGACTCCAAGAAGAATAATGGCGTCCCCGTTCCCATCCCCATCCCCATCCCCGAATCCTCTTCCGTTAGAAAATTGCAGAAATTGACATCCAGAAGGG
CAAAAATGGCGGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCACCGTTCTC
Protein sequence
MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASDMAAMALKASL
ALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYEGTVL
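
The record's fields can be cross-checked against each other. The sketch below is illustrative and not part of the CuGenDB record: it assumes the genome location is an inclusive 1-based interval and that the protein follows from the CDS under the standard genetic code (NCBI translation table 1), neither of which the page states. The helper names `span_length` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Field values copied from the record above.
LOCATION = "scaffold1:120385..120996"
CDS = (
    "ATGGGCGCCTGCTTGTCGTGCTCCCCAGCTTCCGTCCCTCCTCCTCCGGCTACTCCCACCGCGAAAGTGATCTCTTTAGAGGGCAATCTCCGGGAGTACCCTGCTCCCAT"
    "CTCCGTTTCCCGCGTTCTTCAAACCGAAAACCCCTCTTCCTCCACCTCCGATTCCTTTCTTTGCAACTCCGACAGCCTGTACTACGACGATTTCATTCCCCCGATGCCCC"
    "TCGACGACCAGCTTCTGGCCAGTCAGATCTATTTCCTTCTTCCTTCCTCCAAGCTCCGCCAGCGATTGAGCGCCTCCGACATGGCCGCCATGGCCCTCAAAGCCAGCCTC"
    "GCCCTCCAAAATGCTTCCTCTAAAGACCCCCTCTTGCGTAAGAAGGGTCGTATTTCTCCCCTCCTCATCCCCAACCCCACCTCCGACTCCCACTCCGACTCGGACTCCAA"
    "CCTCACCCCCTCCGACTCCAAGAAGAATAATGGCGTCCCCGTTCCCATCCCCATCCCCATCCCCGAATCCTCTTCCGTTAGAAAATTGCAGAAATTGACATCCAGAAGGG"
    "CAAAAATGGCGGTTCGTTCCTTTAAACTCAAATTGAGCACCATCTATGAAGGCACCGTTCTC"
)
PROTEIN = (
    "MGACLSCSPASVPPPPATPTAKVISLEGNLREYPAPISVSRVLQTENPSSSTSDSFLCNSDSLYYDDFIPPMPLDDQLLASQIYFLLPSSKLRQRLSASDMAAMALKASL"
    "ALQNASSKDPLLRKKGRISPLLIPNPTSDSHSDSDSNLTPSDSKKNNGVPVPIPIPIPESSSVRKLQKLTSRRAKMAVRSFKLKLSTIYEGTVL"
)


def span_length(location: str) -> int:
    """Length of a 'scaffold:start..end' interval, treating both ends as inclusive."""
    _, coords = location.rsplit(":", 1)
    start, end = (int(value) for value in coords.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a nucleotide sequence codon by codon, ignoring any trailing partial codon."""
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


print("genomic span (bp):", span_length(LOCATION))
print("CDS length (bp):", len(CDS))
print("translation matches listed protein:", translate(CDS) == PROTEIN)
```

If the span equals the CDS length, the gene model has no introns under the inclusive-coordinate assumption, which would also explain why the CDS and mRNA sequences listed above are identical. The listed CDS ends on a sense codon rather than a stop codon, so the translation contains no trailing `*`.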