; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g12610 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g12610
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr4:9665678..9668355
RNA-Seq ExpressionMoc04g12610
SyntenyMoc04g12610
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]1.1e-7154.65Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGAGGI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAMVCGFSQ
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ D+LLL SGLL++N  + P E  RPNS LAMVC F+ 
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAMVCGFSQ

Query:  DVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKILEVSPLREVRRRASPKKSKKKK
         V+ K      A + +++ +  +PAV  P +ED      +P+ E+     P + K+ +
Subjt:  DVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKILEVSPLREVRRRASPKKSKKKK

XP_022155229.1 uncharacterized protein LOC111022371 [Momordica charantia]1.2e-5466.67Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEYD R P+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSG
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSG

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]6.2e-6764.58Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGAGGI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSEL
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ D+LLL SGLL++N  + P E+ RPNSEL
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]3.6e-6764.43Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAM
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ DKLLL SGLL++N  + P E+ RPNSEL M
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]3.9e-8559.64Show/hide
Query:  RLPREGERADNPPEGCITLYLKMFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYY
        RLP EGERADNPPEG +TLY KMFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +L  VDQLLACFE KRI +KPGR+Y
Subjt:  RLPREGERADNPPEGCITLYLKMFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYY

Query:  LCARKGAGGILKGPTSIKKWVGKWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNT
        +CARKGAGGI+KGPTSIK WV KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ D+LLL SGLL++N 
Subjt:  LCARKGAGGILKGPTSIKKWVGKWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNT

Query:  FLAPSEARRPNSELAMVCGFSQDVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKI---LEVS--PLREVRRR
         + P E+ RPNSELAMVCGF+  V+ K      A + +++ +  +PAV  P +ED  +   LE S  P RE R R
Subjt:  FLAPSEARRPNSELAMVCGFSQDVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKI---LEVS--PLREVRRR

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138265.3e-7254.65Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGAGGI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAMVCGFSQ
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ D+LLL SGLL++N  + P E  RPNS LAMVC F+ 
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAMVCGFSQ

Query:  DVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKILEVSPLREVRRRASPKKSKKKK
         V+ K      A + +++ +  +PAV  P +ED      +P+ E+     P + K+ +
Subjt:  DVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKILEVSPLREVRRRASPKKSKKKK

A0A6J1DPM7 uncharacterized protein LOC1110223715.9e-5566.67Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEYD R P+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSG
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSG

A0A6J1DWD2 uncharacterized protein LOC1110246803.0e-6764.58Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGAGGI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSEL
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ D+LLL SGLL++N  + P E+ RPNSEL
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSEL

A0A6J1DWF1 uncharacterized protein LOC1110251081.8e-6764.43Show/hide
Query:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG
        MFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +LL VDQLLACFE KRI +KPGR+Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVG

Query:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAM
        KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ DKLLL SGLL++N  + P E+ RPNSEL M
Subjt:  KWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAM

A0A6J1DXS5 uncharacterized protein LOC1110255021.9e-8559.64Show/hide
Query:  RLPREGERADNPPEGCITLYLKMFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYY
        RLP EGERADNPPEG +TLY KMFEY  RLP+HP VQEFL RTGLAPAQVAPNGWGVIF+L ILFWL+    E+ +L  VDQLLACFE KRI +KPGR+Y
Subjt:  RLPREGERADNPPEGCITLYLKMFEYDFRLPVHPLVQEFLVRTGLAPAQVAPNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYY

Query:  LCARKGAGGILKGPTSIKKWVGKWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNT
        +CARKGAGGI+KGPTSIK WV KWF+AS  WLAK+E    FF+VP RFGNLV+IR +P+ ++ +F+ LK++K  F  G+++ TL+ D+LLL SGLL++N 
Subjt:  LCARKGAGGILKGPTSIKKWVGKWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIRLIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNT

Query:  FLAPSEARRPNSELAMVCGFSQDVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKI---LEVS--PLREVRRR
         + P E+ RPNSELAMVCGF+  V+ K      A + +++ +  +PAV  P +ED  +   LE S  P RE R R
Subjt:  FLAPSEARRPNSELAMVCGFSQDVRCKC---PRARQTSKAKEAWSPAVADPPTEDQKI---LEVS--PLREVRRR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGATCATCATGTCGAAGTCTTCCAGTTCTCCAAGTTCGTCTAGTAGTAATTCTTCTAGTAGCGCAACTCGGAGTCGGGATCCTTCACCCAAGAAGGCCGAT
TCCGTAGAGGAGTTTGCGAGTAGGTTAGACTCTGAGCTGGAGAAAGAGATAGATAATTTTAGGCTTCCTAGGGAAGGGGAGCGAGCTGACAATCCTCCGGAGGGG
TGTATAACCCTTTATTTGAAGATGTTTGAGTACGATTTTCGCCTACCCGTCCATCCTTTGGTGCAGGAGTTCTTAGTCCGAACTGGGTTAGCTCCTGCTCAAGTG
GCCCCCAATGGATGGGGAGTCATCTTTAGCCTAACTATATTATTTTGGCTAAAGTGTATGGTGGTAGAAGATCTAGACCTCCTCAGAGTTGACCAACTTTTAGCC
TGCTTTGAGGTCAAGCGAATCTTTAGGAAGCCGGGGAGGTACTACTTGTGTGCTAGGAAGGGTGCAGGAGGCATTCTGAAAGGTCCAACCTCCATAAAAAAGTGG
GTTGGAAAGTGGTTCTTTGCCTCCGAAGCATGGCTGGCCAAGAACGAGTTCGACCTGCCTTTCTTCAACGTCCCTTATAGGTTTGGGAACTTAGTTGCTATAAGG
CTGATTCCTCAGCCTTCCGAGCCGACCTTTAACGCTTTGAAGTTCTTCAAAGGCATATTCAAGAGCGGCAAGCAGATTAGCACGCTCATAATAGACAAGCTCCTC
CTTACCTCTGGACTGCTCAACTTCAATACGTTTTTAGCCCCTTCTGAAGCTCGGAGACCGAACTCGGAACTAGCCATGGTGTGCGGCTTCTCCCAGGACGTCAGG
TGCAAATGCCCTCGTGCTAGACAGACCTCCAAGGCTAAGGAGGCATGGAGTCCTGCTGTTGCTGATCCTCCTACCGAGGATCAAAAGATCCTTGAGGTCTCTCCC
CTTAGGGAGGTTCGGAGGAGGGCCTCCCCTAAGAAGTCCAAGAAGAAGAAATGCAAGGCTCACCACTCCGAGGACGAGGTGAGGGAGTCTACTTTCGAAGCCGAT
AAGGCCAAGCTCAAGAGTGCTCAGGCGGAGGTTGCCCGGCACATGGAGAATCTGAAAGGCGCATATGCTGTGGCAAAATGCCTTAAAAAGGAAAATTTCGCGCTG
ATAAAGCAGAACGACGAACTCGAGTGCCTCCGAGAAGACATGGAGGTCTTTCGCAAACACCCTAACTTCGACGGGTTTGCCAAAAACTTCAGCGACGTCGGTTTC
CAGTTCTTGATGAAGGGGGTCAAGGAAGTGGCTCTCGAGCTCGACCTTGAGCCCATCAAGCTCAGATATGCCAAGAAGTGGGCTTCAGGTCCCAACAGGACACCT
GGCCCCCAAGACTTTGTGGACCAATGCTTGAAGGAGCTCGAATCTGACGCTGAGCTCAACGAGGGGGAGGACATTGGCCTTTCTTCCCAAGAAGTTTATGGGGCT
CTCCACTCAACTGTTGGCGTTAACTTCTACCAAGAGGTCGGGATGATGGACTCTCAAGAGGTCGACATCCTTGGATCTCAAGGCGAACTTGGGTCTCACCTCGGA
AGCAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAGATCATCATGTCGAAGTCTTCCAGTTCTCCAAGTTCGTCTAGTAGTAATTCTTCTAGTAGCGCAACTCGGAGTCGGGATCCTTCACCCAAGAAGGCCGAT
TCCGTAGAGGAGTTTGCGAGTAGGTTAGACTCTGAGCTGGAGAAAGAGATAGATAATTTTAGGCTTCCTAGGGAAGGGGAGCGAGCTGACAATCCTCCGGAGGGG
TGTATAACCCTTTATTTGAAGATGTTTGAGTACGATTTTCGCCTACCCGTCCATCCTTTGGTGCAGGAGTTCTTAGTCCGAACTGGGTTAGCTCCTGCTCAAGTG
GCCCCCAATGGATGGGGAGTCATCTTTAGCCTAACTATATTATTTTGGCTAAAGTGTATGGTGGTAGAAGATCTAGACCTCCTCAGAGTTGACCAACTTTTAGCC
TGCTTTGAGGTCAAGCGAATCTTTAGGAAGCCGGGGAGGTACTACTTGTGTGCTAGGAAGGGTGCAGGAGGCATTCTGAAAGGTCCAACCTCCATAAAAAAGTGG
GTTGGAAAGTGGTTCTTTGCCTCCGAAGCATGGCTGGCCAAGAACGAGTTCGACCTGCCTTTCTTCAACGTCCCTTATAGGTTTGGGAACTTAGTTGCTATAAGG
CTGATTCCTCAGCCTTCCGAGCCGACCTTTAACGCTTTGAAGTTCTTCAAAGGCATATTCAAGAGCGGCAAGCAGATTAGCACGCTCATAATAGACAAGCTCCTC
CTTACCTCTGGACTGCTCAACTTCAATACGTTTTTAGCCCCTTCTGAAGCTCGGAGACCGAACTCGGAACTAGCCATGGTGTGCGGCTTCTCCCAGGACGTCAGG
TGCAAATGCCCTCGTGCTAGACAGACCTCCAAGGCTAAGGAGGCATGGAGTCCTGCTGTTGCTGATCCTCCTACCGAGGATCAAAAGATCCTTGAGGTCTCTCCC
CTTAGGGAGGTTCGGAGGAGGGCCTCCCCTAAGAAGTCCAAGAAGAAGAAATGCAAGGCTCACCACTCCGAGGACGAGGTGAGGGAGTCTACTTTCGAAGCCGAT
AAGGCCAAGCTCAAGAGTGCTCAGGCGGAGGTTGCCCGGCACATGGAGAATCTGAAAGGCGCATATGCTGTGGCAAAATGCCTTAAAAAGGAAAATTTCGCGCTG
ATAAAGCAGAACGACGAACTCGAGTGCCTCCGAGAAGACATGGAGGTCTTTCGCAAACACCCTAACTTCGACGGGTTTGCCAAAAACTTCAGCGACGTCGGTTTC
CAGTTCTTGATGAAGGGGGTCAAGGAAGTGGCTCTCGAGCTCGACCTTGAGCCCATCAAGCTCAGATATGCCAAGAAGTGGGCTTCAGGTCCCAACAGGACACCT
GGCCCCCAAGACTTTGTGGACCAATGCTTGAAGGAGCTCGAATCTGACGCTGAGCTCAACGAGGGGGAGGACATTGGCCTTTCTTCCCAAGAAGTTTATGGGGCT
CTCCACTCAACTGTTGGCGTTAACTTCTACCAAGAGGTCGGGATGATGGACTCTCAAGAGGTCGACATCCTTGGATCTCAAGGCGAACTTGGGTCTCACCTCGGA
AGCAGCTGA
Protein sequenceShow/hide protein sequence
MEIIMSKSSSSPSSSSSNSSSSATRSRDPSPKKADSVEEFASRLDSELEKEIDNFRLPREGERADNPPEGCITLYLKMFEYDFRLPVHPLVQEFLVRTGLAPAQV
APNGWGVIFSLTILFWLKCMVVEDLDLLRVDQLLACFEVKRIFRKPGRYYLCARKGAGGILKGPTSIKKWVGKWFFASEAWLAKNEFDLPFFNVPYRFGNLVAIR
LIPQPSEPTFNALKFFKGIFKSGKQISTLIIDKLLLTSGLLNFNTFLAPSEARRPNSELAMVCGFSQDVRCKCPRARQTSKAKEAWSPAVADPPTEDQKILEVSP
LREVRRRASPKKSKKKKCKAHHSEDEVRESTFEADKAKLKSAQAEVARHMENLKGAYAVAKCLKKENFALIKQNDELECLREDMEVFRKHPNFDGFAKNFSDVGF
QFLMKGVKEVALELDLEPIKLRYAKKWASGPNRTPGPQDFVDQCLKELESDAELNEGEDIGLSSQEVYGALHSTVGVNFYQEVGMMDSQEVDILGSQGELGSHLG
SS