CuGenDBv2

Moc09g29650 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g29650
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Plus3 domain-containing protein
Genome location: chr9:22400984..22403727
RNA-Seq Expression: Moc09g29650
Synteny: Moc09g29650
Gene Ontology terms: NA
InterPro domains: NA
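
The genome location above follows a `chromosome:start..end` pattern. A minimal sketch of parsing it is shown here, assuming the coordinates are 1-based and inclusive (the page does not state its convention, so the computed span is only valid under that assumption). `Location` and `parse_location` are hypothetical names introduced for illustration.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed genome location (hypothetical helper type)."""
    chrom: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: 1-based, inclusive coordinates.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string of the form 'chr9:22400984..22403727'."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("chr9:22400984..22403727")
print(loc.chrom, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene spans 2744 bp on chr9.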


Homology
GenBank top hits | e value | % identity
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 2.3e-44 | 44.17
Query:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE
        MFEYG RLP+HP VQEFL RTGL PAQ +              +   R    + LL+V         KRI++KPG++Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE

Query:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVF
        KWF+AS  WLA++ES   FF+V  RF NLV+IRP+P+L++ +F+ LK++K R   G+++ T +T++LLL S LLDYNP +  +E  RPN  L        
Subjt:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVF

Query:  VLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHA
                            AMVC F+  V+RK     HA
Subjt:  VLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHA

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | 1.1e-43 | 50.52
Query:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE
        MFEYG RLP+HP VQEFL RTGL PAQ +              +   R    + LL+V         KRI++KPG++Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE

Query:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL
        KWF+AS  WLA++ES   FF+V  RF NLV+IRP+P+L++ +F+ LK++K R   G+++ T +T++LLL S LLDYNP +  +E  RPN EL
Subjt:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] | 1.5e-43 | 50.52
Query:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE
        MFEYG RLP+HP VQEFL RTGL PAQ +              +   R    + LL+V         KRI++KPG++Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE

Query:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL
        KWF+AS  WLA++ES   FF+V  RF NLV+IRP+P+L++ +F+ LK++K     G+++ T +T+KLLL S LLDYNP +  +E  RPN EL
Subjt:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 2.8e-74 | 50
Query:  EFASRLDSELEEEIDNFRFSEDEGDDSDTSTSSLGLEYPSQMPENYLGRLHRRYSILDDILLRLPKEGERVDNLPEGRVTLYLKMFEYGFRLPVHPLVQE
        + A RL+S+L EEI+N R S D+G+DSD STS  GLEYPS++PE+YLG L R ++I ++ILLRLP+EGER DN PEG VTLY KMFEYG RLP+HP VQE
Subjt:  EFASRLDSELEEEIDNFRFSEDEGDDSDTSTSSLGLEYPSQMPENYLGRLHRRYSILDDILLRLPKEGERVDNLPEGRVTLYLKMFEYGFRLPVHPLVQE

Query:  FLVRTGLIPAQ----GSGRFGP------PRSRPAS-----------GLLEVKRISRKPGKYYLCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESD
        FL RTGL PAQ    G G           R+R +               E KRI++KPG++Y+CARKGA GI+KGPTSIK WV KWF+AS  WLA++ES 
Subjt:  FLVRTGLIPAQ----GSGRFGP------PRSRPAS-----------GLLEVKRISRKPGKYYLCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESD

Query:  LPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQL
          FF+V  RF NLV+IRP+P+L++ +F+ LK++K R   G+++ T +T++LLL S LLDYNP +  +E  RPN EL                        
Subjt:  LPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQL

Query:  FFLAAMVCGFSQDVRRKRPCTGHA
            AMVCGF+  V+RK     HA
Subjt:  FFLAAMVCGFSQDVRRKRPCTGHA

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 8.7e-60 | 36.35
Query:  LCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNP
        +CARKG  GI+KGPTSIK WV KWFFAS  WLA++ES   FF+V  RF NLV+I+ IP+L++ TF+ LK +K      ++I T +T+KLLL S LLDYNP
Subjt:  LCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNP

Query:  LLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHASKNKEAPSLLSLTFP---------PKSRWLWSDMK-PL
        L+ L+E  RPN EL                            AMVCGF+  V+RK     HA K       ++ T P         P S      ++  L
Subjt:  LLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHASKNKEAPSLLSLTFP---------PKSRWLWSDMK-PL

Query:  RRGRRLRPAPLRITRSLKFLPFRRFGGRLPPRSPRR------------RSARPTT-------PRTRIGGTSDIEMKI-----------------------
          GR         + +L   P     G  P R  R+            R   PT+       P  R+ GTS++ M+                        
Subjt:  RRGRRLRPAPLRITRSLKFLPFRRFGGRLPPRSPRR------------RSARPTT-------PRTRIGGTSDIEMKI-----------------------

Query:  ------------------------KAHAAACQTSIMVKAELEGRGFLIVKEREASSAALE------GELKEARVEAQAWRSTSEADKAELKSAQAEVARH
                                +A  A+   ++MVKAEL+GR  L  KERE S AALE      GEL +A+ E    R+  +A K +L   + E  +H
Subjt:  ------------------------KAHAAACQTSIMVKAELEGRGFLIVKEREASSAALE------GELKEARVEAQAWRSTSEADKAELKSAQAEVARH

Query:  LENLRGTHAMVKCLEKENFMA-----DLKAKLELEES--------------KLRNEVLLEEAFHKHPDFDGFAKDFSDAGFRFLMKGIQEVAP--ELDLA
          +LR  HA+ K LEKE F       DL   LE +++              +L N  LLEE+F +HPDFDGFAKDFSDAGF+FLMKGI    P  ++DL 
Subjt:  LENLRGTHAMVKCLEKENFMA-----DLKAKLELEES--------------KLRNEVLLEEAFHKHPDFDGFAKDFSDAGFRFLMKGIQEVAP--ELDLA

Query:  PIKLRYAEKWASGSNETPGP
         +K +Y+EKWASG N TP P
Subjt:  PIKLRYAEKWASGSNETPGP

TrEMBL top hits | e value | % identity
A0A6J1CR42 uncharacterized protein LOC111013826 | 1.1e-44 | 44.17
Query:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE
        MFEYG RLP+HP VQEFL RTGL PAQ +              +   R    + LL+V         KRI++KPG++Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE

Query:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVF
        KWF+AS  WLA++ES   FF+V  RF NLV+IRP+P+L++ +F+ LK++K R   G+++ T +T++LLL S LLDYNP +  +E  RPN  L        
Subjt:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVF

Query:  VLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHA
                            AMVC F+  V+RK     HA
Subjt:  VLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHA

A0A6J1DWD2 uncharacterized protein LOC111024680 | 5.5e-44 | 50.52
Query:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE
        MFEYG RLP+HP VQEFL RTGL PAQ +              +   R    + LL+V         KRI++KPG++Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE

Query:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL
        KWF+AS  WLA++ES   FF+V  RF NLV+IRP+P+L++ +F+ LK++K R   G+++ T +T++LLL S LLDYNP +  +E  RPN EL
Subjt:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL

A0A6J1DWF1 uncharacterized protein LOC111025108 | 7.2e-44 | 50.52
Query:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE
        MFEYG RLP+HP VQEFL RTGL PAQ +              +   R    + LL+V         KRI++KPG++Y+CARKGA GI+KGPTSIK WV 
Subjt:  MFEYGFRLPVHPLVQEFLVRTGLIPAQGSGR------------FGPPRSRPASGLLEV---------KRISRKPGKYYLCARKGARGILKGPTSIKKWVE

Query:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL
        KWF+AS  WLA++ES   FF+V  RF NLV+IRP+P+L++ +F+ LK++K     G+++ T +T+KLLL S LLDYNP +  +E  RPN EL
Subjt:  KWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFEL

A0A6J1DXS5 uncharacterized protein LOC111025502 | 1.3e-74 | 50
Query:  EFASRLDSELEEEIDNFRFSEDEGDDSDTSTSSLGLEYPSQMPENYLGRLHRRYSILDDILLRLPKEGERVDNLPEGRVTLYLKMFEYGFRLPVHPLVQE
        + A RL+S+L EEI+N R S D+G+DSD STS  GLEYPS++PE+YLG L R ++I ++ILLRLP+EGER DN PEG VTLY KMFEYG RLP+HP VQE
Subjt:  EFASRLDSELEEEIDNFRFSEDEGDDSDTSTSSLGLEYPSQMPENYLGRLHRRYSILDDILLRLPKEGERVDNLPEGRVTLYLKMFEYGFRLPVHPLVQE

Query:  FLVRTGLIPAQ----GSGRFGP------PRSRPAS-----------GLLEVKRISRKPGKYYLCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESD
        FL RTGL PAQ    G G           R+R +               E KRI++KPG++Y+CARKGA GI+KGPTSIK WV KWF+AS  WLA++ES 
Subjt:  FLVRTGLIPAQ----GSGRFGP------PRSRPAS-----------GLLEVKRISRKPGKYYLCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESD

Query:  LPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQL
          FF+V  RF NLV+IRP+P+L++ +F+ LK++K R   G+++ T +T++LLL S LLDYNP +  +E  RPN EL                        
Subjt:  LPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQL

Query:  FFLAAMVCGFSQDVRRKRPCTGHA
            AMVCGF+  V+RK     HA
Subjt:  FFLAAMVCGFSQDVRRKRPCTGHA

A0A6J1DZB3 uncharacterized protein LOC111025665 | 4.2e-60 | 36.35
Query:  LCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNP
        +CARKG  GI+KGPTSIK WV KWFFAS  WLA++ES   FF+V  RF NLV+I+ IP+L++ TF+ LK +K      ++I T +T+KLLL S LLDYNP
Subjt:  LCARKGARGILKGPTSIKKWVEKWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNP

Query:  LLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHASKNKEAPSLLSLTFP---------PKSRWLWSDMK-PL
        L+ L+E  RPN EL                            AMVCGF+  V+RK     HA K       ++ T P         P S      ++  L
Subjt:  LLALLEVRRPNFELVQLPLNVFVLDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHASKNKEAPSLLSLTFP---------PKSRWLWSDMK-PL

Query:  RRGRRLRPAPLRITRSLKFLPFRRFGGRLPPRSPRR------------RSARPTT-------PRTRIGGTSDIEMKI-----------------------
          GR         + +L   P     G  P R  R+            R   PT+       P  R+ GTS++ M+                        
Subjt:  RRGRRLRPAPLRITRSLKFLPFRRFGGRLPPRSPRR------------RSARPTT-------PRTRIGGTSDIEMKI-----------------------

Query:  ------------------------KAHAAACQTSIMVKAELEGRGFLIVKEREASSAALE------GELKEARVEAQAWRSTSEADKAELKSAQAEVARH
                                +A  A+   ++MVKAEL+GR  L  KERE S AALE      GEL +A+ E    R+  +A K +L   + E  +H
Subjt:  ------------------------KAHAAACQTSIMVKAELEGRGFLIVKEREASSAALE------GELKEARVEAQAWRSTSEADKAELKSAQAEVARH

Query:  LENLRGTHAMVKCLEKENFMA-----DLKAKLELEES--------------KLRNEVLLEEAFHKHPDFDGFAKDFSDAGFRFLMKGIQEVAP--ELDLA
          +LR  HA+ K LEKE F       DL   LE +++              +L N  LLEE+F +HPDFDGFAKDFSDAGF+FLMKGI    P  ++DL 
Subjt:  LENLRGTHAMVKCLEKENFMA-----DLKAKLELEES--------------KLRNEVLLEEAFHKHPDFDGFAKDFSDAGFRFLMKGIQEVAP--ELDLA

Query:  PIKLRYAEKWASGSNETPGP
         +K +Y+EKWASG N TP P
Subjt:  PIKLRYAEKWASGSNETPGP

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGAGCAGTATAAGCTTGTTCCTTTGATTCTTATGTCATCGGCGAATCCCCGTCCTCCACGTGGCGAAGGATATTTCATCCCCAAACATTGGCCCCCTCTTTGTCGGCT
TTCTGAAATCTTGGAGTCCGACCTCATCAGATCGGAATCTTTAGCTCGGAATTGGGACCCTTCGCCTGGTAGGGCCGATTCCGTAGATGAGTTTGCTAGTAGGTTAGACT
CCGAACTAGAAGAGGAGATAGATAACTTTAGGTTCTCGGAAGACGAGGGAGATGATAGTGACACCTCGACCTCGAGTCTAGGTTTAGAATATCCTTCCCAAATGCCTGAG
AACTACCTCGGCCGTCTTCATAGGAGGTATAGTATTCTAGACGACATCCTTCTTAGACTTCCCAAGGAAGGGGAACGAGTTGATAATCTACCTGAAGGGCGCGTGACCCT
TTACCTGAAAATGTTTGAGTATGGTTTTCGCCTTCCCGTTCACCCCTTAGTGCAGGAGTTCCTAGTCCGAACTGGATTAATCCCTGCTCAGGGAAGTGGACGATTTGGAC
CTCCTCGGAGTCGACCAGCTTCTGGCTTGCTTGAGGTTAAGCGAATCTCTAGGAAGCCAGGGAAATATTACCTGTGTGCCAGGAAGGGTGCAAGAGGCATTCTAAAGGGC
CCAACCTCCATAAAGAAGTGGGTTGAAAAGTGGTTCTTCGCCTCTAGCGCGTGGCTGGCCAGGAACGAGTCCGACCTACCTTTCTTCAACGTCCTCTATAGGTTTGAGAA
CTTAGTCGCTATAAGGCCGATCCCTCAACTTTCTGAGCCGACCTTCAACGTTCTAAAGTTTTTCAAAGGCAGACTCAAGATTGGTAAGCAGATCAACACGTTTATCACAA
ACAAGCTTCTTCTTTCCTCCAGACTGCTGGACTACAATCCTCTTCTAGCCCTACTCGAAGTTCGGAGACCGAACTTCGAACTAGTTCAGTTGCCTTTGAATGTCTTTGTG
CTTGATATCAGACGTTTTTCGTACCTTGTGCTTAACTCACAACTGTTTTTCCTTGCAGCCATGGTTTGTGGCTTCTCCCAGGATGTTCGGCGCAAGCGCCCATGCACTGG
GCATGCTTCTAAGAACAAGGAGGCACCTTCCCTACTTTCGCTGACCTTCCCACCGAAGTCGAGGTGGTTGTGGTCGGACATGAAGCCACTTCGACGCGGGAGGCGACTGC
GGCCGGCGCCTCTTAGGATCACGAGATCCTTGAAGTTTCTCCCTTTCAGGAGATTTGGAGGAAGGCTTCCCCCAAGAAGTCCAAGAAGAAGAAGCGCAAGACCCACCACT
CCTAGGACGAGGATAGGCGGCACCTCTGACATCGAGATGAAGATTAAGGCTCATGCTGCTGCTTGCCAGACATCCATTATGGTGAAGGCCGAGCTGGAGGGGCGTGGCTT
TCTTATTGTGAAGGAGCGGGAGGCCTCTTCAGCTGCCTTGGAGGGGGAACTCAAAGAGGCTCGCGTTGAGGCTCAGGCATGGAGATCCACCTCTGAAGCTGACAAGGCCG
AGCTCAAAAGTGCCCAAGCAGAAGTTGCTCGACACCTGGAGAATCTGAGGGGCACGCATGCCATGGTCAAGTGCCTTGAGAAGGAGAATTTCATGGCAGACCTAAAGGCC
AAGCTCGAGCTCGAAGAGTCCAAGCTCCGCAATGAGGTTCTTCTGGAGGAAGCGTTTCACAAGCATCCTGACTTCGACGGCTTCGCCAAGGACTTTAGTGATGCTGGCTT
TAGATTCCTGATGAAAGGAATCCAAGAAGTGGCGCCCGAGCTCGATCTCGCGCCCATCAAGCTCAGGTATGCGGAGAAGTGGGCTTCCGGTTCCAATGAGACCCCTGGCC
CCTAG
mRNA sequence
ATGGAGCAGTATAAGCTTGTTCCTTTGATTCTTATGTCATCGGCGAATCCCCGTCCTCCACGTGGCGAAGGATATTTCATCCCCAAACATTGGCCCCCTCTTTGTCGGCT
TTCTGAAATCTTGGAGTCCGACCTCATCAGATCGGAATCTTTAGCTCGGAATTGGGACCCTTCGCCTGGTAGGGCCGATTCCGTAGATGAGTTTGCTAGTAGGTTAGACT
CCGAACTAGAAGAGGAGATAGATAACTTTAGGTTCTCGGAAGACGAGGGAGATGATAGTGACACCTCGACCTCGAGTCTAGGTTTAGAATATCCTTCCCAAATGCCTGAG
AACTACCTCGGCCGTCTTCATAGGAGGTATAGTATTCTAGACGACATCCTTCTTAGACTTCCCAAGGAAGGGGAACGAGTTGATAATCTACCTGAAGGGCGCGTGACCCT
TTACCTGAAAATGTTTGAGTATGGTTTTCGCCTTCCCGTTCACCCCTTAGTGCAGGAGTTCCTAGTCCGAACTGGATTAATCCCTGCTCAGGGAAGTGGACGATTTGGAC
CTCCTCGGAGTCGACCAGCTTCTGGCTTGCTTGAGGTTAAGCGAATCTCTAGGAAGCCAGGGAAATATTACCTGTGTGCCAGGAAGGGTGCAAGAGGCATTCTAAAGGGC
CCAACCTCCATAAAGAAGTGGGTTGAAAAGTGGTTCTTCGCCTCTAGCGCGTGGCTGGCCAGGAACGAGTCCGACCTACCTTTCTTCAACGTCCTCTATAGGTTTGAGAA
CTTAGTCGCTATAAGGCCGATCCCTCAACTTTCTGAGCCGACCTTCAACGTTCTAAAGTTTTTCAAAGGCAGACTCAAGATTGGTAAGCAGATCAACACGTTTATCACAA
ACAAGCTTCTTCTTTCCTCCAGACTGCTGGACTACAATCCTCTTCTAGCCCTACTCGAAGTTCGGAGACCGAACTTCGAACTAGTTCAGTTGCCTTTGAATGTCTTTGTG
CTTGATATCAGACGTTTTTCGTACCTTGTGCTTAACTCACAACTGTTTTTCCTTGCAGCCATGGTTTGTGGCTTCTCCCAGGATGTTCGGCGCAAGCGCCCATGCACTGG
GCATGCTTCTAAGAACAAGGAGGCACCTTCCCTACTTTCGCTGACCTTCCCACCGAAGTCGAGGTGGTTGTGGTCGGACATGAAGCCACTTCGACGCGGGAGGCGACTGC
GGCCGGCGCCTCTTAGGATCACGAGATCCTTGAAGTTTCTCCCTTTCAGGAGATTTGGAGGAAGGCTTCCCCCAAGAAGTCCAAGAAGAAGAAGCGCAAGACCCACCACT
CCTAGGACGAGGATAGGCGGCACCTCTGACATCGAGATGAAGATTAAGGCTCATGCTGCTGCTTGCCAGACATCCATTATGGTGAAGGCCGAGCTGGAGGGGCGTGGCTT
TCTTATTGTGAAGGAGCGGGAGGCCTCTTCAGCTGCCTTGGAGGGGGAACTCAAAGAGGCTCGCGTTGAGGCTCAGGCATGGAGATCCACCTCTGAAGCTGACAAGGCCG
AGCTCAAAAGTGCCCAAGCAGAAGTTGCTCGACACCTGGAGAATCTGAGGGGCACGCATGCCATGGTCAAGTGCCTTGAGAAGGAGAATTTCATGGCAGACCTAAAGGCC
AAGCTCGAGCTCGAAGAGTCCAAGCTCCGCAATGAGGTTCTTCTGGAGGAAGCGTTTCACAAGCATCCTGACTTCGACGGCTTCGCCAAGGACTTTAGTGATGCTGGCTT
TAGATTCCTGATGAAAGGAATCCAAGAAGTGGCGCCCGAGCTCGATCTCGCGCCCATCAAGCTCAGGTATGCGGAGAAGTGGGCTTCCGGTTCCAATGAGACCCCTGGCC
CCTAG
Protein sequence
MEQYKLVPLILMSSANPRPPRGEGYFIPKHWPPLCRLSEILESDLIRSESLARNWDPSPGRADSVDEFASRLDSELEEEIDNFRFSEDEGDDSDTSTSSLGLEYPSQMPE
NYLGRLHRRYSILDDILLRLPKEGERVDNLPEGRVTLYLKMFEYGFRLPVHPLVQEFLVRTGLIPAQGSGRFGPPRSRPASGLLEVKRISRKPGKYYLCARKGARGILKG
PTSIKKWVEKWFFASSAWLARNESDLPFFNVLYRFENLVAIRPIPQLSEPTFNVLKFFKGRLKIGKQINTFITNKLLLSSRLLDYNPLLALLEVRRPNFELVQLPLNVFV
LDIRRFSYLVLNSQLFFLAAMVCGFSQDVRRKRPCTGHASKNKEAPSLLSLTFPPKSRWLWSDMKPLRRGRRLRPAPLRITRSLKFLPFRRFGGRLPPRSPRRRSARPTT
PRTRIGGTSDIEMKIKAHAAACQTSIMVKAELEGRGFLIVKEREASSAALEGELKEARVEAQAWRSTSEADKAELKSAQAEVARHLENLRGTHAMVKCLEKENFMADLKA
KLELEESKLRNEVLLEEAFHKHPDFDGFAKDFSDAGFRFLMKGIQEVAPELDLAPIKLRYAEKWASGSNETPGP
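
The protein sequence above should correspond to a translation of the CDS sequence. A minimal sketch of checking this is given below, assuming the standard nuclear genetic code applies (the page does not state which translation table was used). `translate` is a hypothetical helper, and the short input is simply the first 27 bases of the CDS listed on this page.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 27 bases of the CDS sequence shown on this page.
print(translate("ATGGAGCAGTATAAGCTTGTTCCTTTG"))
```

Passing the full CDS (with line breaks left in) to `translate` and comparing the result against the protein sequence, joined into one string, would confirm whether the two records agree.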