; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g10860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g10860
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionPlus3 domain-containing protein
Genome locationchr4:8118208..8124832
RNA-Seq ExpressionMoc04g10860
SyntenyMoc04g10860
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia]3.0e-9773.33Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM------
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+GRK+GTLVTD+LLLESGLL+YNP VRPIE SRPNS LAM      
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM------

Query:  -------------SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE
                      AAQ          GP+S  P PVIEL+S+G  SREKR R +
Subjt:  -------------SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE

XP_022155229.1 uncharacterized protein LOC111022371 [Momordica charantia]1.1e-7284.62Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEY LR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLR RD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKG
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+G
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKG

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia]2.2e-9285.94Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKL
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+GRK+GTLVTD+LLLESGLL+YNP VRPIE+SRPNS+L
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKL

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia]1.5e-9386.08Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK +FP+GRK+GTLVTDKLLLESGLL+YNP VRPIE+SRPNS+L M
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia]1.0e-13774.27Show/hide
Query:  SDSEEVLARRLESELEEIENFRFSDDGEDNDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPERWVTFYLKMFEYGLRLPLHPF
        S+ E  LARRLES+LEEIEN R SDDGED+D STSGQGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADNPPE WVT Y KMFEYGLRLPLHPF
Subjt:  SDSEEVLARRLESELEEIENFRFSDDGEDNDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPERWVTFYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFASGEWLAKD
         QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA L  VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV KWF+ASGEWLAKD
Subjt:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFASGEWLAKD

Query:  ESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM-------------------
        ESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+GRK+GTLVTD+LLLESGLL+YNP VRPIE+SRPNS+LAM                   
Subjt:  ESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM-------------------

Query:  SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE
         AAQ          GP+S  P  VIEL+S+G  SREKR R +
Subjt:  SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE

TrEMBL top hitse value%identityAlignment
A0A6J1CR42 uncharacterized protein LOC1110138261.4e-9773.33Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM------
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+GRK+GTLVTD+LLLESGLL+YNP VRPIE SRPNS LAM      
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM------

Query:  -------------SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE
                      AAQ          GP+S  P PVIEL+S+G  SREKR R +
Subjt:  -------------SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE

A0A6J1DPM7 uncharacterized protein LOC1110223715.5e-7384.62Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEY LR PLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLR RD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKG
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+G
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKG

A0A6J1DWD2 uncharacterized protein LOC1110246801.1e-9285.94Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKL
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+GRK+GTLVTD+LLLESGLL+YNP VRPIE+SRPNS+L
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKL

A0A6J1DWF1 uncharacterized protein LOC1110251087.4e-9486.08Show/hide
Query:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG
        MFEYGLRLPLHPF QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA LL VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM
        KWF+ASGEWLAKDESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK +FP+GRK+GTLVTDKLLLESGLL+YNP VRPIE+SRPNS+L M
Subjt:  KWFFASGEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM

A0A6J1DXS5 uncharacterized protein LOC1110255024.9e-13874.27Show/hide
Query:  SDSEEVLARRLESELEEIENFRFSDDGEDNDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPERWVTFYLKMFEYGLRLPLHPF
        S+ E  LARRLES+LEEIEN R SDDGED+D STSGQGLEYPSR+PEHYLG LRRGF IP +ILLR+PEEGERADNPPE WVT Y KMFEYGLRLPLHPF
Subjt:  SDSEEVLARRLESELEEIENFRFSDDGEDNDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPERWVTFYLKMFEYGLRLPLHPF

Query:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFASGEWLAKD
         QEFL RTGLAPAQVAPNGWGVIFALAILFWLRARD +EA L  VDQLL CFEAKRIAKKPGR+YMCARKGA GIVKGPTSIKGWV KWF+ASGEWLAKD
Subjt:  AQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFASGEWLAKD

Query:  ESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM-------------------
        ESGR FFDVP RF NLVSI+P+PEL QA+FDTLKYYK  FP+GRK+GTLVTD+LLLESGLL+YNP VRPIE+SRPNS+LAM                   
Subjt:  ESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAM-------------------

Query:  SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE
         AAQ          GP+S  P  VIEL+S+G  SREKR R +
Subjt:  SAAQDQ-------AGPSSAAPTPVIELDSTGERSREKRSRSE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G15420.1 myosin heavy chain-related2.0e-0631.39Show/hide
Query:  NIPNDILLRIPEEGERADNPPERWVTFYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGL-LSVDQLLGCFEAK
        N P +I L  P+  +R   PPE ++  Y   F   GL  PL  F  E+  R  +A +Q+          LAIL         E G+ +  D         
Subjt:  NIPNDILLRIPEEGERADNPPERWVTFYLKMF-EYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGL-LSVDQLLGCFEAK

Query:  RIAKKPGRYYMCARKGACGIVKGPTS-IKGWVGKWFF
        R+ + PG YY  A K    IV G  S I GW  ++FF
Subjt:  RIAKKPGRYYMCARKGACGIVKGPTS-IKGWVGKWFF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTATGCAAAGGTTTGCACAACGCTGTTTTACGAACCAAGCTCGAACCCGGTCCCAGGATCGACCTGAACTCAAGAGTGAACCTGCACAAGAGGGTAAACTCTCCGA
CGCTCAAGTTAGTGTAGGTCTCATCGATCCCTTTGTTCTAGAAGATTGGAATTTGCACAACGGTTCTGCACGAATCGAGTTCGAACCCGATCTCCGGTTCCGACCTAAAC
AACAGAGTGGACCTGCACAAGAGGGTAAGCACTCCGACGCTCAAGTAATGGATCCAACAGCACACACGACCGGCGGTTACATGTCTTTTTTCATGTCGGACCTGTCGGGT
TCCGAGCAGGTCGGACCCCAGTCAGGTCGAACTTTGGTGCCCATACTTCATCTTTTAAGGGGCAAACTCGGTCACCTCGACAGGGCCGAGGTTCGATCTCGACCTGGCAG
AGAAGTTTATTCGAATCTATTTTTGGACACGTGGCGACTTTTCATTTGTAGAAGGAATATGACCGTTGCGCAAGACGTTTCGACCTGTCAGGTTGTCGGAGCACTCAAGC
GTTTCGCCGTTGCGTATCCCGAGAAGATCCCAGCCTTCGATCTGAAACCAGCTCGAACCCTTTCTCTAGGCCAGTCATTTCTCCTTGCTCTTACTCTTCTTTCAAAAATG
GTAGTTTTCTTATCTTCCCCCTGCAGTAGTGATAGCCTGGGTAGTGTAGGTCGGACGATAAGTAGTTCGCCCCCCAAACCAAGTGACTCTGAGGAGGTCTTAGCTCGTAG
GTTAGAGTCCGAGCTTGAGGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAACGATACCTCTACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCG
AGCATTATCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGAGATGGGTCACT
TTTTATTTGAAAATGTTTGAGTACGGCCTCAGACTTCCTCTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACTGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTG
GGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGACGAGGCCGGGCTGCTAAGTGTAGATCAGCTCCTTGGGTGTTTTGAGGCTAAGAGGA
TAGCCAAAAAACCAGGTCGGTACTATATGTGTGCGAGGAAAGGCGCATGTGGCATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTAGGAAAGTGGTTCTTTGCCTCG
GGAGAGTGGCTGGCAAAGGATGAGTCAGGTCGTCCCTTCTTTGACGTGCCTGCTAGGTTTGAGAACCTAGTATCGATCAAGCCGATTCCCGAGCTCAATCAAGCCACTTT
TGACACCCTCAAGTACTACAAGTACAACTTCCCCAAGGGCAGGAAGATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTAACTACAACCCTCTAG
TTCGACCAATCGAAGCTTCAAGGCCAAACTCCAAGCTCGCCATGAGTGCAGCTCAGGACCAGGCGGGTCCATCTTCTGCAGCTCCAACTCCGGTGATTGAGTTGGATTCT
ACCGGGGAGCGATCCAGGGAGAAGCGCTCGAGGAGCGAAGATCTGGACTCTGACTACTCCGACCTAGATGAAGATGAGGTCCCAAGTCAGGAACCTACTGAGGTCGGCAC
CACTCAAGAAGGAGTCCCTTCTCAGCAGGACGGATCTCAAGAGGTCAACCTTCTGGGGTCCTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTATGCAAAGGTTTGCACAACGCTGTTTTACGAACCAAGCTCGAACCCGGTCCCAGGATCGACCTGAACTCAAGAGTGAACCTGCACAAGAGGGTAAACTCTCCGA
CGCTCAAGTTAGTGTAGGTCTCATCGATCCCTTTGTTCTAGAAGATTGGAATTTGCACAACGGTTCTGCACGAATCGAGTTCGAACCCGATCTCCGGTTCCGACCTAAAC
AACAGAGTGGACCTGCACAAGAGGGTAAGCACTCCGACGCTCAAGTAATGGATCCAACAGCACACACGACCGGCGGTTACATGTCTTTTTTCATGTCGGACCTGTCGGGT
TCCGAGCAGGTCGGACCCCAGTCAGGTCGAACTTTGGTGCCCATACTTCATCTTTTAAGGGGCAAACTCGGTCACCTCGACAGGGCCGAGGTTCGATCTCGACCTGGCAG
AGAAGTTTATTCGAATCTATTTTTGGACACGTGGCGACTTTTCATTTGTAGAAGGAATATGACCGTTGCGCAAGACGTTTCGACCTGTCAGGTTGTCGGAGCACTCAAGC
GTTTCGCCGTTGCGTATCCCGAGAAGATCCCAGCCTTCGATCTGAAACCAGCTCGAACCCTTTCTCTAGGCCAGTCATTTCTCCTTGCTCTTACTCTTCTTTCAAAAATG
GTAGTTTTCTTATCTTCCCCCTGCAGTAGTGATAGCCTGGGTAGTGTAGGTCGGACGATAAGTAGTTCGCCCCCCAAACCAAGTGACTCTGAGGAGGTCTTAGCTCGTAG
GTTAGAGTCCGAGCTTGAGGAAATAGAGAACTTTAGGTTCTCAGATGACGGAGAGGATAACGATACCTCTACCTCGGGCCAGGGTCTGGAGTACCCTTCTAGGATGCCCG
AGCATTATCTTGGACCCCTTCGTAGGGGGTTTAACATTCCGAATGACATCCTCCTTAGGATTCCGGAGGAAGGGGAAAGAGCTGACAATCCTCCAGAGAGATGGGTCACT
TTTTATTTGAAAATGTTTGAGTACGGCCTCAGACTTCCTCTTCATCCCTTTGCTCAGGAGTTCTTAAACCGAACTGGACTGGCTCCTGCTCAAGTGGCCCCCAATGGGTG
GGGTGTCATTTTTGCTTTAGCCATTCTTTTTTGGTTGCGAGCTCGGGATGAGGACGAGGCCGGGCTGCTAAGTGTAGATCAGCTCCTTGGGTGTTTTGAGGCTAAGAGGA
TAGCCAAAAAACCAGGTCGGTACTATATGTGTGCGAGGAAAGGCGCATGTGGCATAGTCAAGGGGCCGACCTCCATCAAAGGATGGGTAGGAAAGTGGTTCTTTGCCTCG
GGAGAGTGGCTGGCAAAGGATGAGTCAGGTCGTCCCTTCTTTGACGTGCCTGCTAGGTTTGAGAACCTAGTATCGATCAAGCCGATTCCCGAGCTCAATCAAGCCACTTT
TGACACCCTCAAGTACTACAAGTACAACTTCCCCAAGGGCAGGAAGATCGGAACCTTGGTCACCGACAAGCTTCTCTTGGAGTCGGGGCTTCTTAACTACAACCCTCTAG
TTCGACCAATCGAAGCTTCAAGGCCAAACTCCAAGCTCGCCATGAGTGCAGCTCAGGACCAGGCGGGTCCATCTTCTGCAGCTCCAACTCCGGTGATTGAGTTGGATTCT
ACCGGGGAGCGATCCAGGGAGAAGCGCTCGAGGAGCGAAGATCTGGACTCTGACTACTCCGACCTAGATGAAGATGAGGTCCCAAGTCAGGAACCTACTGAGGTCGGCAC
CACTCAAGAAGGAGTCCCTTCTCAGCAGGACGGATCTCAAGAGGTCAACCTTCTGGGGTCCTAG
Protein sequenceShow/hide protein sequence
MFMQRFAQRCFTNQARTRSQDRPELKSEPAQEGKLSDAQVSVGLIDPFVLEDWNLHNGSARIEFEPDLRFRPKQQSGPAQEGKHSDAQVMDPTAHTTGGYMSFFMSDLSG
SEQVGPQSGRTLVPILHLLRGKLGHLDRAEVRSRPGREVYSNLFLDTWRLFICRRNMTVAQDVSTCQVVGALKRFAVAYPEKIPAFDLKPARTLSLGQSFLLALTLLSKM
VVFLSSPCSSDSLGSVGRTISSSPPKPSDSEEVLARRLESELEEIENFRFSDDGEDNDTSTSGQGLEYPSRMPEHYLGPLRRGFNIPNDILLRIPEEGERADNPPERWVT
FYLKMFEYGLRLPLHPFAQEFLNRTGLAPAQVAPNGWGVIFALAILFWLRARDEDEAGLLSVDQLLGCFEAKRIAKKPGRYYMCARKGACGIVKGPTSIKGWVGKWFFAS
GEWLAKDESGRPFFDVPARFENLVSIKPIPELNQATFDTLKYYKYNFPKGRKIGTLVTDKLLLESGLLNYNPLVRPIEASRPNSKLAMSAAQDQAGPSSAAPTPVIELDS
TGERSREKRSRSEDLDSDYSDLDEDEVPSQEPTEVGTTQEGVPSQQDGSQEVNLLGS