CuGenDBv2

Moc03g04120 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g04120
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: myosin heavy chain-related
Genome location: chr3:3046958..3049227
RNA-Seq Expression: Moc03g04120
Synteny: Moc03g04120
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
XP_022144034.1 uncharacterized protein LOC111013826 [Momordica charantia] | 1.1e-94 | 70
Query:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG
        MFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL--------
        KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+ FP GRK+GTLVTD+LLL SGLLDYNP VRP+E SRPNS L        
Subjt:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL--------

Query:  ---DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLD
             SK RA     A   +P       P+++ P PVIEL+S+   SREKRP+D++E +D
Subjt:  ---DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLD

XP_022158122.1 uncharacterized protein LOC111024680 [Momordica charantia] | 6.4e-90 | 82.14
Query:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG
        MFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSELDHSK
        KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+ FP GRK+GTLVTD+LLL SGLLDYNP VRP+E+SRPNSEL   K
Subjt:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSELDHSK

XP_022158650.1 uncharacterized protein LOC111025108 [Momordica charantia] | 7.6e-91 | 84.38
Query:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG
        MFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL
        KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+HFP GRK+GTLVTDKLLL SGLLDYNP VRP+E+SRPNSEL
Subjt:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL

XP_022159063.1 uncharacterized protein LOC111025502, partial [Momordica charantia] | 7.5e-115 | 69.97
Query:  MPKHYLGPLRSGFSILDDIILRIPEERERADNPLEGWVTLYLKMFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLD
        +P+HYLG LR GF+I ++I+LR+PEE ERADNP EGWVTLY KMFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +L D
Subjt:  MPKHYLGPLRSGFSILDDIILRIPEERERADNPLEGWVTLYLKMFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLD

Query:  VEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGR
        V+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+ FP GR
Subjt:  VEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGR

Query:  KIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL-----------DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESE
        K+GTLVTD+LLL SGLLDYNP VRP+E+SRPNSEL             SK RA     A   +PA      P+++ P  VIEL+S+   SREKRP+D++E
Subjt:  KIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL-----------DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESE

Query:  VLD
         +D
Subjt:  VLD

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] | 1.2e-96 | 47.38
Query:  MCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNP
        MCARKG  GIVKGPTSIKGWVGKWFFASGEWLAK+ESGR FFDVP RF NLVSI+ I EL QA+FDTLK+YKDHFP  RKI TLVTDKLLL SGLLDYNP
Subjt:  MCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNP

Query:  LVRPVEASRPNSEL-----------DHSKHRA-----------TTPTVARPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLDVSPLCE----
        LVR +EASRPNSEL             SK RA            TPTV R  AQ  + PS+  PTPVIELD +   S EKR ++ESE LDVSPL E    
Subjt:  LVRPVEASRPNSEL-----------DHSKHRA-----------TTPTVARPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLDVSPLCE----

Query:  -------------------------------------------------------------VSRILASCLDYCLQRASKFVSDPGSMLQRTIDHATEAFI
                                                                     VSRI A+CLD  L+RASKFVSDPGS+LQRTID+  EAFI
Subjt:  -------------------------------------------------------------VSRILASCLDYCLQRASKFVSDPGSMLQRTIDHATEAFI

Query:  ASIHSTVMIKAELDGREILATRESANSSATLEAATTMKGELLKARSEVETLKAEVEAKALLLKKKRRK-------AQGLPPSCSRH-HQGVGEGEISAPE
        ASIH  VM+KAELDGRE LA +E  NS A LEAATT+KGELLKA+ EV+ L+AEV+AK  LLKK+  K       A  +     +   Q + E +  A  
Subjt:  ASIHSTVMIKAELDGREILATRESANSSATLEAATTMKGELLKARSEVETLKAEVEAKALLLKKKRRK-------AQGLPPSCSRH-HQGVGEGEISAPE

Query:  GEERPGS--------------------------------------------------------------------------------SPASQVEKYVTEL
         EE+  S                                                                                 P S V+KYV EL
Subjt:  GEERPGS--------------------------------------------------------------------------------SPASQVEKYVTEL

Query:  DSDYSDLEEDDAPSQEPNEVGTTQEEVPSQQGGS
        DSDYSD+EE+DAPSQEP EVGTTQEEVPSQQGGS
Subjt:  DSDYSDLEEDDAPSQEPNEVGTTQEEVPSQQGGS

TrEMBL top hits | e value | % identity
A0A6J1CR42 uncharacterized protein LOC111013826 | 5.5e-95 | 70
Query:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG
        MFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL--------
        KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+ FP GRK+GTLVTD+LLL SGLLDYNP VRP+E SRPNS L        
Subjt:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL--------

Query:  ---DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLD
             SK RA     A   +P       P+++ P PVIEL+S+   SREKRP+D++E +D
Subjt:  ---DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLD

A0A6J1DWD2 uncharacterized protein LOC111024680 | 3.1e-90 | 82.14
Query:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG
        MFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSELDHSK
        KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+ FP GRK+GTLVTD+LLL SGLLDYNP VRP+E+SRPNSEL   K
Subjt:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSELDHSK

A0A6J1DWF1 uncharacterized protein LOC111025108 | 3.7e-91 | 84.38
Query:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG
        MFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +LLDV+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV 
Subjt:  MFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVG

Query:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL
        KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+HFP GRK+GTLVTDKLLL SGLLDYNP VRP+E+SRPNSEL
Subjt:  KWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL

A0A6J1DXS5 uncharacterized protein LOC111025502 | 3.7e-115 | 69.97
Query:  MPKHYLGPLRSGFSILDDIILRIPEERERADNPLEGWVTLYLKMFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLD
        +P+HYLG LR GF+I ++I+LR+PEE ERADNP EGWVTLY KMFEY LRLPLHPF QEFL RTGL PAQVA NGWGVIFALAILFWLRAR+ +E +L D
Subjt:  MPKHYLGPLRSGFSILDDIILRIPEERERADNPLEGWVTLYLKMFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLD

Query:  VEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGR
        V+QLL CFEAKRIAKKPG++YMCARKGA GIVKGPTSIKGWV KWF+ASGEWLAK+ESGR FFDVP RF NLVSIRP+ ELTQASFDTLKYYK+ FP GR
Subjt:  VEQLLGCFEAKRIAKKPGQYYMCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGR

Query:  KIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL-----------DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESE
        K+GTLVTD+LLL SGLLDYNP VRP+E+SRPNSEL             SK RA     A   +PA      P+++ P  VIEL+S+   SREKRP+D++E
Subjt:  KIGTLVTDKLLLGSGLLDYNPLVRPVEASRPNSEL-----------DHSKHRATTPTVA---RPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESE

Query:  VLD
         +D
Subjt:  VLD

A0A6J1DZB3 uncharacterized protein LOC111025665 | 5.9e-97 | 47.38
Query:  MCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNP
        MCARKG  GIVKGPTSIKGWVGKWFFASGEWLAK+ESGR FFDVP RF NLVSI+ I EL QA+FDTLK+YKDHFP  RKI TLVTDKLLL SGLLDYNP
Subjt:  MCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYNP

Query:  LVRPVEASRPNSEL-----------DHSKHRA-----------TTPTVARPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLDVSPLCE----
        LVR +EASRPNSEL             SK RA            TPTV R  AQ  + PS+  PTPVIELD +   S EKR ++ESE LDVSPL E    
Subjt:  LVRPVEASRPNSEL-----------DHSKHRA-----------TTPTVARPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLDVSPLCE----

Query:  -------------------------------------------------------------VSRILASCLDYCLQRASKFVSDPGSMLQRTIDHATEAFI
                                                                     VSRI A+CLD  L+RASKFVSDPGS+LQRTID+  EAFI
Subjt:  -------------------------------------------------------------VSRILASCLDYCLQRASKFVSDPGSMLQRTIDHATEAFI

Query:  ASIHSTVMIKAELDGREILATRESANSSATLEAATTMKGELLKARSEVETLKAEVEAKALLLKKKRRK-------AQGLPPSCSRH-HQGVGEGEISAPE
        ASIH  VM+KAELDGRE LA +E  NS A LEAATT+KGELLKA+ EV+ L+AEV+AK  LLKK+  K       A  +     +   Q + E +  A  
Subjt:  ASIHSTVMIKAELDGREILATRESANSSATLEAATTMKGELLKARSEVETLKAEVEAKALLLKKKRRK-------AQGLPPSCSRH-HQGVGEGEISAPE

Query:  GEERPGS--------------------------------------------------------------------------------SPASQVEKYVTEL
         EE+  S                                                                                 P S V+KYV EL
Subjt:  GEERPGS--------------------------------------------------------------------------------SPASQVEKYVTEL

Query:  DSDYSDLEEDDAPSQEPNEVGTTQEEVPSQQGGS
        DSDYSD+EE+DAPSQEP EVGTTQEEVPSQQGGS
Subjt:  DSDYSDLEEDDAPSQEPNEVGTTQEEVPSQQGGS

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G32010.1 myosin heavy chain-related | 4.0e-05 | 26.72
Query:  IILRIPEERERADNPLEGWVTLYLKMF-EYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKP
        + +RIP + +R  +  EG++ L+   F E  LR P+  F   F     +  +Q+       I   A L  L AR       L VE +       ++  K 
Subjt:  IILRIPEERERADNPLEGWVTLYLKMF-EYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEAKRIAKKP

Query:  GQYYMCARKGASGIVKGPTSIKGWVGKWFFA
        GQ+Y+ + +G   +  GP+  + W+G +F+A
Subjt:  GQYYMCARKGASGIVKGPTSIKGWVGKWFFA


Sequences
CDS sequence
ATGCCTAAGCACTATCTTGGACCCCTCCGTAGCGGGTTTAGTATTCTAGATGATATCATCCTTAGAATTCCGGAGGAAAGGGAAAGAGCTGACAATCCTCTAGAGGGATG
GGTCACTCTCTATTTAAAAATGTTTGAGTACGACCTCAGACTTCCCCTTCACCCTTTTGCCCAGGAGTTCCTGAACCGAACTGGACTGACTCCTGCTCAAGTGGCCCGCA
ACGGATGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGAGAAGTGGACGAGACCGATCTGCTAGATGTTGAACAGCTTCTAGGGTGCTTTGAAGCT
AAAAGGATAGCTAAGAAGCCAGGTCAGTACTACATGTGCGCAAGGAAGGGCGCGAGTGGTATAGTCAAAGGGCCGACCTCCATCAAAGGATGGGTGGGGAAATGGTTCTT
TGCCTCTGGAGAGTGGCTGGCAAAGAACGAGTCAGGTCGTCCCTTCTTTGACGTTCCTGTTAGATTTGAGAATTTAGTGTCGATCAGACCAATTCTCGAGCTCACTCAAG
CCTCCTTCGACACCCTCAAGTATTACAAGGATCACTTTCCAATGGGCAGGAAGATCGGAACCTTGGTGACTGATAAGCTGCTTCTCGGATCTGGATTGTTAGATTACAAC
CCCTTGGTACGCCCAGTCGAAGCCTCAAGACCAAACTCTGAGCTTGACCATTCAAAGCACAGAGCCACAACTCCCACTGTCGCTCGACCTGCGGCTCAAGACAAAACTGA
GCCCTCTGCTGACGCCCCAACTCCAGTGATCGAACTAGATTCTACCGAGGAACACTCCAGAGAGAAGCGCCCAAAGGATGAGTCCGAGGTGCTGGATGTTTCTCCTCTGT
GCGAGGTTTCCCGCATCTTGGCTTCATGCTTGGACTACTGTCTTCAAAGGGCGTCTAAGTTTGTAAGTGACCCTGGGTCCATGCTGCAAAGGACCATTGACCACGCCACC
GAGGCGTTCATTGCTTCCATTCACTCGACAGTTATGATAAAGGCTGAGCTGGATGGAAGGGAGATCTTGGCAACCAGGGAGAGTGCAAACTCCTCTGCTACCCTAGAAGC
TGCCACCACGATGAAGGGCGAACTACTGAAAGCTCGCTCCGAAGTGGAGACTCTAAAGGCCGAGGTGGAGGCCAAGGCTCTACTGCTGAAAAAAAAAAGAAGAAAAGCAC
AAGGCCTACCTCCGAGCTGCTCACGCCATCACCAAGGGGTTGGAGAAGGAGAAATTTCAGCTCCTGAAGGAGAAGAACGACCTGGCTCAAGTCCTGCATCCCAGGTGGAA
AAGTACGTCACAGAACTAGACTCTGACTATTCCGACTTGGAAGAAGACGATGCCCCTAGTCAGGAGCCCAATGAGGTCGGCACTACCCAAGAAGAAGTCCCTTCGCAACA
GGGCGGATCCCAGGAGGTCAACCTTCTGGGCTCCCAAGGTGTGTTGTCTTCTCACCTCGGGAGTGGCTGA
mRNA sequence
ATGCCTAAGCACTATCTTGGACCCCTCCGTAGCGGGTTTAGTATTCTAGATGATATCATCCTTAGAATTCCGGAGGAAAGGGAAAGAGCTGACAATCCTCTAGAGGGATG
GGTCACTCTCTATTTAAAAATGTTTGAGTACGACCTCAGACTTCCCCTTCACCCTTTTGCCCAGGAGTTCCTGAACCGAACTGGACTGACTCCTGCTCAAGTGGCCCGCA
ACGGATGGGGTGTCATTTTTGCTTTGGCCATCCTTTTTTGGTTACGAGCTCGAGAAGTGGACGAGACCGATCTGCTAGATGTTGAACAGCTTCTAGGGTGCTTTGAAGCT
AAAAGGATAGCTAAGAAGCCAGGTCAGTACTACATGTGCGCAAGGAAGGGCGCGAGTGGTATAGTCAAAGGGCCGACCTCCATCAAAGGATGGGTGGGGAAATGGTTCTT
TGCCTCTGGAGAGTGGCTGGCAAAGAACGAGTCAGGTCGTCCCTTCTTTGACGTTCCTGTTAGATTTGAGAATTTAGTGTCGATCAGACCAATTCTCGAGCTCACTCAAG
CCTCCTTCGACACCCTCAAGTATTACAAGGATCACTTTCCAATGGGCAGGAAGATCGGAACCTTGGTGACTGATAAGCTGCTTCTCGGATCTGGATTGTTAGATTACAAC
CCCTTGGTACGCCCAGTCGAAGCCTCAAGACCAAACTCTGAGCTTGACCATTCAAAGCACAGAGCCACAACTCCCACTGTCGCTCGACCTGCGGCTCAAGACAAAACTGA
GCCCTCTGCTGACGCCCCAACTCCAGTGATCGAACTAGATTCTACCGAGGAACACTCCAGAGAGAAGCGCCCAAAGGATGAGTCCGAGGTGCTGGATGTTTCTCCTCTGT
GCGAGGTTTCCCGCATCTTGGCTTCATGCTTGGACTACTGTCTTCAAAGGGCGTCTAAGTTTGTAAGTGACCCTGGGTCCATGCTGCAAAGGACCATTGACCACGCCACC
GAGGCGTTCATTGCTTCCATTCACTCGACAGTTATGATAAAGGCTGAGCTGGATGGAAGGGAGATCTTGGCAACCAGGGAGAGTGCAAACTCCTCTGCTACCCTAGAAGC
TGCCACCACGATGAAGGGCGAACTACTGAAAGCTCGCTCCGAAGTGGAGACTCTAAAGGCCGAGGTGGAGGCCAAGGCTCTACTGCTGAAAAAAAAAAGAAGAAAAGCAC
AAGGCCTACCTCCGAGCTGCTCACGCCATCACCAAGGGGTTGGAGAAGGAGAAATTTCAGCTCCTGAAGGAGAAGAACGACCTGGCTCAAGTCCTGCATCCCAGGTGGAA
AAGTACGTCACAGAACTAGACTCTGACTATTCCGACTTGGAAGAAGACGATGCCCCTAGTCAGGAGCCCAATGAGGTCGGCACTACCCAAGAAGAAGTCCCTTCGCAACA
GGGCGGATCCCAGGAGGTCAACCTTCTGGGCTCCCAAGGTGTGTTGTCTTCTCACCTCGGGAGTGGCTGA
Protein sequence
MPKHYLGPLRSGFSILDDIILRIPEERERADNPLEGWVTLYLKMFEYDLRLPLHPFAQEFLNRTGLTPAQVARNGWGVIFALAILFWLRAREVDETDLLDVEQLLGCFEA
KRIAKKPGQYYMCARKGASGIVKGPTSIKGWVGKWFFASGEWLAKNESGRPFFDVPVRFENLVSIRPILELTQASFDTLKYYKDHFPMGRKIGTLVTDKLLLGSGLLDYN
PLVRPVEASRPNSELDHSKHRATTPTVARPAAQDKTEPSADAPTPVIELDSTEEHSREKRPKDESEVLDVSPLCEVSRILASCLDYCLQRASKFVSDPGSMLQRTIDHAT
EAFIASIHSTVMIKAELDGREILATRESANSSATLEAATTMKGELLKARSEVETLKAEVEAKALLLKKKRRKAQGLPPSCSRHHQGVGEGEISAPEGEERPGSSPASQVE
KYVTELDSDYSDLEEDDAPSQEPNEVGTTQEEVPSQQGGSQEVNLLGSQGVLSSHLGSG