CuGenDBv2

MC09g1514 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC09g1514
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC09:20965659..20968553
RNA-Seq Expression: MC09g1514
Synteny: MC09g1514
Gene Ontology terms: NA
InterPro domains: NA
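
As a quick illustration of how the genome location string can be read, the following Python sketch splits it into chromosome, start, and end, then computes the span. The function name `parse_location` is hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)

chrom, start, end = parse_location("MC09:20965659..20968553")
# Assumption: 1-based inclusive coordinates, so the span is end - start + 1.
span = end - start + 1
print(chrom, start, end, span)
```

The span covers the whole gene locus, so it is expected to be longer than the mRNA whenever introns are present.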


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6601286.1 hypothetical protein SDJN03_06519, partial [Cucurbita argyrosperma subsp. sororia] (e-value 1.46e-113, identity 90.56%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FS HEEMR  VPGPISDLRDQ+VCPKPRRLSN KVTV GHAD+SLRWNL HQVEQIDMA GPDLLDFLLT+ GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIA SPS QLSPSTASRKGGRVRA+FGNKP VRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

XP_022150691.1 uncharacterized protein LOC111018762 [Momordica charantia] (e-value 1.88e-130, identity 100%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

XP_022957437.1 uncharacterized protein LOC111458833 [Cucurbita moschata] (e-value 9.25e-115, identity 90.56%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FS HEEMR  VPGPISDLRDQ+VCPKPRRLSN KVTV GHAD+SLRWNL HQVEQIDMA GPDLLDFLLT+ GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIA SPS QLSPSTASRKGGRVRA+FGNKP VRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

XP_022986983.1 uncharacterized protein LOC111484540 [Cucurbita maxima] (e-value 1.08e-113, identity 90%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FS HEEMR  VPGPISDLRDQ+VCPKPRRLSN KVTV GHAD+SLRWNL HQVEQIDMA GPDLLDFLLT+ GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIP APIA SPS QLSPSTASRKGGRVRA+FGNKP VRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

XP_023517111.1 uncharacterized protein LOC111780963 [Cucurbita pepo subsp. pepo] (e-value 1.53e-113, identity 89.44%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FS HEEMR  VPGP SDLRDQ+VCPKPRRLSN KVTV GHAD+SLRWNL HQVEQIDM  GPDLLDFLLT+ GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIA SPS QLSPSTASRKGGRVRA+FGNKP VRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3BG95 uncharacterized protein LOC103489287 (e-value 9.51e-109, identity 87.22%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FSGHEEMRTSVP PISD RDQ+VCPKPRRL     TVN H+D SLRWNL HQVE IDMAAGPDLLDFLLTK GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARF +EKFIPF PIA SPS QLSPST+SRKGGRVRA+FGNKP VRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

A0A5D3CB96 Uncharacterized protein (e-value 9.51e-109, identity 87.22%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FSGHEEMRTSVP PISD RDQ+VCPKPRRL     TVN H+D SLRWNL HQVE IDMAAGPDLLDFLLTK GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARF +EKFIPF PIA SPS QLSPST+SRKGGRVRA+FGNKP VRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

A0A6J1DC97 uncharacterized protein LOC111018762 (e-value 9.11e-131, identity 100%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

A0A6J1GZ47 uncharacterized protein LOC111458833 (e-value 4.48e-115, identity 90.56%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FS HEEMR  VPGPISDLRDQ+VCPKPRRLSN KVTV GHAD+SLRWNL HQVEQIDMA GPDLLDFLLT+ GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIPFAPIA SPS QLSPSTASRKGGRVRA+FGNKP VRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

A0A6J1JI50 uncharacterized protein LOC111484540 (e-value 5.23e-114, identity 90%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHCAILSN FS HEEMR  VPGPISDLRDQ+VCPKPRRLSN KVTV GHAD+SLRWNL HQVEQIDMA GPDLLDFLLT+ GCSVDQSFTQLASSPPFL
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        CGSPPSRVANPLIQDARFGDEKFIP APIA SPS QLSPSTASRKGGRVRA+FGNKP VRIEGFDC DRDRQNCSIPAFA
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

SwissProt top hits: No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G13390.1 unknown protein (e-value 8.9e-32, identity 43.24%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTK-NGCSVDQSFTQLASSPP-
        MN C I  N F   EEMR +    +SD RD ++CPKPRR+  L    N H+  SLRW L HQ+E  +  +G ++LDF+LTK  G   +Q  T+   +PP 
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTK-NGCSVDQSFTQLASSPP-

Query:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIAASP-SVQLSPSTASRKGGRVRA--NFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        F  GSPPSRV+NPL +D+ F +E  +  +P  ++P + +  P ++ R G  V A  +FGN P VR+ GFDC DR   N SI   A
Subjt:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIAASP-SVQLSPSTASRKGGRVRA--NFGNKPAVRIEGFDCLDRDRQNCSIPAFA

AT1G13390.2 unknown protein (e-value 8.9e-32, identity 43.24%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTK-NGCSVDQSFTQLASSPP-
        MN C I  N F   EEMR +    +SD RD ++CPKPRR+  L    N H+  SLRW L HQ+E  +  +G ++LDF+LTK  G   +Q  T+   +PP 
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTK-NGCSVDQSFTQLASSPP-

Query:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIAASP-SVQLSPSTASRKGGRVRA--NFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        F  GSPPSRV+NPL +D+ F +E  +  +P  ++P + +  P ++ R G  V A  +FGN P VR+ GFDC DR   N SI   A
Subjt:  FLCGSPPSRVANPLIQDARFGDEKFIPFAPIAASP-SVQLSPSTASRKGGRVRA--NFGNKPAVRIEGFDCLDRDRQNCSIPAFA

AT1G68490.1 unknown protein (e-value 2.0e-39, identity 48.13%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSP-PF
        MNH A+  N F+   ++R+S    +   +  +VCPKPRR+       + H   SLR    HQ+E  +  A  D+LD +LTK+G   +Q   Q+  SP PF
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSP-PF

Query:  LCGSPPSRVANPLIQDARFGDE-----KFIPFAPIAASPSVQLSPSTASRKGG-RVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
        LCGSPPSRVANPL QDARF DE       IP  P    P      S++ RKGG  VR NFGN P VR+EGFDCLDRD +NCSIPA A
Subjt:  LCGSPPSRVANPLIQDARFGDE-----KFIPFAPIAASPSVQLSPSTASRKGG-RVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA

AT3G02555.1 unknown protein (e-value 8.1e-33, identity 48.07%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL
        MNHC++  N F   EE R  VP   S   D +VCPKPRR +N+      H   S   ++C      D  AG DLLD    K   S        + SPPF 
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFL

Query:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPA-VRIEGFDCLDRDRQNCSIPAFA
         GSPPSR ANPL QDARFGDEK    +P + SP   L PS +  K G  R  FG KPA VR+EGFDCL+RDR N SIPA A
Subjt:  CGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPA-VRIEGFDCLDRDRQNCSIPAFA

AT5G16110.1 unknown protein (e-value 5.8e-31, identity 44.85%)
Query:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQI-DMAAGPDLLDFLLTK--NGCSVDQSFTQLASSP
        MNHC +  N F   EEM         D +D +VCPKPRR+  L      +    LR ++      + D  AG +LL+ +  K  NG ++ Q    L+SSP
Subjt:  MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQI-DMAAGPDLLDFLLTK--NGCSVDQSFTQLASSP

Query:  PFLCGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQ----------LSPSTASRKGGRVRANFG-NKPAVRIEGFDCLDRDRQNCSIPAFA
        P+  GSPPSR ANPL QDARF DEK  P +P   SP +Q           S S++S   G VR  FG N PAVR+EGFDCL+RDRQN SIPA A
Subjt:  PFLCGSPPSRVANPLIQDARFGDEKFIPFAPIAASPSVQ----------LSPSTASRKGGRVRANFG-NKPAVRIEGFDCLDRDRQNCSIPAFA


Sequences
CDS sequence
ATGAATCACTGCGCCATTCTGTCAAACACCTTCTCGGGCCACGAGGAGATGAGAACCTCTGTTCCGGGCCCCATTTCTGACCTCAGAGATCAGATAGTTTGCCCTAAACC
TCGACGTTTAAGCAATCTAAAGGTCACCGTCAACGGCCACGCCGATGCCTCACTCCGGTGGAATCTATGTCACCAAGTTGAGCAAATTGACATGGCAGCGGGACCGGATC
TGCTGGACTTCCTCCTCACAAAAAATGGTTGCAGCGTGGACCAATCCTTCACGCAGTTGGCTTCGTCGCCCCCTTTTTTATGTGGGTCTCCGCCGAGCAGAGTAGCCAAC
CCATTGATTCAGGACGCCCGATTCGGGGACGAAAAATTCATTCCCTTTGCACCGATTGCAGCTTCACCGTCGGTTCAGTTGTCACCCTCCACAGCCTCCAGGAAAGGAGG
CCGTGTGAGGGCGAATTTTGGGAACAAACCAGCGGTGAGGATTGAGGGTTTCGATTGCCTCGACAGGGATAGGCAAAATTGCAGCATCCCTGCCTTCGCCTAG
mRNA sequence
CCTACGTATATATTGCATCATATTTTCTTTAATAATATCTAATCTCCACCAAAAAGCCTAGCCATAGCCAAAGGTGAGGAAGCGTAAATGGTGATTATACAATCATTGAT
CTCGTTATTTTCATCTTCCGAACCAACTCATAGTTAATACTTTGCAGCTTGAAAAAGCGTGTAAACACGTGACGACGTCGTTGAGATAGCCGACTAACCACATGAGGCGA
TACACGTGTCGGAATCAGGTAGCTTGTGTGGTAATTGAGTACAACCGTCCATTTAATTCAGTGATGAGTGGGGCCCCGCTGGAGTTTGGGGTTTCCGTCTAAAATTTGAG
ATATTTACACATCTACAGTAAAATAATAATAATTAAAAAGTTGGGAGTTAAGGCGCCGAAACCGGTGAAACTTTTCTGTCGCCCTCCGCTGGCATTGGCAGTAGCACCAC
CGGCACCGCCAACGCTCCACTTACCTAACAAATACGAAATTTAAAATAAAAAAAATATATATTTTTTAAAAATGGAAATATAAATTAAATTATAAACCCGATGCGAAGAC
TCACTCGGCCCGGGTTGGGAACCGTTGCTGGTCTTCCTCTGCTTCTTCTTCCTCCTCCGCCTTATCTCTCTACTCTCTTCTCTCTCTTCCTTTTTCTGCTATTCTGATTT
CTCTGTCTCCCTCTTTGTAAACCGCGCTTTATTCACTTTTCCCATCTCTCTGTATCTGTGCTCAACAGTTATCCGTTTATGCCCATCCACACGGAGCTTTAGCTTTTAAG
TTTTTCTTCTTTGATTTCTGTATATAATACTTCCACAGCCTTTTCCGGACCTCTCACAGCCAACTGCTGCCTCTTGATTTTCAGGTTTATTTGTCCCTTGATATAACCGA
AGACGCAAACATGAATCACTGCGCCATTCTGTCAAACACCTTCTCGGGCCACGAGGAGATGAGAACCTCTGTTCCGGGCCCCATTTCTGACCTCAGAGATCAGATAGTTT
GCCCTAAACCTCGACGTTTAAGCAATCTAAAGGTCACCGTCAACGGCCACGCCGATGCCTCACTCCGGTGGAATCTATGTCACCAAGTTGAGCAAATTGACATGGCAGCG
GGACCGGATCTGCTGGACTTCCTCCTCACAAAAAATGGTTGCAGCGTGGACCAATCCTTCACGCAGTTGGCTTCGTCGCCCCCTTTTTTATGTGGGTCTCCGCCGAGCAG
AGTAGCCAACCCATTGATTCAGGACGCCCGATTCGGGGACGAAAAATTCATTCCCTTTGCACCGATTGCAGCTTCACCGTCGGTTCAGTTGTCACCCTCCACAGCCTCCA
GGAAAGGAGGCCGTGTGAGGGCGAATTTTGGGAACAAACCAGCGGTGAGGATTGAGGGTTTCGATTGCCTCGACAGGGATAGGCAAAATTGCAGCATCCCTGCCTTCGCC
TAGAAACCCCATCTCTAAAATCAATTCCATACAATACAAGACATATTATACTTCATCTCGGAGATCTCAAGAGAAGAAAAGGATACGATTTGCGTTCGTGTAAATACGTT
TGAAGGGTTCTCTGTAAATGTAAATAAAAAAAAAAAAAAAAAATCAACCCCATGTATACTAAAACTGTCCACTTGTAAGGCTTTTTTTGAGTCGGGGGCAAACGATGCTA
TGGATGTCAAGCTCTTGCATTATATTTTTGTAATTAGGCGTCTGTAAGAAGAAAGTTGAGGTTGAGGACAATTTTCATTGTGGTTCGTTTTGGGATCATTTGAAGTTAAA
GAGTGAGAAAGAGAGGCAATCTGTACATATTGCAGAATTTGAAATGGAGTCTGTAAGAAAGAAGTCATCAACATGTGAATAATGTAGAACATTTTCTGATTGTGAATTTA
GTTCTGTCTCTGGATTTTTGTTTGATTTTGGTTCAGAAAGTGATTCTGCGAGATCAAGATGTAGTGTGTTTGCTTAGGCATAACGCT
Protein sequence
MNHCAILSNTFSGHEEMRTSVPGPISDLRDQIVCPKPRRLSNLKVTVNGHADASLRWNLCHQVEQIDMAAGPDLLDFLLTKNGCSVDQSFTQLASSPPFLCGSPPSRVAN
PLIQDARFGDEKFIPFAPIAASPSVQLSPSTASRKGGRVRANFGNKPAVRIEGFDCLDRDRQNCSIPAFA
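
The protein sequence should correspond to the translation of the CDS sequence listed in this record. A minimal Python sketch of that check follows, assuming the standard nuclear genetic code (the record does not name a translation table). The helper `translate` is hypothetical, and only the first 30 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, amino acids listed in TCAG codon order ('*' = stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS sequence in this record.
cds_prefix = "ATGAATCACTGCGCCATTCTGTCAAACACC"
protein_prefix = translate(cds_prefix)
print(protein_prefix)
```

Passing the full CDS (with line breaks removed) to `translate` and comparing the result with the full protein sequence would extend the same check to the whole record.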