CuGenDBv2

MC07g0047 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC07g0047
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Unknown protein
Genome location: MC07:573385..574107
RNA-Seq Expression: MC07g0047
Synteny: MC07g0047
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
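
The genome location field follows a `sequence:start..end` pattern. The snippet below is a minimal sketch of how such a string could be parsed and its span computed. It assumes the coordinates are 1-based and inclusive, which this record does not state explicitly; the function names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("MC07:573385..574107")
length = span_length(start, end)
print(seqid, start, end, length)  # MC07 573385 574107 723
```

Under that assumption the span is 723 bp, which is three times the 241 residues in the protein sequence of this record.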


Homology
GenBank top hits (e value, % identity, alignment)
KAG6592803.1 hypothetical protein SDJN03_12279, partial [Cucurbita argyrosperma subsp. sororia] | e value: 1.42e-101 | % identity: 65.76
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSSSSSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+ +
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG
          CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  KGK+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S        SG
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         VE AE      + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

KAG7020289.1 hypothetical protein SDJN02_16972, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 8.20e-101 | % identity: 64.98
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSSSSSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+ +
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPP----------PELPPPVIEDGSLAPVVSSNDVTPATSG
          CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  KGK+K+K+GRRS++ P           PE+ P  I+DGS+AP  S        SG
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPP----------PELPPPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         VE AE      + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022953880.1 uncharacterized protein LOC111456285 [Cucurbita moschata] | e value: 3.18e-96 | % identity: 63.24
Query:  MKIKNKGKVHPSPSSSSSSSS-DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MKIKNKGKVHPSPSSSSSSSS DGDF +V NYLP A+LAL+S+L  +DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F
Subjt:  MKIKNKGKVHPSPSSSSSSSS-DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT
          TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P     P LPP       P+ E  S AP  S +D   A 
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT

Query:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         G  E + + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP+
Subjt:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022960393.1 uncharacterized protein LOC111461129 [Cucurbita moschata] | e value: 9.50e-100 | % identity: 64.98
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSS SSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+ +
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG
          CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  K K+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S        SG
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         VE AE      + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022991379.1 uncharacterized protein LOC111488030 [Cucurbita maxima] | e value: 5.05e-96 | % identity: 63.86
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSS   SSDGDF +V NYLP AILAL+SVL A+DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F 
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV
         TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P        P  LP P+ E  S AP  S +D   A  G  
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV

Query:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        E + + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP 
Subjt:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

TrEMBL top hits (e value, % identity, alignment)
A0A1S3CC21 uncharacterized protein LOC103499074 | e value: 2.66e-91 | % identity: 62.65
Query:  MKIKNKGKVHPSPSSSSSSSS-DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MKIKNKGKVHPSPSSSSSSSS DG+F DVLNYLP AI ALVSVL  +DREVLAFMMRRSMETSSP SS S KK SK+ SKKS    A     CVHAPP+ 
Subjt:  MKIKNKGKVHPSPSSSSSSSS-DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD------APP----PELPPPVIEDGSLAPVVSSNDVTPATS
          TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGK+K+K+GRRS D      +PP    PE  P  I++GS A   +S        
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD------APP----PELPPPVIEDGSLAPVVSSNDVTPATS

Query:  GNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
            EA ++++E      A +VIPSPP + HKGLA KVWPDVLGLFNSRLWSLW PN
Subjt:  GNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A5D3DM01 Uncharacterized protein | e value: 1.63e-90 | % identity: 61.78
Query:  MKIKNKGKVHPSPSSSSSSSS---DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPP
        MKIKNKGKVHPSPSSSSSSSS   DG+F DVLNYLP AI ALVSVL  +DREVLAFMMRRSMETSSP SS S K+ SK+ SKKS    A     CVHAPP
Subjt:  MKIKNKGKVHPSPSSSSSSSS---DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPP

Query:  AFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD------APP----PELPPPVIEDGSLAPVVSSNDVTPA
        +   TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGK+K+K+GRRS D      +PP    PE  P  I++GS A   +S      
Subjt:  AFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD------APP----PELPPPVIEDGSLAPVVSSNDVTPA

Query:  TSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
              EA ++++E      A +VIPSPP + HKGLA KVWPDVLGLFNSRLWSLW PN
Subjt:  TSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1GQW4 uncharacterized protein LOC111456285 | e value: 1.54e-96 | % identity: 63.24
Query:  MKIKNKGKVHPSPSSSSSSSS-DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MKIKNKGKVHPSPSSSSSSSS DGDF +V NYLP A+LAL+S+L  +DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F
Subjt:  MKIKNKGKVHPSPSSSSSSSS-DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT
          TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P     P LPP       P+ E  S AP  S +D   A 
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT

Query:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         G  E + + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP+
Subjt:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1H8S8 uncharacterized protein LOC111461129 | e value: 4.60e-100 | % identity: 64.98
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSS SSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+ +
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG
          CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  K K+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S        SG
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         VE AE      + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAE------KAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1JUQ5 uncharacterized protein LOC111488030 | e value: 2.45e-96 | % identity: 63.86
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSS   SSDGDF +V NYLP AILAL+SVL A+DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F 
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV
         TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P        P  LP P+ E  S AP  S +D   A  G  
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV

Query:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        E + + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP 
Subjt:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G12020.1 unknown protein | e value: 4.6e-15 | % identity: 31.58
Query:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETS--SPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        K+  KG VHPSP    S+        +L  LP AI +L +VL  EDREVLA+++  +  +   +P S  ++ KA KK+   + +             P F
Subjt:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETS--SPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAA--------KGKKKEKVGRRSTDAPPPELPPPVIED-GSLAPVVSSNDVTP---A
        +  CF CY  YW+RWDSSP+ ++I++ I+AF++ L   +N  KN           GK    +   S      E+P  + E   +  P  SS+++T     
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAA--------KGKKKEKVGRRSTDAPPPELPPPVIED-GSLAPVVSSNDVTP---A

Query:  TSGNVEEAE
         SG +E  E
Subjt:  TSGNVEEAE

AT1G24270.1 unknown protein | e value: 5.4e-24 | % identity: 43.05
Query:  MKIKNKGKVHPSPSSSSSSSSDG-DFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MK+  KGKVHPSP   SSSSS+G D L V   L +AIL LVSVL AED EVLA+++ RS+ T++  S + ++                      H  P  
Subjt:  MKIKNKGKVHPSPSSSSSSSSDG-DFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKK
        +  CFDCY  YW +WDSS N ++INQ IEAF++ L   E    + +K  KK
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKK

AT1G62422.1 unknown protein | e value: 2.5e-13 | % identity: 32.32
Query:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFNY
        K+  KG VHPSP    +  +D  FL +   LP AIL+LV+ L  EDREVLA+++          S +S + +  K +K+             H  P F  
Subjt:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFNY

Query:  TCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEKAK
         CF CY  YW+RWD+SP  ++I++ I+A+++ L   E   K   + K+  K   R        L     E GS +   +  D     +   EEAEK K
Subjt:  TCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEKAK

AT5G13090.1 unknown protein | e value: 9.8e-42 | % identity: 43.53
Query:  MKIKNKGKVHPSP-----SSSSSSSS------DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGG
        MK+K KGKV+PSP     SSSSSSSS      D D L VL  LPA IL LVSVL +E+REVLA+++ R    S   +S S+ K  KKS+K S        
Subjt:  MKIKNKGKVHPSP-----SSSSSSSS------DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGG

Query:  GECVHAPPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAK-GKKKEKVGRRSTDA-PPPELPPPVIEDGSLAPVVS-SNDVTP
            H PP F+  CFDCY  YW RWDSSPN ++I++ IEAF+       +  ++ +K GKKKEK GRR TD+   P L      D    PVV   N+ T 
Subjt:  GECVHAPPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAK-GKKKEKVGRRSTDA-PPPELPPPVIEDGSLAPVVS-SNDVTP

Query:  ATSGNV------EEAEKA---------------KDETAALVIPSPPR--SNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        + S +V       EAE A               ++E+  +V P+     + HKGLA KV PDVLGLF+S  W LW PN
Subjt:  ATSGNV------EEAEKA---------------KDETAALVIPSPPR--SNHKGLAGKVWPDVLGLFNSRLWSLWGPN


Sequences
CDS sequence
ATGAAGATAAAGAACAAGGGCAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCCGATGGGGATTTTTTGGATGTCTTGAACTATCTGCCGGCGGCGATTTT
GGCTCTGGTTTCCGTTCTGGGGGCGGAGGATCGGGAGGTTTTGGCCTTCATGATGAGGAGGTCGATGGAGACTTCATCGCCGCCGTCTTCTGAGTCGGAAAAGAAAGCTT
CCAAGAAGTCCTCCAAGAAATCCGGCGCTGCACTCGCCACCGGTGGCGGCGAGTGTGTTCACGCGCCGCCGGCGTTTAACTACACTTGCTTCGACTGTTACATGAGGTAC
TGGCTGCGGTGGGACTCGTCGCCCAACGGCAAGATTATTAATCAGGCGATTGAGGCTTTCGATGAACAGCTGGCCACCGGCGAAAATCCCGGCAAGAACGCCGCGAAAGG
GAAGAAAAAGGAGAAGGTCGGCCGGCGGTCGACGGATGCTCCGCCGCCGGAACTTCCCCCTCCGGTGATTGAGGATGGCTCTCTCGCTCCGGTTGTTTCAAGCAATGACG
TGACTCCGGCGACGAGCGGCAACGTGGAGGAGGCGGAGAAGGCGAAGGATGAGACGGCGGCGTTGGTTATTCCGTCGCCGCCGCGGAGCAACCACAAGGGTTTGGCCGGG
AAGGTATGGCCGGACGTGTTAGGGTTATTCAATTCTCGTTTGTGGAGTCTTTGGGGTCCAAAT
mRNA sequence
ATGAAGATAAAGAACAAGGGCAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCCGATGGGGATTTTTTGGATGTCTTGAACTATCTGCCGGCGGCGATTTT
GGCTCTGGTTTCCGTTCTGGGGGCGGAGGATCGGGAGGTTTTGGCCTTCATGATGAGGAGGTCGATGGAGACTTCATCGCCGCCGTCTTCTGAGTCGGAAAAGAAAGCTT
CCAAGAAGTCCTCCAAGAAATCCGGCGCTGCACTCGCCACCGGTGGCGGCGAGTGTGTTCACGCGCCGCCGGCGTTTAACTACACTTGCTTCGACTGTTACATGAGGTAC
TGGCTGCGGTGGGACTCGTCGCCCAACGGCAAGATTATTAATCAGGCGATTGAGGCTTTCGATGAACAGCTGGCCACCGGCGAAAATCCCGGCAAGAACGCCGCGAAAGG
GAAGAAAAAGGAGAAGGTCGGCCGGCGGTCGACGGATGCTCCGCCGCCGGAACTTCCCCCTCCGGTGATTGAGGATGGCTCTCTCGCTCCGGTTGTTTCAAGCAATGACG
TGACTCCGGCGACGAGCGGCAACGTGGAGGAGGCGGAGAAGGCGAAGGATGAGACGGCGGCGTTGGTTATTCCGTCGCCGCCGCGGAGCAACCACAAGGGTTTGGCCGGG
AAGGTATGGCCGGACGTGTTAGGGTTATTCAATTCTCGTTTGTGGAGTCTTTGGGGTCCAAAT
Protein sequence
MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFNYTCFDCYMRY
WLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAG
KVWPDVLGLFNSRLWSLWGPN
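
The CDS and protein sequences above can be cross-checked by translation. The following is a rough sketch, assuming the standard genetic code and that the final line of the CDS as listed begins on a codon boundary (true if the six preceding lines hold 110 nucleotides each, 660 in total). The `translate` helper is hypothetical and written here only for illustration.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string codon by codon ('*' marks a stop)."""
    dna = dna.upper()
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


# Final line of the CDS as listed in this record (63 nucleotides).
last_cds_line = "AAGGTATGGCCGGACGTGTTAGGGTTATTCAATTCTCGTTTGTGGAGTCTTTGGGGTCCAAAT"
print(translate(last_cds_line))  # KVWPDVLGLFNSRLWSLWGPN
```

The translated tail matches the last line of the protein sequence. It ends with an asparagine codon (AAT) rather than a stop codon, so the CDS as listed does not include the terminator.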