CuGenDBv2

Moc07g00860 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g00860
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr7:570005..570730
RNA-Seq Expression: Moc07g00860
Synteny: Moc07g00860
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits | e value | % identity | Alignment
KAG6592803.1 hypothetical protein SDJN03_12279, partial [Cucurbita argyrosperma subsp. sororia] | e value: 2.1e-78 | % identity: 66.14
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSSSSSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+  
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG
         +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  KGK+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S +    A  G
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        +  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

KAG7020289.1 hypothetical protein SDJN02_16972, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 7.9e-78 | % identity: 65.34
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSSSSSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+  
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPP----------PELPPPVIEDGSLAPVVSSNDVTPATSG
         +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  KGK+K+K+GRRS++ P           PE+ P  I+DGS+AP  S +    A  G
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPP----------PELPPPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        +  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022953880.1 uncharacterized protein LOC111456285 [Cucurbita moschata] | e value: 2.4e-74 | % identity: 63.24
Query:  MKIKNKGKVHPSP-SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MKIKNKGKVHPSP SSSSSSSSDGDF +V NYLP A+LAL+S+L  +DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F
Subjt:  MKIKNKGKVHPSP-SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT
          TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P     P LPP       P+ E  S AP  S +D   A 
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT

Query:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         G   E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP+
Subjt:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022960393.1 uncharacterized protein LOC111461129 [Cucurbita moschata] | e value: 5.1e-77 | % identity: 65.34
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSS SSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+  
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG
         +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  K K+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S +    A  G
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        +  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022991379.1 uncharacterized protein LOC111488030 [Cucurbita maxima] | e value: 5.3e-74 | % identity: 64.11
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSP   SSSSSDGDF +V NYLP AILAL+SVL A+DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F 
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV
         TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P        P  LP P+ E  S AP  S +D   A  G  
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV

Query:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGP
         E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP
Subjt:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGP

TrEMBL top hits | e value | % identity | Alignment
A0A1S3CC21 uncharacterized protein LOC103499074 | e value: 1.0e-70 | % identity: 62.26
Query:  MKIKNKGKVHPSP-SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MKIKNKGKVHPSP SSSSSSSSDG+F DVLNYLP AI ALVSVL  +DREVLAFMMRRSMETSSP SS S KK SK+ SKKS    A     CVHAPP+ 
Subjt:  MKIKNKGKVHPSP-SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD----------APPPELPPPVIEDGSLAPVVSSNDVTPATS
          TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGK+K+K+GRRS D           P PE  P  I++GS A   +S        
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD----------APPPELPPPVIEDGSLAPVVSSNDVTPATS

Query:  GNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
            EA ++++E      A +VIPSPP + HKGLA KVWPDVLGLFNSRLWSLW PN
Subjt:  GNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A5D3DM01 Uncharacterized protein | e value: 3.9e-70 | % identity: 61.39
Query:  MKIKNKGKVHPSP---SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPP
        MKIKNKGKVHPSP   SSSSSSSSDG+F DVLNYLP AI ALVSVL  +DREVLAFMMRRSMETSSP SS S K+ SK+ SKKS    A     CVHAPP
Subjt:  MKIKNKGKVHPSP---SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPP

Query:  AFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDA----------PPPELPPPVIEDGSLAPVVSSNDVTPA
        +   TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGK+K+K+GRRS D           P PE  P  I++GS A   +S      
Subjt:  AFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDA----------PPPELPPPVIEDGSLAPVVSSNDVTPA

Query:  TSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
              EA ++++E      A +VIPSPP + HKGLA KVWPDVLGLFNSRLWSLW PN
Subjt:  TSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1GQW4 uncharacterized protein LOC111456285 | e value: 1.2e-74 | % identity: 63.24
Query:  MKIKNKGKVHPSP-SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MKIKNKGKVHPSP SSSSSSSSDGDF +V NYLP A+LAL+S+L  +DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F
Subjt:  MKIKNKGKVHPSP-SSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT
          TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P     P LPP       P+ E  S AP  S +D   A 
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVTPAT

Query:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         G   E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP+
Subjt:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1H8S8 uncharacterized protein LOC111461129 | e value: 2.5e-77 | % identity: 65.34
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSPSSS SSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA  A  G  CVHAPP+  
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG
         +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  K K+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S +    A  G
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTPATSG

Query:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        +  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  NVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1JUQ5 uncharacterized protein LOC111488030 | e value: 2.6e-74 | % identity: 64.11
Query:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN
        MKIKNKGKVHPSP   SSSSSDGDF +V NYLP AILAL+SVL A+DREVLAFMMRRSMETSS  S  S  K SK++ KKS A  A+ G  CVH+PP+F 
Subjt:  MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFN

Query:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV
         TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P        P  LP P+ E  S AP  S +D   A  G  
Subjt:  YTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPATSGNV

Query:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGP
         E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP
Subjt:  EEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGP

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G12020.1 unknown protein | e value: 4.6e-15 | % identity: 31.58
Query:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETS--SPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        K+  KG VHPSP    S+        +L  LP AI +L +VL  EDREVLA+++  +  +   +P S  ++ KA KK+   + +             P F
Subjt:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETS--SPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAA--------KGKKKEKVGRRSTDAPPPELPPPVIED-GSLAPVVSSNDVTP---A
        +  CF CY  YW+RWDSSP+ ++I++ I+AF++ L   +N  KN           GK    +   S      E+P  + E   +  P  SS+++T     
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAA--------KGKKKEKVGRRSTDAPPPELPPPVIED-GSLAPVVSSNDVTP---A

Query:  TSGNVEEAE
         SG +E  E
Subjt:  TSGNVEEAE

AT1G24270.1 unknown protein | e value: 5.4e-24 | % identity: 43.05
Query:  MKIKNKGKVHPSPSSSSSSSSDG-DFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF
        MK+  KGKVHPSP   SSSSS+G D L V   L +AIL LVSVL AED EVLA+++ RS+ T++  S + ++                      H  P  
Subjt:  MKIKNKGKVHPSPSSSSSSSSDG-DFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAF

Query:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKK
        +  CFDCY  YW +WDSS N ++INQ IEAF++ L   E    + +K  KK
Subjt:  NYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKK

AT1G62422.1 unknown protein | e value: 2.5e-13 | % identity: 32.32
Query:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFNY
        K+  KG VHPSP    +  +D  FL +   LP AIL+LV+ L  EDREVLA+++          S +S + +  K +K+             H  P F  
Subjt:  KIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFNY

Query:  TCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEKAK
         CF CY  YW+RWD+SP  ++I++ I+A+++ L   E   K   + K+  K   R        L     E GS +   +  D     +   EEAEK K
Subjt:  TCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEKAK

AT5G13090.1 unknown protein | e value: 9.8e-42 | % identity: 43.53
Query:  MKIKNKGKVHPSP-----SSSSSSSS------DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGG
        MK+K KGKV+PSP     SSSSSSSS      D D L VL  LPA IL LVSVL +E+REVLA+++ R    S   +S S+ K  KKS+K S        
Subjt:  MKIKNKGKVHPSP-----SSSSSSSS------DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGG

Query:  GECVHAPPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAK-GKKKEKVGRRSTDA-PPPELPPPVIEDGSLAPVVS-SNDVTP
            H PP F+  CFDCY  YW RWDSSPN ++I++ IEAF+       +  ++ +K GKKKEK GRR TD+   P L      D    PVV   N+ T 
Subjt:  GECVHAPPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAK-GKKKEKVGRRSTDA-PPPELPPPVIEDGSLAPVVS-SNDVTP

Query:  ATSGNV------EEAEKA---------------KDETAALVIPSPPR--SNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        + S +V       EAE A               ++E+  +V P+     + HKGLA KV PDVLGLF+S  W LW PN
Subjt:  ATSGNV------EEAEKA---------------KDETAALVIPSPPR--SNHKGLAGKVWPDVLGLFNSRLWSLWGPN


Sequences
CDS sequence
ATGAAGATAAAGAACAAGGGCAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCCGATGGGGATTTTTTGGATGTCTTGAACTATCTGCCGGCGGCGATTTT
GGCTCTGGTTTCCGTTCTGGGGGCGGAGGATCGGGAGGTTTTGGCCTTCATGATGAGGAGGTCGATGGAGACTTCATCGCCGCCGTCTTCTGAGTCGGAAAAGAAAGCTT
CCAAGAAGTCCTCCAAGAAATCCGGCGCTGCACTCGCCACCGGTGGCGGCGAGTGTGTTCACGCGCCGCCGGCGTTTAACTACACTTGCTTCGACTGTTACATGAGGTAC
TGGCTGCGGTGGGACTCGTCGCCCAACGGCAAGATTATTAATCAGGCGATTGAGGCTTTCGATGAACAGCTGGCCACCGGCGAAAATCCCGGCAAGAACGCCGCGAAAGG
GAAGAAAAAGGAGAAGGTCGGCCGGCGGTCGACGGATGCTCCGCCGCCGGAACTTCCCCCTCCGGTGATTGAGGATGGCTCTCTCGCTCCGGTTGTTTCAAGCAATGACG
TGACTCCGGCGACGAGCGGCAACGTGGAGGAGGCGGAGAAGGCGAAGGATGAGACGGCGGCGTTGGTTATTCCGTCGCCGCCGCGGAGCAACCACAAGGGTTTGGCCGGG
AAGGTATGGCCGGACGTGTTAGGGTTATTCAATTCTCGTTTGTGGAGTCTTTGGGGTCCAAATTAG
mRNA sequence
ATGAAGATAAAGAACAAGGGCAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCCGATGGGGATTTTTTGGATGTCTTGAACTATCTGCCGGCGGCGATTTT
GGCTCTGGTTTCCGTTCTGGGGGCGGAGGATCGGGAGGTTTTGGCCTTCATGATGAGGAGGTCGATGGAGACTTCATCGCCGCCGTCTTCTGAGTCGGAAAAGAAAGCTT
CCAAGAAGTCCTCCAAGAAATCCGGCGCTGCACTCGCCACCGGTGGCGGCGAGTGTGTTCACGCGCCGCCGGCGTTTAACTACACTTGCTTCGACTGTTACATGAGGTAC
TGGCTGCGGTGGGACTCGTCGCCCAACGGCAAGATTATTAATCAGGCGATTGAGGCTTTCGATGAACAGCTGGCCACCGGCGAAAATCCCGGCAAGAACGCCGCGAAAGG
GAAGAAAAAGGAGAAGGTCGGCCGGCGGTCGACGGATGCTCCGCCGCCGGAACTTCCCCCTCCGGTGATTGAGGATGGCTCTCTCGCTCCGGTTGTTTCAAGCAATGACG
TGACTCCGGCGACGAGCGGCAACGTGGAGGAGGCGGAGAAGGCGAAGGATGAGACGGCGGCGTTGGTTATTCCGTCGCCGCCGCGGAGCAACCACAAGGGTTTGGCCGGG
AAGGTATGGCCGGACGTGTTAGGGTTATTCAATTCTCGTTTGTGGAGTCTTTGGGGTCCAAATTAG
Protein sequence
MKIKNKGKVHPSPSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAALATGGGECVHAPPAFNYTCFDCYMRY
WLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAG
KVWPDVLGLFNSRLWSLWGPN
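
Consistency check (illustrative sketch)
The sequences and coordinates above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the database record: it assumes the standard nuclear genetic code (NCBI translation table 1) and 1-based inclusive coordinates in the Genome location field, and the helper names span_length and translate are hypothetical.

```python
from itertools import product

# Standard nuclear genetic code (NCBI translation table 1), codons ordered T, C, A, G.
# Assumption: this record uses the standard table.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def span_length(location: str) -> int:
    """Length of a 'chrom:start..end' interval (assumed 1-based, inclusive)."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 13 codons of the CDS sequence above, followed by its stop codon (TAG).
cds_prefix = "ATGAAGATAAAGAACAAGGGCAAAGTTCACCCATCTCCA" + "TAG"
print(translate(cds_prefix))               # MKIKNKGKVHPSP
print(span_length("chr7:570005..570730"))  # 726
```

Under these assumptions, the 726 bp genomic span corresponds to 242 codons, which fits the 241-residue protein sequence plus one stop codon. Passing the full CDS sequence (with line breaks removed) to translate allows the same comparison against the complete protein sequence.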