CuGenDBv2

MS002974 (gene) of Bitter gourd (TR) v1 genome

Gene ID: MS002974
Organism: Momordica charantia cv. TR (Bitter gourd (TR) v1)
Description: Mucin-17 like
Genome location: scaffold595_1:619330..620064
RNA-Seq Expression: MS002974
Synteny: MS002974
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA
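The genome location field can be split into a sequence identifier and a coordinate pair. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state the convention); `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc: str):
    """Split 'seqid:start..end' into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the
    span is end - start + 1.
    """
    m = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("scaffold595_1:619330..620064")
print(seqid, start, end, span)  # scaffold595_1 619330 620064 735
```

Under that assumption the gene spans 735 bp, which can be compared with the length of the CDS sequence listed further down the record.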


Homology
GenBank top hits (e-value, % identity, alignment)
KAG6576126.1 hypothetical protein SDJN03_26765, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 2.9e-75; % identity: 63.57)
Query:  MKIKNKGKVHPSP--SSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVH
        MKIKNKGKVHPSP  SSSSSSSSSSSSDGDF +V NYLP A+LAL+S+L  +DREVLAFMMRRSMETSS  S  S  K SK++ KKS A RA+ G  CVH
Subjt:  MKIKNKGKVHPSP--SSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVH

Query:  APPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-----------PPELPPPVIEDGSLAPVVSSND
        +PP+F  TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P           P  LP P+ E  S AP  S  D
Subjt:  APPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-----------PPELPPPVIEDGSLAPVVSSND

Query:  VTPATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
           A  G   E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP+
Subjt:  VTPATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

KAG6592803.1 hypothetical protein SDJN03_12279, partial [Cucurbita argyrosperma subsp. sororia] (e-value: 1.5e-76; % identity: 65.49)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP    SSSSSSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA RA  G  CVHAP
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTP
        P+   +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  KGK+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S +    
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTP

Query:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        A  G+  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

KAG7020289.1 hypothetical protein SDJN02_16972, partial [Cucurbita argyrosperma subsp. argyrosperma] (e-value: 5.8e-76; % identity: 64.71)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP    SSSSSSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA RA  G  CVHAP
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPP----------PELPPPVIEDGSLAPVVSSNDVTP
        P+   +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  KGK+K+K+GRRS++ P           PE+ P  I+DGS+AP  S +    
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPP----------PELPPPVIEDGSLAPVVSSNDVTP

Query:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        A  G+  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022953880.1 uncharacterized protein LOC111456285 [Cucurbita moschata] (e-value: 2.4e-74; % identity: 63.28)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP   SSSSSSSSSDGDF +V NYLP A+LAL+S+L  +DREVLAFMMRRSMETSS  S  S  K SK++ KKS A RA+ G  CVH+P
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVT
        P+F  TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P     P LPP       P+ E  S AP  S +D  
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVT

Query:  PATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         A  G   E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP+
Subjt:  PATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

XP_022960393.1 uncharacterized protein LOC111461129 [Cucurbita moschata] (e-value: 3.7e-75; % identity: 64.71)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP    SSS SSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA RA  G  CVHAP
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTP
        P+   +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  K K+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S +    
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTP

Query:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        A  G+  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

TrEMBL top hits (e-value, % identity, alignment)
A0A1S3CC21 uncharacterized protein LOC103499074 (e-value: 1.0e-70; % identity: 62.31)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP   SSSSSSSSSDG+F DVLNYLP AI ALVSVL  +DREVLAFMMRRSMETSSP SS S KK SK+ SKKS   RA     CVHAP
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD----------APPPELPPPVIEDGSLAPVVSSNDVTP
        P+   TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGK+K+K+GRRS D           P PE  P  I++GS A   +S     
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTD----------APPPELPPPVIEDGSLAPVVSSNDVTP

Query:  ATSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
               EA ++++E      A +VIPSPP + HKGLA KVWPDVLGLFNSRLWSLW PN
Subjt:  ATSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A5D3DM01 Uncharacterized protein (e-value: 2.7e-71; % identity: 62.69)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP SSSSSSSSSSSDG+F DVLNYLP AI ALVSVL  +DREVLAFMMRRSMETSSP SS S K+ SK+ SKKS   RA     CVHAP
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDA----------PPPELPPPVIEDGSLAPVVSSNDVTP
        P+   TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGK+K+K+GRRS D           P PE  P  I++GS A   +S     
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDA----------PPPELPPPVIEDGSLAPVVSSNDVTP

Query:  ATSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
               EA ++++E      A +VIPSPP + HKGLA KVWPDVLGLFNSRLWSLW PN
Subjt:  ATSGNVEEAEKAKDET-----AALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1GQW4 uncharacterized protein LOC111456285 (e-value: 1.2e-74; % identity: 63.28)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP   SSSSSSSSSDGDF +V NYLP A+LAL+S+L  +DREVLAFMMRRSMETSS  S  S  K SK++ KKS A RA+ G  CVH+P
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVT
        P+F  TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P     P LPP       P+ E  S AP  S +D  
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP----PPELPP-------PVIEDGSLAPVVSSNDVT

Query:  PATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
         A  G   E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP+
Subjt:  PATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1H8S8 uncharacterized protein LOC111461129 (e-value: 1.8e-75; % identity: 64.71)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP    SSS SSSSDGDF DV NYLPAAILAL++VL  +DREVLAFMMRRSMETS+P SS SE K SK+ SKKSGA RA  G  CVHAP
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTP
        P+   +CFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE  GKN  K K+K+K+GRRS++ P       PP LP   P  I+DGS+AP  S +    
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP-------PPELP---PPVIEDGSLAPVVSSNDVTP

Query:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        A  G+  E  + +   A +V+PS P SNHKGLA KVWPDVLGLFNSRLWSLWGPN
Subjt:  ATSGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGPN

A0A6J1JUQ5 uncharacterized protein LOC111488030 (e-value: 7.6e-74; % identity: 63.49)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MKIKNKGKVHPSP       SSSSSDGDF +V NYLP AILAL+SVL A+DREVLAFMMRRSMETSS  S  S  K SK++ KKS A RA+ G  CVH+P
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPAT
        P+F  TCFDCYM YW RW+SSPNG++I+QAIEAF+EQLA GE   KN  KGKK+EK+GR+S+D P        P  LP P+ E  S AP  S +D   A 
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAP--------PPELPPPVIEDGSLAPVVSSNDVTPAT

Query:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGP
         G   E  + +DETA +++PSPP SN KGLA KVWPDVLGLFNSRLWSLWGP
Subjt:  SGNVEEAEKAKDETAALVIPSPPRSNHKGLAGKVWPDVLGLFNSRLWSLWGP

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT1G12020.1 unknown protein (e-value: 6.7e-14; % identity: 30.99)
Query:  KIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETS--SPPSSESEKKASKKSSKKSGAARATGGGECVHA
        K+  KG VHPSP    S+            +L  LP AI +L +VL  EDREVLA+++  +  +   +P S  ++ KA KK+   + +            
Subjt:  KIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETS--SPPSSESEKKASKKSSKKSGAARATGGGECVHA

Query:  PPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAA--------KGKKKEKVGRRSTDAPPPELPPPVIED-GSLAPVVSSNDVTP
         P F+  CF CY  YW+RWDSSP+ ++I++ I+AF++ L   +N  KN           GK    +   S      E+P  + E   +  P  SS+++T 
Subjt:  PPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAA--------KGKKKEKVGRRSTDAPPPELPPPVIED-GSLAPVVSSNDVTP

Query:  ---ATSGNVEEAE
             SG +E  E
Subjt:  ---ATSGNVEEAE

AT1G24270.1 unknown protein (e-value: 5.1e-22; % identity: 41.56)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP
        MK+  KGKVHPSP   SSSSS+     D L V   L +AIL LVSVL AED EVLA+++ RS+ T++  S + ++                      H  
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAP

Query:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKK
        P  +  CFDCY  YW +WDSS N ++INQ IEAF++ L   E    + +K  KK
Subjt:  PAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKK

AT1G62422.1 unknown protein (e-value: 2.8e-12; % identity: 31.68)
Query:  KIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAPP
        K+  KG VHPSP        +  +D  FL +   LP AIL+LV+ L  EDREVLA+++          S +S + +  K +K+             H  P
Subjt:  KIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAPP

Query:  AFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEK
         F   CF CY  YW+RWD+SP  ++I++ I+A+++ L   E   K   + K+  K   R        L     E GS +   +  D     +   EEAEK
Subjt:  AFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEK

Query:  AK
         K
Subjt:  AK

AT5G13090.1 unknown protein (e-value: 3.8e-41; % identity: 43.17)
Query:  MKIKNKGKVHPSPSSSSSSSSSSSS-------DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGG
        MK+K KGKV+PSP     SSSSSSS       D D L VL  LPA IL LVSVL +E+REVLA+++ R    S   +S S+ K  KKS+K S        
Subjt:  MKIKNKGKVHPSPSSSSSSSSSSSS-------DGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGG

Query:  GECVHAPPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAK-GKKKEKVGRRSTDA-PPPELPPPVIEDGSLAPVVS-SNDVTP
            H PP F+  CFDCY  YW RWDSSPN ++I++ IEAF+       +  ++ +K GKKKEK GRR TD+   P L      D    PVV   N+ T 
Subjt:  GECVHAPPAFNYTCFDCYMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAK-GKKKEKVGRRSTDA-PPPELPPPVIEDGSLAPVVS-SNDVTP

Query:  ATSGNV------EEAEKA---------------KDETAALVIPSPPR--SNHKGLAGKVWPDVLGLFNSRLWSLWGPN
        + S +V       EAE A               ++E+  +V P+     + HKGLA KV PDVLGLF+S  W LW PN
Subjt:  ATSGNV------EEAEKA---------------KDETAALVIPSPPR--SNHKGLAGKVWPDVLGLFNSRLWSLWGPN
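
In the alignment blocks above, the unlabelled middle row marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. Because the Subjt rows in this record repeat the query text, the middle row is the more useful one for counting matches. The sketch below tallies identities and positives for one fragment; `midline_stats` is a hypothetical helper, and the fraction it yields covers only the fragment passed in, so it is not expected to reproduce the whole-alignment % identity values listed with each hit.

```python
def midline_stats(midline: str, width: int):
    """Count identical and positive columns in an alignment midline.

    midline: the middle row of one alignment block, without the
             leading indentation that lines it up under 'Query:'.
    width:   number of aligned columns (length of the query row),
             needed because trailing spaces may have been stripped.
    """
    padded = midline.ljust(width)[:width]
    identical = sum(ch.isalpha() for ch in padded)   # letters = exact matches
    positives = identical + padded.count("+")        # '+' = conservative
    return identical, positives, width

# Fragment taken from the first block of the A0A1S3CC21 alignment.
query   = "DGDFLDVLNYLPAAILALVSVL"
midline = "DG+F DVLNYLP AI ALVSVL"

identical, positives, width = midline_stats(midline, len(query))
print(identical, positives, width)            # 18 19 22
print(f"{100 * identical / width:.2f}")       # 81.82
```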


Sequences
CDS sequence
ATGAAGATAAAGAACAAGGGCAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCGATGGGGATTTTTTGGATGTCTTGAACTATCTGCC
GGCGGCGATTTTGGCTCTGGTTTCCGTTCTGGGGGCGGAGGATCGGGAGGTTTTGGCCTTCATGATGAGGAGGTCGATGGAGACTTCATCGCCGCCGTCTTCTGAGTCGG
AAAAGAAAGCTTCCAAGAAGTCCTCCAAGAAATCCGGCGCTGCACGCGCCACCGGTGGCGGCGAGTGTGTTCACGCGCCGCCGGCGTTTAACTACACTTGCTTCGACTGT
TACATGAGGTACTGGCTGCGGTGGGACTCGTCGCCCAACGGCAAGATTATTAATCAGGCGATTGAGGCTTTCGATGAACAGCTGGCCACCGGCGAAAATCCCGGCAAGAA
CGCCGCGAAAGGGAAGAAAAAGGAGAAGGTCGGCCGGCGGTCGACGGATGCTCCGCCGCCGGAACTTCCCCCTCCGGTGATTGAGGATGGCTCTCTCGCTCCGGTTGTTT
CAAGCAATGACGTGACTCCGGCGACGAGCGGCAACGTGGAGGAGGCGGAGAAGGCGAAGGATGAAACGGCGGCGTTGGTTATTCCGTCGCCGCCGCGGAGCAACCACAAG
GGTTTGGCCGGGAAGGTATGGCCGGACGTGTTAGGGTTATTCAATTCTCGTTTGTGGAGTCTTTGGGGTCCAAAT
mRNA sequence
ATGAAGATAAAGAACAAGGGCAAAGTTCACCCATCTCCATCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCTTCCGATGGGGATTTTTTGGATGTCTTGAACTATCTGCC
GGCGGCGATTTTGGCTCTGGTTTCCGTTCTGGGGGCGGAGGATCGGGAGGTTTTGGCCTTCATGATGAGGAGGTCGATGGAGACTTCATCGCCGCCGTCTTCTGAGTCGG
AAAAGAAAGCTTCCAAGAAGTCCTCCAAGAAATCCGGCGCTGCACGCGCCACCGGTGGCGGCGAGTGTGTTCACGCGCCGCCGGCGTTTAACTACACTTGCTTCGACTGT
TACATGAGGTACTGGCTGCGGTGGGACTCGTCGCCCAACGGCAAGATTATTAATCAGGCGATTGAGGCTTTCGATGAACAGCTGGCCACCGGCGAAAATCCCGGCAAGAA
CGCCGCGAAAGGGAAGAAAAAGGAGAAGGTCGGCCGGCGGTCGACGGATGCTCCGCCGCCGGAACTTCCCCCTCCGGTGATTGAGGATGGCTCTCTCGCTCCGGTTGTTT
CAAGCAATGACGTGACTCCGGCGACGAGCGGCAACGTGGAGGAGGCGGAGAAGGCGAAGGATGAAACGGCGGCGTTGGTTATTCCGTCGCCGCCGCGGAGCAACCACAAG
GGTTTGGCCGGGAAGGTATGGCCGGACGTGTTAGGGTTATTCAATTCTCGTTTGTGGAGTCTTTGGGGTCCAAAT
Protein sequence
MKIKNKGKVHPSPSSSSSSSSSSSSDGDFLDVLNYLPAAILALVSVLGAEDREVLAFMMRRSMETSSPPSSESEKKASKKSSKKSGAARATGGGECVHAPPAFNYTCFDC
YMRYWLRWDSSPNGKIINQAIEAFDEQLATGENPGKNAAKGKKKEKVGRRSTDAPPPELPPPVIEDGSLAPVVSSNDVTPATSGNVEEAEKAKDETAALVIPSPPRSNHK
GLAGKVWPDVLGLFNSRLWSLWGPN
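
The protein sequence should follow from the CDS by codon-wise translation. The following is a minimal sketch, assuming the standard nuclear genetic code; `translate` is a hypothetical helper written here for illustration rather than a function from any particular library, and only the first 30 nucleotides of the CDS are checked.

```python
# Standard genetic code, laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a nucleotide string codon by codon ('*' = stop).

    Any trailing one or two bases that do not form a full codon
    are ignored.
    """
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))

# First 30 nucleotides of the CDS listed above.
cds_prefix = "ATGAAGATAAAGAACAAGGGCAAAGTTCAC"
print(translate(cds_prefix))  # MKIKNKGKVH
```

The output matches the first ten residues of the protein sequence; the same function can be applied to the full CDS after joining its wrapped lines.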