CuGenDBv2

MC05g1401 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC05g1401
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: Protein of unknown function (DUF1685)
Genome location: MC05:18175161..18176701
RNA-Seq Expression: MC05g1401
Synteny: MC05g1401
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
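The genome location above can be split into chromosome, start, and end. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this); `parse_location` is a hypothetical helper name, not part of any CuGenDB tooling.

```python
import re


def parse_location(loc: str):
    """Split a 'chrom:start..end' string into its parts and the span length.

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


print(parse_location("MC05:18175161..18176701"))
# ('MC05', 18175161, 18176701, 1541)
```

Under that assumption the gene spans 1541 bp on MC05.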


Homology
GenBank top hits (columns: e value, % identity, alignment)
XP_004137815.3 uncharacterized protein LOC101215662 [Cucumis sativus] | e value: 4.67e-81 | % identity: 64.03
Query:  MAMNTC-TLCLVSAMDRLWYHQIILWS-DPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKE
        MAMNT  TLCLVSAMDRLWYHQIIL S DPL +SH PN      + PFT F PS  SP SPL ++TI+PSS SS S    D+ISL S E  SN++DK K+
Subjt:  MAMNTC-TLCLVSAMDRLWYHQIILWS-DPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKE

Query:  VEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRK---------QLEEENDEKNDDED
          KRE +E    N LK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+L+PQMV L+PGLQRL   INK+            ++END+ +DD++
Subjt:  VEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRK---------QLEEENDEKNDDED

Query:  HKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
         KR+ +RPYLSEAW I+RPNSPLL LRM KVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  HKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

XP_008442668.1 PREDICTED: putative uncharacterized protein YGR160W [Cucumis melo] | e value: 1.95e-80 | % identity: 63.92
Query:  MAMNTC-TLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSHEGCSNDDDKEK
        MAMNT  TLCLVSAMDRLWYHQIIL SDP +S H PNF     + PFT F PS  SP SPL ++TI  +PSS SS S    D+ISL S E  +N++DK+K
Subjt:  MAMNTC-TLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSHEGCSNDDDKEK

Query:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD
        +  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+L+PQMV L+PGLQRL    NK+   E          E +D+ +DD
Subjt:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD

Query:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        +D KR+ +RPYLSEAW I+RPNSPLL LRM KVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

XP_022145659.1 uncharacterized protein LOC111015056 [Momordica charantia] | e value: 2.77e-159 | % identity: 98.33
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRES
        MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFS LSVSSV+DISLDS EGCSNDDDKEKEVEKRES
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRES

Query:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI
        TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDE NDDEDHKRDKSRPYLSEAWTI
Subjt:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI

Query:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
        KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
Subjt:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES

XP_022983601.1 uncharacterized protein LOC111482158 isoform X2 [Cucurbita maxima] | e value: 9.04e-65 | % identity: 55.98
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FPS       L+++ I            +DD SL S E  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENL+PQMV L+PGLQR       + +++++N E +DD+D KRD +RPYLSEAWTI RP
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP

Query:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        NSPLL LRM KVSSTSDMKK LK WA+TVA EIQ
Subjt:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

XP_038903410.1 uncharacterized protein LOC120090009 isoform X1 [Benincasa hispida] | e value: 1.25e-92 | % identity: 69.39
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQT--LPFTKFPSCPSPS----SPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKE
        MAMNT TLCLVS MDRLWYHQIILWSDPLSS H+PNF  T    FT FPS PSPS    SPL +++I+PSS  S SVSS D+ISL S +  SND+DK K+
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQT--LPFTKFPSCPSPS----SPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKE

Query:  VEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKND--DEDHKRDKSRP
          K+E +E+  NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFK+ENL+P+MV LLPGLQRL      ++ LEEE+D+ +D  D+D KRD +RP
Subjt:  VEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKND--DEDHKRDKSRP

Query:  YLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        YLSEAWTIKR NSPLL LRM KVSSTSDMKKHLK WAKTVA EIQ
Subjt:  YLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

TrEMBL top hits (columns: e value, % identity, alignment)
A0A1S3B6W6 Uncharacterized protein | e value: 9.42e-81 | % identity: 63.92
Query:  MAMNTC-TLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSHEGCSNDDDKEK
        MAMNT  TLCLVSAMDRLWYHQIIL SDP +S H PNF     + PFT F PS  SP SPL ++TI  +PSS SS S    D+ISL S E  +N++DK+K
Subjt:  MAMNTC-TLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSHEGCSNDDDKEK

Query:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD
        +  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+L+PQMV L+PGLQRL    NK+   E          E +D+ +DD
Subjt:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD

Query:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        +D KR+ +RPYLSEAW I+RPNSPLL LRM KVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1CVW5 uncharacterized protein LOC111015056 | e value: 6.65e-160 | % identity: 98.75
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRES
        MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFS LSVSSVDDISLDS EGCSNDDDKEKEVEKRES
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRES

Query:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI
        TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDE NDDEDHKRDKSRPYLSEAWTI
Subjt:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI

Query:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
        KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
Subjt:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES

A0A6J1F521 uncharacterized protein LOC111442189 | e value: 3.90e-61 | % identity: 53.85
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK
        NT T CLVSAMDRLW+HQIIL S    +SHL     T PF+ FPS       L+++ I           S+DD SL SHE    + DK K+  K ES ++
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP
          ++ + ++  KLNKS SC+SLGELELEEVKGF+DLGF+F+ ENL+PQM+ L+PGLQR    ++K+      +D   +D+D KRD +RPYLSEAWTI RP
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP

Query:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        NSPLL LRM K+SST+DMKKHL+ WA TVA EIQ
Subjt:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1J2S9 uncharacterized protein LOC111482158 isoform X2 | e value: 4.38e-65 | % identity: 55.98
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FPS       L+++ I            +DD SL S E  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENL+PQMV L+PGLQR       + +++++N E +DD+D KRD +RPYLSEAWTI RP
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP

Query:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        NSPLL LRM KVSSTSDMKK LK WA+TVA EIQ
Subjt:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1J6B4 uncharacterized protein LOC111482158 isoform X1 | e value: 1.71e-64 | % identity: 56.07
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FPS       L+++ I            +DD SL S E  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDD----EDHK-RDKSRPYLSEAW
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENL+PQMV L+PGLQR    ++K+  LE+++D+ +DD    +DHK RD +RPYLSEAW
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDD----EDHK-RDKSRPYLSEAW

Query:  TIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        TI RPNSPLL LRM KVSSTSDMKK LK WA+TVA EIQ
Subjt:  TIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT2G31560.1 Protein of unknown function (DUF1685) | e value: 1.8e-05 | % identity: 31.09
Query:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST
        +SL + +LEE+KG +DLGF F  + + P++   LP L+       K    +++N  K+ +ED   D S P  + A    W I  P               
Subjt:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST

Query:  SDMKKHLKFWAKTVASEIQ
         D+K  LK+WA+TVA  ++
Subjt:  SDMKKHLKFWAKTVASEIQ

AT2G31560.2 Protein of unknown function (DUF1685) | e value: 1.8e-05 | % identity: 31.09
Query:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST
        +SL + +LEE+KG +DLGF F  + + P++   LP L+       K    +++N  K+ +ED   D S P  + A    W I  P               
Subjt:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST

Query:  SDMKKHLKFWAKTVASEIQ
         D+K  LK+WA+TVA  ++
Subjt:  SDMKKHLKFWAKTVASEIQ

AT2G42760.1 unknown protein | e value: 1.2e-09 | % identity: 29.8
Query:  PSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKR-ENL
        P   +P+  +TI       LS   V+  ++   E   ++ +++++ +K++S      N+++  G         +S+ +LE EE+KGF+DLGF F   ++ 
Subjt:  PSPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKR-ENL

Query:  NPQMVTLLPGLQRLGIPINKRKQL-EEENDEKNDDEDHKRDKSRPYLSEAWTI------KRPNSPLLQLRMSKVSSTS--DMKKHLKFWAKTVASEIQ
        +  +V++LPGLQRL   + K   + +EE +E+ +D+      +RPYLSEAW        K+  +P ++ R+   ++ S  D+K +L+ WA  VAS I+
Subjt:  NPQMVTLLPGLQRLGIPINKRKQL-EEENDEKNDDEDHKRDKSRPYLSEAWTI------KRPNSPLLQLRMSKVSSTS--DMKKHLKFWAKTVASEIQ

AT2G43340.1 Protein of unknown function (DUF1685) | e value: 1.3e-06 | % identity: 28.96
Query:  TNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEKMPNN-LKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTL
        ++  I  SSFSS S S  ++  +++  G      + K++EK++S   +    + S+V   L ++K   SL + +LEE+KG +DLGF F  E + P++   
Subjt:  TNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEKMPNN-LKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTL

Query:  LPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKV-SSTSDMKKHLKFWAKTVASEIQ
        LP L           +L     +K  D+DH    S     ++  +  P SP+   ++S    +  D+K  LKFWA+ VA  ++
Subjt:  LPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKV-SSTSDMKKHLKFWAKTVASEIQ


Sequences
CDS sequence
TCACTTGTGACCACATCCTCCAATATTCCCCCAAAGCCAAAGGCAAGCCATTCCCACTCCAATTTGCCCTTTTTTTTCATTTGCTTTCCCATTCATTCCATTTTTCTTTC
ATTTCAACCACAAACTCTCTCCCTCTTCCATGGTTTTTTATATCCCAAACTTGATTTCTTTCTCTCTTCAATGGCTATGAACACTTGTACTTTATGTTTGGTTTCAGCCA
TGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCAGATCCTCTAAGTTCTTCCCATCTTCCTAATTTTGACCAAACTTTGCCTTTCACGAAGTTTCCTTCTTGCCCA
TCTCCCTCTTCACCTCTAACAAATGAAACAATTATCCCCTCCTCTTTCTCGTCTCTCTCGGTTTCCTCTGTCGATGATATCTCCCTCGACTCACACGAAGGTTGTAGTAA
TGATGATGACAAAGAGAAAGAAGTTGAAAAGAGAGAATCAACTGAAAAAATGCCCAACAATCTCAAATCTTCAGTGGGGATAAAATTGAACAAATCAAAAAGTTGTAGAA
GTTTGGGAGAGTTGGAACTTGAAGAAGTTAAAGGGTTTATTGATTTGGGGTTTGAATTCAAGAGAGAAAATTTGAACCCTCAAATGGTGACATTACTTCCTGGTTTACAA
AGACTTGGAATTCCCATAAACAAACGGAAACAGCTTGAAGAAGAAAATGATGAAAAAAATGATGATGAAGATCATAAGAGAGATAAATCAAGGCCATATCTCTCAGAGGC
ATGGACAATAAAAAGACCAAATTCTCCTCTTTTACAACTAAGGATGTCAAAGGTTTCTTCGACCTCGGACATGAAGAAACACCTCAAATTTTGGGCTAAAACCGTTGCGT
CTGAAATTCAACAAGAATCTTAA
mRNA sequence
ATTCACTTGTGACCACATCCTCCAATATTCCCCCAAAGCCAAAGGCAAGCCATTCCCACTCCAATTTGCCCTTTTTTTTCATTTGCTTTCCCATTCATTCCATTTTTCTT
TCATTTCAACCACAAACTCTCTCCCTCTTCCATGGTTTTTTATATCCCAAACTTGATTTCTTTCTCTCTTCAATGGCTATGAACACTTGTACTTTATGTTTGGTTTCAGC
CATGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCAGATCCTCTAAGTTCTTCCCATCTTCCTAATTTTGACCAAACTTTGCCTTTCACGAAGTTTCCTTCTTGCC
CATCTCCCTCTTCACCTCTAACAAATGAAACAATTATCCCCTCCTCTTTCTCGTCTCTCTCGGTTTCCTCTGTCGATGATATCTCCCTCGACTCACACGAAGGTTGTAGT
AATGATGATGACAAAGAGAAAGAAGTTGAAAAGAGAGAATCAACTGAAAAAATGCCCAACAATCTCAAATCTTCAGTGGGGATAAAATTGAACAAATCAAAAAGTTGTAG
AAGTTTGGGAGAGTTGGAACTTGAAGAAGTTAAAGGGTTTATTGATTTGGGGTTTGAATTCAAGAGAGAAAATTTGAACCCTCAAATGGTGACATTACTTCCTGGTTTAC
AAAGACTTGGAATTCCCATAAACAAACGGAAACAGCTTGAAGAAGAAAATGATGAAAAAAATGATGATGAAGATCATAAGAGAGATAAATCAAGGCCATATCTCTCAGAG
GCATGGACAATAAAAAGACCAAATTCTCCTCTTTTACAACTAAGGATGTCAAAGGTTTCTTCGACCTCGGACATGAAGAAACACCTCAAATTTTGGGCTAAAACCGTTGC
GTCTGAAATTCAACAAGAATCTTAAAACGCTCATTTTAAGACTCTTTTCAAATAGAAGTTTTTACTTTTGATAAACAAAAATTCAGTAGAAACATGTTTGGATAACTAGT
CTTGAAAAACAGTTGTATTTCAACGTAATAATAACACTATCTAATTAAGTCGTTTTGAATAAAATTTTGACCCTCCATCAACTTTGTTCTTTCACCCTCCAATTGTAAAG
GAATAAAAATTGTACTTTTATGTATAGTTTTTTCTTTCACTTTTATTTTAAGA
Protein sequence
SLVTTSSNIPPKPKASHSHSNLPFFFICFPIHSIFLSFQPQTLSLFHGFLYPKLDFFLSSMAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCP
SPSSPLTNETIIPSSFSSLSVSSVDDISLDSHEGCSNDDDKEKEVEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQ
RLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
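The listed protein sequence can be checked against the listed CDS by translating the CDS from its first base. The following is a minimal sketch, assuming the standard genetic code; `translate` is a hypothetical helper written for this illustration, and only the first 36 nucleotides and the last 12 nucleotides of the CDS are used here.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nucleotides of the CDS listed above.
print(translate("TCACTTGTGACCACATCCTCCAATATTCCCCCAAAG"))  # SLVTTSSNIPPK
# Last 12 nucleotides of the CDS, ending in the TAA stop codon.
print(translate("CAAGAATCTTAA"))  # QES
```

Both results agree with the start (SLVTTSSNIPPK) and end (QES) of the protein sequence shown above.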