; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g31490 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g31490
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRna exonuclease 3
Genome locationchr5:23633643..23634753
RNA-Seq ExpressionMoc05g31490
SyntenyMoc05g31490
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137815.3 uncharacterized protein LOC101215662 [Cucumis sativus]3.4e-6564.03Show/hide
Query:  MAMNT-CTLCLVSAMDRLWYHQIILW-SDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKE
        MAMNT  TLCLVSAMDRLWYHQIIL  SDPL +SH PN      + PFT F PS  SP SPL ++TI+PSS    S SS ++ISL SQE  SN++DK K+
Subjt:  MAMNT-CTLCLVSAMDRLWYHQIILW-SDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKE

Query:  VEKRE-STEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRK---------QLEEENDEKNDDED
          KRE S ++  N LK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+L+PQMV L+PGLQRL   INK+            ++END+ +DD++
Subjt:  VEKRE-STEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRK---------QLEEENDEKNDDED

Query:  HKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
         KR+ +RPYLSEAW I+RPNSPLL LRM KVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  HKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

XP_008442668.1 PREDICTED: putative uncharacterized protein YGR160W [Cucumis melo]1.3e-6463.92Show/hide
Query:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSTLSVSSVNDISLDSQEGCSNDDDKEK
        MAMNT  TLCLVSAMDRLWYHQIIL SDP  +SH PNF     + PFT F PS  SP SPL ++TI  +PSS    S SS ++ISL SQE  +N++DK+K
Subjt:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSTLSVSSVNDISLDSQEGCSNDDDKEK

Query:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD
        +  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+L+PQMV L+PGLQRL    NK+   E          E +D+ +DD
Subjt:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD

Query:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        +D KR+ +RPYLSEAW I+RPNSPLL LRM KVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

XP_022145659.1 uncharacterized protein LOC111015056 [Momordica charantia]8.6e-12598.33Show/hide
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRES
        MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFS LSVSSV+DISLDS EGCSNDDDKEKEVEKRES
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRES

Query:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI
        TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDE NDDEDHKRDKSRPYLSEAWTI
Subjt:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI

Query:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
        KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
Subjt:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES

XP_022983601.1 uncharacterized protein LOC111482158 isoform X2 [Cucurbita maxima]9.7e-5255.98Show/hide
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FP      S L+++ I            ++D SL SQE  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENL+PQMV L+PGLQR       + +++++N E +DD+D KRD +RPYLSEAWTI RP
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP

Query:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        NSPLL LRM KVSSTSDMKK LK WA+TVA EIQ
Subjt:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

XP_038903410.1 uncharacterized protein LOC120090009 isoform X1 [Benincasa hispida]5.3e-7469.39Show/hide
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQT--LPFTKFPSCPSPS----SPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKE
        MAMNT TLCLVS MDRLWYHQIILWSDPL SSH+PNF  T    FT FPS PSPS    SPL +++I+PSS  + SVSS N ISL SQ+  SND+DK K+
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQT--LPFTKFPSCPSPS----SPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKE

Query:  VEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKND--DEDHKRDKSRP
          K+E +E+  NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFK+ENL+P+MV LLPGLQRL      ++ LEEE+D+ +D  D+D KRD +RP
Subjt:  VEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKND--DEDHKRDKSRP

Query:  YLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        YLSEAWTIKR NSPLL LRM KVSSTSDMKKHLK WAKTVA EIQ
Subjt:  YLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

TrEMBL top hitse value%identityAlignment
A0A1S3B6W6 Uncharacterized protein6.3e-6563.92Show/hide
Query:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSTLSVSSVNDISLDSQEGCSNDDDKEK
        MAMNT  TLCLVSAMDRLWYHQIIL SDP  +SH PNF     + PFT F PS  SP SPL ++TI  +PSS    S SS ++ISL SQE  +N++DK+K
Subjt:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSTLSVSSVNDISLDSQEGCSNDDDKEK

Query:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD
        +  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+L+PQMV L+PGLQRL    NK+   E          E +D+ +DD
Subjt:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLE----------EENDEKNDD

Query:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        +D KR+ +RPYLSEAW I+RPNSPLL LRM KVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

A0A5D3DPB9 Uncharacterized protein1.2e-4763.3Show/hide
Query:  IIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGL
        ++PSS    S SS ++ISL SQE  +N++DK+K+  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+L+PQMV L+PGL
Subjt:  IIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGL

Query:  QRLGIPINKRKQLE----------EENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        QRL    NK+   E          E +D+ +DD+D KR+ +RPYLSEAW I+RPNSPLL LRM KVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  QRLGIPINKRKQLE----------EENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1CVW5 uncharacterized protein LOC1110150569.3e-12598.33Show/hide
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRES
        MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFS LSVSSV+DISLDS EGCSNDDDKEKEVEKRES
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRES

Query:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI
        TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDE NDDEDHKRDKSRPYLSEAWTI
Subjt:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTI

Query:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
        KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES
Subjt:  KRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQQES

A0A6J1J2S9 uncharacterized protein LOC111482158 isoform X24.7e-5255.98Show/hide
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FP      S L+++ I            ++D SL SQE  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENL+PQMV L+PGLQR       + +++++N E +DD+D KRD +RPYLSEAWTI RP
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRP

Query:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        NSPLL LRM KVSSTSDMKK LK WA+TVA EIQ
Subjt:  NSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1J6B4 uncharacterized protein LOC111482158 isoform X11.0e-5156.07Show/hide
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FP      S L+++ I            ++D SL SQE  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDD----EDH-KRDKSRPYLSEAW
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENL+PQMV L+PGLQR    ++K + LE+++D+ +DD    +DH KRD +RPYLSEAW
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDD----EDH-KRDKSRPYLSEAW

Query:  TIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ
        TI RPNSPLL LRM KVSSTSDMKK LK WA+TVA EIQ
Subjt:  TIKRPNSPLLQLRMSKVSSTSDMKKHLKFWAKTVASEIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G31560.1 Protein of unknown function (DUF1685)1.5e-0531.09Show/hide
Query:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST
        +SL + +LEE+KG +DLGF F  + + P++   LP L+       K    +++N  K+ +ED   D S P  + A    W I  P               
Subjt:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST

Query:  SDMKKHLKFWAKTVASEIQ
         D+K  LK+WA+TVA  ++
Subjt:  SDMKKHLKFWAKTVASEIQ

AT2G31560.2 Protein of unknown function (DUF1685)1.5e-0531.09Show/hide
Query:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST
        +SL + +LEE+KG +DLGF F  + + P++   LP L+       K    +++N  K+ +ED   D S P  + A    W I  P               
Subjt:  RSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMSKVSST

Query:  SDMKKHLKFWAKTVASEIQ
         D+K  LK+WA+TVA  ++
Subjt:  SDMKKHLKFWAKTVASEIQ

AT2G42760.1 unknown protein2.0e-1030.3Show/hide
Query:  PSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKR-ENL
        P   +P+  +TI       LS   VN  ++  +E   ++ +++++ +K++S      N+++  G         +S+ +LE EE+KGF+DLGF F   ++ 
Subjt:  PSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKR-ENL

Query:  NPQMVTLLPGLQRLGIPINKRKQL-EEENDEKNDDEDHKRDKSRPYLSEAWTI------KRPNSPLLQLRMSKVSSTS--DMKKHLKFWAKTVASEIQ
        +  +V++LPGLQRL   + K   + +EE +E+ +D+      +RPYLSEAW        K+  +P ++ R+   ++ S  D+K +L+ WA  VAS I+
Subjt:  NPQMVTLLPGLQRLGIPINKRKQL-EEENDEKNDDEDHKRDKSRPYLSEAWTI------KRPNSPLLQLRMSKVSSTS--DMKKHLKFWAKTVASEIQ

AT2G43340.1 Protein of unknown function (DUF1685)3.9e-0628.42Show/hide
Query:  TNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEKMPNN-LKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTL
        ++  I  SSFS+ S S   +  +++  G      + K++EK++S   +    + S+V   L ++K   SL + +LEE+KG +DLGF F  E + P++   
Subjt:  TNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEKMPNN-LKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTL

Query:  LPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKV-SSTSDMKKHLKFWAKTVASEIQ
        LP L           +L     +K  D+DH    S     ++  +  P SP+   ++S    +  D+K  LKFWA+ VA  ++
Subjt:  LPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKV-SSTSDMKKHLKFWAKTVASEIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATGAACACTTGTACTTTATGTTTGGTTTCAGCCATGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCAGATCCTCTAAGTTCTTCCCATCTTCCTAATTT
TGACCAAACTTTGCCTTTCACGAAGTTTCCTTCTTGCCCATCTCCCTCTTCACCTCTAACAAATGAAACAATTATCCCCTCCTCTTTCTCGACTCTCTCGGTTTCCTCTG
TCAATGATATCTCCCTCGACTCACAGGAAGGTTGTAGTAATGATGATGACAAAGAGAAAGAAGTTGAAAAGAGAGAATCAACTGAAAAAATGCCCAACAATCTCAAATCT
TCAGTGGGGATAAAATTGAACAAATCAAAAAGTTGTAGAAGTTTGGGAGAGTTGGAACTTGAAGAAGTTAAAGGGTTTATTGATTTGGGGTTTGAATTCAAGAGAGAAAA
TTTGAACCCTCAAATGGTGACATTACTTCCTGGTTTACAAAGACTTGGAATTCCCATAAACAAACGGAAACAACTTGAAGAAGAAAATGATGAAAAAAATGATGATGAAG
ATCATAAGAGAGATAAATCAAGGCCATATCTCTCAGAGGCATGGACAATAAAAAGACCAAATTCTCCTCTTTTACAACTAAGGATGTCAAAGGTTTCTTCGACCTCGGAC
ATGAAGAAACACCTCAAATTTTGGGCTAAAACCGTTGCGTCTGAAATTCAACAAGAATCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTATGAACACTTGTACTTTATGTTTGGTTTCAGCCATGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCAGATCCTCTAAGTTCTTCCCATCTTCCTAATTT
TGACCAAACTTTGCCTTTCACGAAGTTTCCTTCTTGCCCATCTCCCTCTTCACCTCTAACAAATGAAACAATTATCCCCTCCTCTTTCTCGACTCTCTCGGTTTCCTCTG
TCAATGATATCTCCCTCGACTCACAGGAAGGTTGTAGTAATGATGATGACAAAGAGAAAGAAGTTGAAAAGAGAGAATCAACTGAAAAAATGCCCAACAATCTCAAATCT
TCAGTGGGGATAAAATTGAACAAATCAAAAAGTTGTAGAAGTTTGGGAGAGTTGGAACTTGAAGAAGTTAAAGGGTTTATTGATTTGGGGTTTGAATTCAAGAGAGAAAA
TTTGAACCCTCAAATGGTGACATTACTTCCTGGTTTACAAAGACTTGGAATTCCCATAAACAAACGGAAACAACTTGAAGAAGAAAATGATGAAAAAAATGATGATGAAG
ATCATAAGAGAGATAAATCAAGGCCATATCTCTCAGAGGCATGGACAATAAAAAGACCAAATTCTCCTCTTTTACAACTAAGGATGTCAAAGGTTTCTTCGACCTCGGAC
ATGAAGAAACACCTCAAATTTTGGGCTAAAACCGTTGCGTCTGAAATTCAACAAGAATCTTAA
Protein sequenceShow/hide protein sequence
MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSTLSVSSVNDISLDSQEGCSNDDDKEKEVEKRESTEKMPNNLKS
SVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLNPQMVTLLPGLQRLGIPINKRKQLEEENDEKNDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMSKVSSTSD
MKKHLKFWAKTVASEIQQES