; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS019662 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS019662
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptiontranscription initiation factor TFIID subunit 7-like
Genome locationscaffold729:1362182..1363280
RNA-Seq ExpressionMS019662
SyntenyMS019662
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004137815.3 uncharacterized protein LOC101215662 [Cucumis sativus]9.6e-6865.61Show/hide
Query:  MAMNT-CTLCLVSAMDRLWYHQIILW-SDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKE
        MAMNT  TLCLVSAMDRLWYHQIIL  SDPL +SH PN      + PFT F PS  SP SPL ++TI+PSS    S SS D+ISL SQE  SN++DK K+
Subjt:  MAMNT-CTLCLVSAMDRLWYHQIILW-SDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKE

Query:  VEKRE-STEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQK---------QLEEENDEENDDED
          KRE S ++  N LK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+LSPQMV L+PGLQRL   INKQ            ++END+++DD++
Subjt:  VEKRE-STEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQK---------QLEEENDEENDDED

Query:  HKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
         KR+ +RPYLSEAW I+RPNSPLL LRMPKVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  HKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

XP_008442668.1 PREDICTED: putative uncharacterized protein YGR160W [Cucumis melo]2.8e-6765.49Show/hide
Query:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSQEGCSNDDDKEK
        MAMNT  TLCLVSAMDRLWYHQIIL SDP  +SH PNF     + PFT F PS  SP SPL ++TI  +PSS    S SS D+ISL SQE  +N++DK+K
Subjt:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSQEGCSNDDDKEK

Query:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLE----------EENDEENDD
        +  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+LSPQMV L+PGLQRL    NKQ   E          E +D+++DD
Subjt:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLE----------EENDEENDD

Query:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        +D KR+ +RPYLSEAW I+RPNSPLL LRMPKVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

XP_022145659.1 uncharacterized protein LOC111015056 [Momordica charantia]1.5e-12197.05Show/hide
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRES
        MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFS LSVSSV+DISLDS EGCSNDDDKEKEVEKRES
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRES

Query:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTI
        TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENL+PQMVTLLPGLQRLGIPINK+KQLEEENDE NDDEDHKRDKSRPYLSEAWTI
Subjt:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTI

Query:  KRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        KRPNSPLLQLRM KVSSTSDMKKHLKFWAKTVASEIQ
Subjt:  KRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

XP_022983600.1 uncharacterized protein LOC111482158 isoform X1 [Cucurbita maxima]4.6e-5457.74Show/hide
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FP      S L+++ I            +DD SL SQE  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDD----EDH-KRDKSRPYLSEAW
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENLSPQMV L+PGLQR    ++KQ  LE+++D+++DD    +DH KRD +RPYLSEAW
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDD----EDH-KRDKSRPYLSEAW

Query:  TIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        TI RPNSPLL LRMPKVSSTSDMKK LK WA+TVA EIQ
Subjt:  TIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

XP_038903410.1 uncharacterized protein LOC120090009 isoform X1 [Benincasa hispida]5.6e-7670.61Show/hide
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQT--LPFTKFPSCPSPS----SPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKE
        MAMNT TLCLVS MDRLWYHQIILWSDPL SSH+PNF  T    FT FPS PSPS    SPL +++I+PSS  S SVSS D+ISL SQ+  SND+DK K+
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQT--LPFTKFPSCPSPS----SPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKE

Query:  VEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEEND--DEDHKRDKSRP
          K+E +E+  NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFK+ENLSP+MV LLPGLQRL      ++ LEEE+D+++D  D+D KRD +RP
Subjt:  VEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEEND--DEDHKRDKSRP

Query:  YLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        YLSEAWTIKR NSPLL LRMPKVSSTSDMKKHLK WAKTVA EIQ
Subjt:  YLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

TrEMBL top hitse value%identityAlignment
A0A1S3B6W6 Uncharacterized protein1.3e-6765.49Show/hide
Query:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSQEGCSNDDDKEK
        MAMNT  TLCLVSAMDRLWYHQIIL SDP  +SH PNF     + PFT F PS  SP SPL ++TI  +PSS    S SS D+ISL SQE  +N++DK+K
Subjt:  MAMNT-CTLCLVSAMDRLWYHQIILWSDPLSSSHLPNF---DQTLPFTKF-PSCPSPSSPLTNETI--IPSSFSSLSVSSVDDISLDSQEGCSNDDDKEK

Query:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLE----------EENDEENDD
        +  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+LSPQMV L+PGLQRL    NKQ   E          E +D+++DD
Subjt:  EVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLE----------EENDEENDD

Query:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        +D KR+ +RPYLSEAW I+RPNSPLL LRMPKVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  EDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

A0A5D3DPB9 Uncharacterized protein2.6e-5065.43Show/hide
Query:  IIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGL
        ++PSS    S SS D+ISL SQE  +N++DK+K+  KRES+E    NNLK SVG KLNKS SC+SLGELELEEVKGF+DLGFEFKRE+LSPQMV L+PGL
Subjt:  IIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEKMP-NNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGL

Query:  QRLGIPINKQKQLE----------EENDEENDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        QRL    NKQ   E          E +D+++DD+D KR+ +RPYLSEAW I+RPNSPLL LRMPKVSSTSDMKKHL+ WAKTVA EIQ
Subjt:  QRLGIPINKQKQLE----------EENDEENDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1CVW5 uncharacterized protein LOC1110150564.3e-12297.47Show/hide
Query:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRES
        MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFS LSVSSVDDISLDS EGCSNDDDKEKEVEKRES
Subjt:  MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRES

Query:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTI
        TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENL+PQMVTLLPGLQRLGIPINK+KQLEEENDE NDDEDHKRDKSRPYLSEAWTI
Subjt:  TEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTI

Query:  KRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        KRPNSPLLQLRM KVSSTSDMKKHLKFWAKTVASEIQ
Subjt:  KRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1J2S9 uncharacterized protein LOC111482158 isoform X23.8e-5457.26Show/hide
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FP      S L+++ I            +DD SL SQE  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTIKRP
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENLSPQMV L+PGLQR       + +++++N E++DD+D KRD +RPYLSEAWTI RP
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTIKRP

Query:  NSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        NSPLL LRMPKVSSTSDMKK LK WA+TVA EIQ
Subjt:  NSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

A0A6J1J6B4 uncharacterized protein LOC111482158 isoform X12.2e-5457.74Show/hide
Query:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEK
        NT TLCLVSAMDRLW+HQIIL S   S SHL     T PF+ FP      S L+++ I            +DD SL SQE  SND DK K+  K E+ E+
Subjt:  NTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEK

Query:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDD----EDH-KRDKSRPYLSEAW
           + + ++  KLNK+ SC+SLGELE+EEVKGF+DLGF+F+ ENLSPQMV L+PGLQR    ++KQ  LE+++D+++DD    +DH KRD +RPYLSEAW
Subjt:  MPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDD----EDH-KRDKSRPYLSEAW

Query:  TIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ
        TI RPNSPLL LRMPKVSSTSDMKK LK WA+TVA EIQ
Subjt:  TIKRPNSPLLQLRMPKVSSTSDMKKHLKFWAKTVASEIQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G31560.1 Protein of unknown function (DUF1685)4.2e-0530.25Show/hide
Query:  RSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMPKVSST
        +SL + +LEE+KG +DLGF F  + + P++   LP L+       K    +++N  ++ +ED   D S P  + A    W I  P               
Subjt:  RSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMPKVSST

Query:  SDMKKHLKFWAKTVASEIQ
         D+K  LK+WA+TVA  ++
Subjt:  SDMKKHLKFWAKTVASEIQ

AT2G31560.2 Protein of unknown function (DUF1685)4.2e-0530.25Show/hide
Query:  RSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMPKVSST
        +SL + +LEE+KG +DLGF F  + + P++   LP L+       K    +++N  ++ +ED   D S P  + A    W I  P               
Subjt:  RSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEA----WTIKRPNSPLLQLRMPKVSST

Query:  SDMKKHLKFWAKTVASEIQ
         D+K  LK+WA+TVA  ++
Subjt:  SDMKKHLKFWAKTVASEIQ

AT2G42760.1 unknown protein3.0e-1130.81Show/hide
Query:  PSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKR-ENL
        P   +P+  +TI       LS   V+  ++  +E   ++ +++++ +K++S      N+++  G         +S+ +LE EE+KGF+DLGF F   ++ 
Subjt:  PSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEKMPNNLKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKR-ENL

Query:  SPQMVTLLPGLQRLGIPINKQKQL-EEENDEENDDEDHKRDKSRPYLSEAWTI------KRPNSPLLQLRMPKVSSTS--DMKKHLKFWAKTVASEIQ
           +V++LPGLQRL   + K   + +EE +EE +D+      +RPYLSEAW        K+  +P ++ R+P  ++ S  D+K +L+ WA  VAS I+
Subjt:  SPQMVTLLPGLQRLGIPINKQKQL-EEENDEENDDEDHKRDKSRPYLSEAWTI------KRPNSPLLQLRMPKVSSTS--DMKKHLKFWAKTVASEIQ

AT2G43340.1 Protein of unknown function (DUF1685)1.4e-0527.87Show/hide
Query:  TNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEKMPNN-LKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTL
        ++  I  SSFSS S S  ++  +++  G      + K++EK++S   +    + S+V   L ++K   SL + +LEE+KG +DLGF F  E + P++   
Subjt:  TNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEKMPNN-LKSSVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTL

Query:  LPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKV-SSTSDMKKHLKFWAKTVASEIQ
        LP L           +L     ++  D+DH    S     ++  +  P SP+   ++     +  D+K  LKFWA+ VA  ++
Subjt:  LPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKV-SSTSDMKKHLKFWAKTVASEIQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTATGAACACTTGTACTTTATGTTTGGTTTCAGCCATGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCAGATCCTCTAAGTTCTTCCCATCTTCCTAATTT
TGACCAAACTTTGCCTTTCACGAAGTTTCCTTCTTGCCCATCTCCCTCTTCACCTCTAACAAATGAAACAATTATCCCCTCCTCTTTCTCGTCTCTCTCGGTTTCCTCTG
TCGATGATATCTCCCTCGACTCACAGGAAGGTTGTAGTAATGATGATGACAAAGAGAAAGAAGTTGAAAAGAGAGAATCAACTGAAAAAATGCCCAACAATCTCAAATCT
TCAGTGGGGATAAAATTGAACAAATCAAAAAGTTGTAGAAGTTTGGGAGAGTTGGAACTTGAAGAAGTTAAAGGGTTTATTGATTTAGGGTTTGAATTCAAGAGAGAAAA
TTTGAGCCCTCAAATGGTGACATTACTTCCTGGTTTACAAAGACTTGGAATTCCCATAAACAAACAGAAACAGCTTGAAGAAGAAAATGATGAAGAAAATGATGATGAAG
ATCATAAGAGAGATAAATCAAGGCCATATCTCTCGGAGGCATGGACAATAAAAAGACCAAATTCTCCTCTTTTACAACTAAGGATGCCAAAGGTTTCTTCGACCTCGGAC
ATGAAGAAACACCTCAAATTTTGGGCTAAAACTGTTGCGTCTGAAATTCAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTATGAACACTTGTACTTTATGTTTGGTTTCAGCCATGGATCGCCTTTGGTACCACCAAATCATTCTTTGGTCAGATCCTCTAAGTTCTTCCCATCTTCCTAATTT
TGACCAAACTTTGCCTTTCACGAAGTTTCCTTCTTGCCCATCTCCCTCTTCACCTCTAACAAATGAAACAATTATCCCCTCCTCTTTCTCGTCTCTCTCGGTTTCCTCTG
TCGATGATATCTCCCTCGACTCACAGGAAGGTTGTAGTAATGATGATGACAAAGAGAAAGAAGTTGAAAAGAGAGAATCAACTGAAAAAATGCCCAACAATCTCAAATCT
TCAGTGGGGATAAAATTGAACAAATCAAAAAGTTGTAGAAGTTTGGGAGAGTTGGAACTTGAAGAAGTTAAAGGGTTTATTGATTTAGGGTTTGAATTCAAGAGAGAAAA
TTTGAGCCCTCAAATGGTGACATTACTTCCTGGTTTACAAAGACTTGGAATTCCCATAAACAAACAGAAACAGCTTGAAGAAGAAAATGATGAAGAAAATGATGATGAAG
ATCATAAGAGAGATAAATCAAGGCCATATCTCTCGGAGGCATGGACAATAAAAAGACCAAATTCTCCTCTTTTACAACTAAGGATGCCAAAGGTTTCTTCGACCTCGGAC
ATGAAGAAACACCTCAAATTTTGGGCTAAAACTGTTGCGTCTGAAATTCAA
Protein sequenceShow/hide protein sequence
MAMNTCTLCLVSAMDRLWYHQIILWSDPLSSSHLPNFDQTLPFTKFPSCPSPSSPLTNETIIPSSFSSLSVSSVDDISLDSQEGCSNDDDKEKEVEKRESTEKMPNNLKS
SVGIKLNKSKSCRSLGELELEEVKGFIDLGFEFKRENLSPQMVTLLPGLQRLGIPINKQKQLEEENDEENDDEDHKRDKSRPYLSEAWTIKRPNSPLLQLRMPKVSSTSD
MKKHLKFWAKTVASEIQ