CuGenDBv2

Tan0020825 (gene) of Snake gourd v1 genome

Gene ID: Tan0020825
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110414781
Genome location: LG03:52189686..52191506
RNA-Seq Expression: Tan0020825
Synteny: Tan0020825
Gene Ontology terms:
GO:0010380 - regulation of chlorophyll biosynthetic process (biological process)
GO:0010581 - regulation of starch biosynthetic process (biological process)
GO:0019430 - removal of superoxide radicals (biological process)
GO:0042744 - hydrogen peroxide catabolic process (biological process)
GO:0043085 - positive regulation of catalytic activity (biological process)
GO:0045454 - cell redox homeostasis (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0004791 - thioredoxin-disulfide reductase activity (molecular function)
GO:0008047 - enzyme activator activity (molecular function)
GO:0016671 - oxidoreductase activity, acting on a sulfur group of donors, disulfide as acceptor (molecular function)
GO:0042802 - identical protein binding (molecular function)
InterPro domains: NA
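
The genome location field above uses a `seqid:start..end` layout. A minimal sketch of parsing it follows, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention). The names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any database API.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A parsed location; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive span, so both end points are counted.
        return self.end - self.start + 1


_LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text: str) -> GenomeLocation:
    """Parse a string such as 'LG03:52189686..52191506'."""
    match = _LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    start, end = int(match["start"]), int(match["end"])
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return GenomeLocation(match["seqid"], start, end)


loc = parse_location("LG03:52189686..52191506")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the location listed for Tan0020825 spans 1821 bases on LG03.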


Homology
GenBank top hits (e-value, % identity, alignment)
KAG7027627.1 hypothetical protein SDJN02_11642, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value: 2.9e-108; % identity: 83.97
Query:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV
        MAEFPCSLERTVASALLLLSTS PPPPPSPP+SV +DEWL EE  +GG   REI+   DYSKSCSS+LT SDESSETRA+EPL FST AYRD+LKLH   
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV

Query:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS
             VVRKSRSKLIRISENRNLTSTDDVTLSSGS SSE+SCLSS+SSVVTSAPIHRLVTRAEKKLEMIRHVWRK+H ATAHMRRRAEAILSYLSGGCSS
Subjt:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS

Query:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMVQDTLNSQLPLYLMALNQSR
        EVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIY V DTLNSQLPLYL+ L++ R
Subjt:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMVQDTLNSQLPLYLMALNQSR

XP_022924988.1 uncharacterized protein LOC111432371 [Cucurbita moschata]; e-value: 5.8e-101; % identity: 84.77
Query:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV
        MAEFPCSLERTVASALLLLSTS PPPPPSPP+SV +DEWL EE  +GG   REI+   DYSKSCSS+LT SDESSETRA+EPL FST AYRD+LKLH   
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV

Query:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS
             VVRKSRSKLIRISENRNLTSTDDVTLSSGS SSE+SCLSS+SSVVTSAPIHRLVTRAEKKLEMIRHVWRK+H ATAHMRRRAEAILSYLSGGCSS
Subjt:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS

Query:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        EVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIY +
Subjt:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

XP_022928741.1 uncharacterized protein LOC111435559 [Cucurbita moschata]; e-value: 1.6e-98; % identity: 83.4
Query:  MAEFPCSLERTVASALLLLSTSPPPPP---PSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH
        MAEFPC+LERTVASALLLLSTSPPPPP   PSP I +S+DEWLFEEKI+GGKCS E+S FCD SKSCSS+LT SDESSETRAQE L FSTSAYRDELKL 
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPP---PSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH

Query:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG
               +VVRKSRS+ +RIS NRNLT TDDVTLSSGSASSET+ CLSS+SSV TSAPI RLVTRAEKKLEMIRH WRKKH A+AHMRRRAEAILSYLSG
Subjt:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG

Query:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIY +
Subjt:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

XP_022967938.1 uncharacterized protein LOC111467304 [Cucurbita maxima]; e-value: 1.6e-98; % identity: 83.4
Query:  MAEFPCSLERTVASALLLLSTSPPP---PPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH
        MAEFPC+LERTVASALLLLSTSPPP   PPPSP I +S+DEWLFEEKI+GGKCS E+S FCD SKSCSS+LT SDESSETRAQE L FSTSAYRDELKL 
Subjt:  MAEFPCSLERTVASALLLLSTSPPP---PPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH

Query:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG
               +VVRKSRS+ +RIS NRNLT TDDVTLSSGSASSET+ CLSS+SSV TSAPI RLVTRAEKKLEMIRH WRKKH A+AHMRRRAEAILSYLSG
Subjt:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG

Query:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIY +
Subjt:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

XP_023518044.1 uncharacterized protein LOC111781591 [Cucurbita pepo subsp. pepo]; e-value: 3.2e-99; % identity: 83.95
Query:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV
        MAEFPCSLERTVASALLLLSTS PPPPPSPP+SV +DEWL EE  +G    REI+   DYSKSCSS+LT SDESSETRA+EPL  ST AYRD+LKLH   
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV

Query:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS
             VVRKSRSKLIRISENRNLTSTDDVTLSSGS SSE+SCLSS+SSVVTSAPIHRLVTRAEKKLEMIRHVWRK+H ATAHMRRRAEAILSYLSGGCSS
Subjt:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS

Query:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        EVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIY +
Subjt:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

TrEMBL top hits (e-value, % identity, alignment)
A0A6J1DFG0 uncharacterized protein LOC111020004; e-value: 2.9e-98; % identity: 85.25
Query:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPI-SVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVI
        MAEFPCSLER+VASALLLLSTSPPPPPPSPP  SVSRDEWLFE KI  GK SRE+ AFCDYSKSCSSILTG DESS+TR QEPL FSTSAY DELKL   
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPI-SVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVI

Query:  VLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCS
             +VVRKSRSKLIRISENRN +S DD TLSSGSASSETSCLSS+S+VVTSAP  RLVTRAEKKLEMIRHVWRKK  A+AHMRRRAEAIL YLSGGCS
Subjt:  VLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCS

Query:  SEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        SEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYM+
Subjt:  SEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

A0A6J1EE02 uncharacterized protein LOC111432371; e-value: 2.8e-101; % identity: 84.77
Query:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV
        MAEFPCSLERTVASALLLLSTS PPPPPSPP+SV +DEWL EE  +GG   REI+   DYSKSCSS+LT SDESSETRA+EPL FST AYRD+LKLH   
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV

Query:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS
             VVRKSRSKLIRISENRNLTSTDDVTLSSGS SSE+SCLSS+SSVVTSAPIHRLVTRAEKKLEMIRHVWRK+H ATAHMRRRAEAILSYLSGGCSS
Subjt:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS

Query:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        EVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIY +
Subjt:  EVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

A0A6J1EL58 uncharacterized protein LOC111435559; e-value: 7.6e-99; % identity: 83.4
Query:  MAEFPCSLERTVASALLLLSTSPPPPP---PSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH
        MAEFPC+LERTVASALLLLSTSPPPPP   PSP I +S+DEWLFEEKI+GGKCS E+S FCD SKSCSS+LT SDESSETRAQE L FSTSAYRDELKL 
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPP---PSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH

Query:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG
               +VVRKSRS+ +RIS NRNLT TDDVTLSSGSASSET+ CLSS+SSV TSAPI RLVTRAEKKLEMIRH WRKKH A+AHMRRRAEAILSYLSG
Subjt:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG

Query:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIY +
Subjt:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

A0A6J1HSD4 uncharacterized protein LOC111466163; e-value: 5.7e-86; % identity: 82.27
Query:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV
        MAEFPCSLERTVASALLLLSTS PPPPPSP +SV +DEWL EE  +GG   REI+   DYSKSCSS+ T SDESSETR +EPL FST AYRD+LKLH   
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIV

Query:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS
             VVRKSRSKLIRISENRNLTSTDDVTL+SGS SSE+SCLSS+SSVVTSAPIHRLVTRAEKKLEMIRHVWRK+H ATAHMRRRAEAI+SYLSGGCSS
Subjt:  LSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSS

Query:  EVKIRQVLGDSPDTSKALRM
        EVKIRQVLGDSPDTSKALRM
Subjt:  EVKIRQVLGDSPDTSKALRM

A0A6J1HWL4 uncharacterized protein LOC111467304; e-value: 7.6e-99; % identity: 83.4
Query:  MAEFPCSLERTVASALLLLSTSPPP---PPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH
        MAEFPC+LERTVASALLLLSTSPPP   PPPSP I +S+DEWLFEEKI+GGKCS E+S FCD SKSCSS+LT SDESSETRAQE L FSTSAYRDELKL 
Subjt:  MAEFPCSLERTVASALLLLSTSPPP---PPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLH

Query:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG
               +VVRKSRS+ +RIS NRNLT TDDVTLSSGSASSET+ CLSS+SSV TSAPI RLVTRAEKKLEMIRH WRKKH A+AHMRRRAEAILSYLSG
Subjt:  VIVLSVSSVVRKSRSKLIRISENRNLTSTDDVTLSSGSASSETS-CLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSG

Query:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIY +
Subjt:  GCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G57440.1 unknown protein; e-value: 1.2e-24; % identity: 39.45
Query:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRD----EWLFEEKIVGGKCSREISAFCDYSKSCSSILT--GSDESSETRAQEPLSFSTSAYRDEL
        MA +P  +ERTVAS+LLLLS  P    P    SV       +W  E     G  +  +      S+SC S L+  GS   SE R +  ++++   +R   
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRD----EWLFEEKIVGGKCSREISAFCDYSKSCSSILT--GSDESSETRAQEPLSFSTSAYRDEL

Query:  KLHVIVLSVSSVVRKSRSKLIRISEN-----RNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWR--KKHAATAHMRRRA
              L      RK RS++I  S N       +    DV  +    S + SCLS+ SS V+S    R+  R +K  E +R   +  K+ + ++ +RRRA
Subjt:  KLHVIVLSVSSVVRKSRSKLIRISEN-----RNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWR--KKHAATAHMRRRA

Query:  EAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
        + IL +LS   SSEV IRQ+LGDSPDTSKALRMLLK+EE+KR GTGGR DP+IY +
Subjt:  EAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYMV
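
The alignment blocks above follow a BLAST-style layout in which the middle line carries a letter for each identical position, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below shows how identity could be tallied from one such block. The function name `midline_stats` is hypothetical, and the result for a single block will generally differ from the % identity reported for a hit, which is presumably computed over the whole alignment.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Tally identities and positives for one BLAST-style alignment block.

    Trailing spaces in the midline are often lost when a page is copied,
    so the midline is padded to the length of the query line.
    """
    padded = midline.ljust(len(query))
    length = len(padded)
    identities = sum(1 for ch in padded if ch.isalpha())
    conservative = padded.count("+")
    pct = round(100.0 * identities / length, 2) if length else 0.0
    return {
        "length": length,
        "identities": identities,
        "positives": identities + conservative,
        "pct_identity": pct,
    }


# Last block of the A0A6J1HSD4 alignment, where every position is identical.
stats = midline_stats("EVKIRQVLGDSPDTSKALRM", "EVKIRQVLGDSPDTSKALRM")
print(stats["identities"], stats["length"], stats["pct_identity"])
```

Padding to the query length is a design choice for scraped text; with untouched BLAST output the midline already has the same width as the query line.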


Sequences
CDS sequence
ATGGCAGAGTTCCCTTGCTCTCTAGAACGCACCGTCGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCTCCTCCGCCGCCGTCTCCACCGATATCGGTTTCTCGAGA
CGAGTGGTTGTTTGAGGAGAAAATTGTCGGAGGAAAATGCTCGAGAGAGATATCGGCGTTTTGTGATTATTCGAAGTCTTGCTCTTCGATACTCACTGGATCAGATGAAT
CGTCCGAGACTCGAGCGCAGGAGCCGTTGTCGTTCTCTACTTCGGCTTATCGCGACGAGCTAAAGCTTCATGTAATCGTTCTATCCGTCTCGTCTGTCGTGAGAAAGAGT
CGTTCGAAGCTAATACGGATATCCGAGAACCGGAATCTCACTTCTACAGACGACGTTACCTTGTCCTCAGGCTCCGCGTCCTCGGAGACGTCTTGTTTGTCAAGCACCTC
AAGCGTGGTCACAAGCGCGCCAATCCATCGTCTGGTTACCAGAGCAGAGAAGAAGTTGGAAATGATTCGTCACGTATGGAGGAAAAAGCACGCCGCAACGGCTCACATGC
GCCGGCGGGCCGAAGCTATTCTGAGCTACCTCTCCGGTGGTTGTTCCTCTGAGGTGAAGATACGCCAAGTGCTTGGCGACAGCCCTGACACAAGCAAGGCTCTCAGAATG
CTGTTGAAGCTGGAAGAGATTAAAAGATCCGGAACAGGTGGGCGCCAAGATCCCTATATTTACATGGTACAAGATACTCTCAACTCCCAACTTCCTCTCTACTTGATGGC
TTTAAATCAATCAAGACCAACAAAACCTATTAATTTACACAGATTTACTTGA
mRNA sequence
ATGGCAGAGTTCCCTTGCTCTCTAGAACGCACCGTCGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCTCCTCCGCCGCCGTCTCCACCGATATCGGTTTCTCGAGA
CGAGTGGTTGTTTGAGGAGAAAATTGTCGGAGGAAAATGCTCGAGAGAGATATCGGCGTTTTGTGATTATTCGAAGTCTTGCTCTTCGATACTCACTGGATCAGATGAAT
CGTCCGAGACTCGAGCGCAGGAGCCGTTGTCGTTCTCTACTTCGGCTTATCGCGACGAGCTAAAGCTTCATGTAATCGTTCTATCCGTCTCGTCTGTCGTGAGAAAGAGT
CGTTCGAAGCTAATACGGATATCCGAGAACCGGAATCTCACTTCTACAGACGACGTTACCTTGTCCTCAGGCTCCGCGTCCTCGGAGACGTCTTGTTTGTCAAGCACCTC
AAGCGTGGTCACAAGCGCGCCAATCCATCGTCTGGTTACCAGAGCAGAGAAGAAGTTGGAAATGATTCGTCACGTATGGAGGAAAAAGCACGCCGCAACGGCTCACATGC
GCCGGCGGGCCGAAGCTATTCTGAGCTACCTCTCCGGTGGTTGTTCCTCTGAGGTGAAGATACGCCAAGTGCTTGGCGACAGCCCTGACACAAGCAAGGCTCTCAGAATG
CTGTTGAAGCTGGAAGAGATTAAAAGATCCGGAACAGGTGGGCGCCAAGATCCCTATATTTACATGGTACAAGATACTCTCAACTCCCAACTTCCTCTCTACTTGATGGC
TTTAAATCAATCAAGACCAACAAAACCTATTAATTTACACAGATTTACTTGA
Protein sequence
MAEFPCSLERTVASALLLLSTSPPPPPPSPPISVSRDEWLFEEKIVGGKCSREISAFCDYSKSCSSILTGSDESSETRAQEPLSFSTSAYRDELKLHVIVLSVSSVVRKS
RSKLIRISENRNLTSTDDVTLSSGSASSETSCLSSTSSVVTSAPIHRLVTRAEKKLEMIRHVWRKKHAATAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRM
LLKLEEIKRSGTGGRQDPYIYMVQDTLNSQLPLYLMALNQSRPTKPINLHRFT
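
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and a CDS beginning in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate a coding sequence, ignoring whitespace and line breaks.

    Codons containing ambiguous bases become 'X'. When to_stop is True,
    translation ends before the first stop codon.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


# First 33 bases of the CDS listed above.
print(translate("ATGGCAGAGTTCCCTTGCTCTCTAGAACGCACC"))  # MAEFPCSLERT
```

The first eleven codons give MAEFPCSLERT, which matches the start of the listed protein sequence. Passing the full CDS with its line breaks to `translate` lets the rest be compared the same way.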