; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0020792 (gene) of Snake gourd v1 genome

Gene IDTan0020792
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionBEST Arabidopsis thaliana protein match is: embryo defective 1303 .
Genome locationLG01:20315735..20330741
RNA-Seq ExpressionTan0020792
SyntenyTan0020792
Gene Ontology termsGO:0009658 - chloroplast organization (biological process)
GO:0010027 - thylakoid membrane organization (biological process)
GO:0009507 - chloroplast (cellular component)
InterPro domainsIPR040299 - Protein FERTILITY RESTORER RF2-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6596785.1 Protein YELLOW LEAF 1, choloroplastic, partial [Cucurbita argyrosperma subsp. sororia]8.2e-7786.63Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA
        MLTT SGAML LLPPVGQHWGRMK ++FHL RDSF+V+  N RR+PV   AGP ++KPI +TG RGGVL SSRKNNAFICFAALNARCAAEQTQTVTREA
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA

Query:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_022949842.1 uncharacterized protein LOC111453117 [Cucurbita moschata]5.3e-7686.05Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA
        MLTT SGAML LLPPVGQHWGRMK ++FHL  DSF+V+  N RR+PV   AGP ++KPI +TG RGGVL SSRKNNAFICFAALNARCAAEQTQTVTREA
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA

Query:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_022965198.1 uncharacterized protein LOC111465127 [Cucurbita maxima]1.5e-7586.13Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRK-NNAFICFAALNARCAAEQTQTVTRE
        MLTTNSGAML LLPPVGQHWG+MKV DF LRRDS ++Q PN RR+ V   AGP FLKPIP+TGT+GG L S+RK NNAFICFAALNARCAAEQTQTVTRE
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRK-NNAFICFAALNARCAAEQTQTVTRE

Query:  APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023005574.1 uncharacterized protein LOC111498519 isoform X1 [Cucurbita maxima]2.5e-7887.79Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA
        MLTTNSGAMLPLLPPVGQHWGRMK ++FHL RDSF+V+  N RR+PV   AGP ++KPI +TGT GGVL SSRKNNAFICFAALNARCAAEQTQTVTREA
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA

Query:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

XP_023540533.1 uncharacterized protein LOC111800870 [Cucurbita pepo subsp. pepo]5.1e-7987.79Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA
        MLTTNSGAMLPLLPPVGQHWGRM+ ++FHL RDSF+VQ  N RR+PV   AGP ++KPI +TGTRGGVL SSRKNNAF+CFAALNARCAAEQTQTVTREA
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA

Query:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

TrEMBL top hitse value%identityAlignment
A0A1S4DXD8 uncharacterized protein LOC1034909991.8e-7484.3Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA
        MLTTNS  MLPLLPPVGQHWGRMK  D  LRR S + Q PN RR+ +   AGP FLKPIP+TGT+GG+L SSRKN+ FIC AALNARCAAEQTQTVTREA
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA

Query:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQ+DRRR
Subjt:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1F258 uncharacterized protein LOC1114387814.8e-7585.55Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRK-NNAFICFAALNARCAAEQTQTVTRE
        MLTT+SGAML LLPPVGQHWG+MK  DF LRRDS ++Q PN RR+ V   AGP FLKPIP+TGT+GGVL S+RK NNAFICFAALNARCAAEQTQTVTRE
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRK-NNAFICFAALNARCAAEQTQTVTRE

Query:  APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1GD79 uncharacterized protein LOC1114531172.6e-7686.05Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA
        MLTT SGAML LLPPVGQHWGRMK ++FHL  DSF+V+  N RR+PV   AGP ++KPI +TG RGGVL SSRKNNAFICFAALNARCAAEQTQTVTREA
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA

Query:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1HQC3 uncharacterized protein LOC1114651277.5e-7686.13Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRK-NNAFICFAALNARCAAEQTQTVTRE
        MLTTNSGAML LLPPVGQHWG+MKV DF LRRDS ++Q PN RR+ V   AGP FLKPIP+TGT+GG L S+RK NNAFICFAALNARCAAEQTQTVTRE
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRK-NNAFICFAALNARCAAEQTQTVTRE

Query:  APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEG Y++DRRR
Subjt:  APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

A0A6J1KZM4 uncharacterized protein LOC111498519 isoform X11.2e-7887.79Show/hide
Query:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA
        MLTTNSGAMLPLLPPVGQHWGRMK ++FHL RDSF+V+  N RR+PV   AGP ++KPI +TGT GGVL SSRKNNAFICFAALNARCAAEQTQTVTREA
Subjt:  MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREA

Query:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE PYQNDRRR
Subjt:  PTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

SwissProt top hitse value%identityAlignment
F1SZ41 Protein FERTILITY RESTORER RF2, mitochondrial8.4e-0849.4Show/hide
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

F1SZ42 Protein FERTILITY RESTORER RF2, mitochondrial8.4e-0849.4Show/hide
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

F1SZ44 Protein FERTILITY RESTORER RF2, mitochondrial4.9e-0849.4Show/hide
Query:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE
        + RC A QTQ+  R++ T TV      GK + P+LDDG  GFPP   G GGGGGGGGGG  N+ GGF  F  +  L +LK+ E
Subjt:  NARCAAEQTQTVTREAPTITVL----PGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGG--NWSGGFFFFGFLAFLGFLKDKE

Q0E3V2 Protein YELLOW LEAF 1, choloroplastic1.7e-0844.44Show/hide
Query:  CFAALNARCAAEQTQTVTRE------APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR
        C A++   C A QTQT  R+      +P    +  K +SP+LDDG +GFPP   G GGGGGGGGG N +GGF  F  +  L +L  +E E   QN  RR
Subjt:  CFAALNARCAAEQTQTVTRE------APTITVLPGKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR

Arabidopsis top hitse value%identityAlignment
AT1G30475.1 BEST Arabidopsis thaliana protein match is: embryo defective 1303 (TAIR:AT1G56200.1)1.6e-2565.31Show/hide
Query:  VLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        V+ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPRDDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  VLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G30475.2 FUNCTIONS IN: molecular_function unknown1.6e-2565.31Show/hide
Query:  VLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        V+ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPRDDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  VLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G30475.3 FUNCTIONS IN: molecular_function unknown1.6e-2565.31Show/hide
Query:  VLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE
        V+ S  K   F C +ALN++C+  QTQTVTR++PTIT  P  GK KSP+LDDG +GFPPRDDG GGGGGGGGGG+ SGGFF FGFL F+G+LKD E E
Subjt:  VLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESE

AT1G56200.1 embryo defective 13037.0e-3465.55Show/hide
Query:  LKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF
        + P+ A G   G+   SR+ +  IC +A+NA+C+  QTQTVTRE+PTIT  P   KEKSP LDDG  GFPPRDDGD GGGGGGGGGNWSGGFFFFGFLAF
Subjt:  LKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF

Query:  LGFLKDKESEGPYQNDRRR
        LG LKDKE E  Y+  RRR
Subjt:  LGFLKDKESEGPYQNDRRR

AT1G56200.2 embryo defective 13037.0e-3465.55Show/hide
Query:  LKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF
        + P+ A G   G+   SR+ +  IC +A+NA+C+  QTQTVTRE+PTIT  P   KEKSP LDDG  GFPPRDDGD GGGGGGGGGNWSGGFFFFGFLAF
Subjt:  LKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLP--GKEKSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAF

Query:  LGFLKDKESEGPYQNDRRR
        LG LKDKE E  Y+  RRR
Subjt:  LGFLKDKESEGPYQNDRRR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTACTACCAATTCTGGTGCCATGCTACCTCTACTACCCCCAGTGGGACAGCATTGGGGAAGAATGAAGGTCCTAGACTTCCATCTTAGAAGGGATAGTTTCCGAGT
TCAAATTCCTAATGAGAGGAGACGACCAGTGACTGGTAATGCAGGACCACCTTTTCTCAAACCAATTCCAGCTACTGGGACGAGGGGAGGTGTTCTACGTTCTAGTAGAA
AGAACAACGCTTTCATATGTTTTGCTGCTTTGAATGCTAGATGTGCCGCGGAGCAGACCCAGACCGTTACAAGAGAGGCTCCTACAATCACTGTTCTTCCTGGCAAGGAG
AAGTCACCGCAACTGGATGATGGTGATTCTGGATTCCCACCTCGTGATGATGGTGATGGCGGCGGTGGAGGTGGCGGCGGCGGAGGCAACTGGTCTGGTGGTTTCTTTTT
CTTTGGCTTCCTTGCTTTCCTAGGCTTCTTAAAAGATAAAGAAAGCGAAGGGCCTTATCAGAATGATCGGAGAAGATAA
mRNA sequenceShow/hide mRNA sequence
ATTAATAAAAGAGAGAGAGAAAGAGAGATGGATGAATATGAAGAAAACCCATCATTCTTGGAAGGTATAGGAAACCCATCCACCATGGGAAAAAGTGATGCATGGGAATA
AGTGATACCTATGCCCATGCTTACAAACCTTGTTCCAAACAAGGAGATTTAAAACCCTTACCCATTCCCATCCACAAAACCTTGCACCAAACGGCCCCATCTCTTCTCCA
TTCCAATGCTCCAAGTCTCCAACTCCATTCTCAATTCTGTCATTCACCCTTTCGATTCCCAGTCTCATCTACATTCATTCACCCGTAGCATCTTCTTGCGAGAAGGAGAG
GAGAGACCAGAAGTTCAGCGCCGACTCCGGCTCTGGCGTCGATCGTGCGAGGTGCGGCAACCGTGCTTTCATTCACCCTTTCAGTTTCTACAGCACTGCCTCCGGCTCCG
GCGAAACAATCGACGTCTTCTTCAGCAGTCAGCGACCTCCAGTCCCTCGACCTCCGACTCCGACTTCTCCAGCCTCAGATCTGACGTCTCCAGCCCCAGATCCGGATCCG
ACATCTCTTGATTTGTTCCTCTGCTTTGGATTCCTCTGCCTCCTTTAGTTTCTGCTTGTGTCAGGTTATCATCCACCCTCGTAGACGGAGATGCTTACTACCAATTCTGG
TGCCATGCTACCTCTACTACCCCCAGTGGGACAGCATTGGGGAAGAATGAAGGTCCTAGACTTCCATCTTAGAAGGGATAGTTTCCGAGTTCAAATTCCTAATGAGAGGA
GACGACCAGTGACTGGTAATGCAGGACCACCTTTTCTCAAACCAATTCCAGCTACTGGGACGAGGGGAGGTGTTCTACGTTCTAGTAGAAAGAACAACGCTTTCATATGT
TTTGCTGCTTTGAATGCTAGATGTGCCGCGGAGCAGACCCAGACCGTTACAAGAGAGGCTCCTACAATCACTGTTCTTCCTGGCAAGGAGAAGTCACCGCAACTGGATGA
TGGTGATTCTGGATTCCCACCTCGTGATGATGGTGATGGCGGCGGTGGAGGTGGCGGCGGCGGAGGCAACTGGTCTGGTGGTTTCTTTTTCTTTGGCTTCCTTGCTTTCC
TAGGCTTCTTAAAAGATAAAGAAAGCGAAGGGCCTTATCAGAATGATCGGAGAAGATAAACATACTGTGATCAAATATATAATGTAGATTCTATAGATACAATCTCAAAA
TTAACTTTTGTTTTTAGTCATGTTAATCTCCATTAAATTCATTTCACCCAGTAGTTTGCTTTTAATTCAGTAATCCCTATGGACTGGAGTAAGAGCATAAAGTTTTCTGA
GATCTTATTTGAAATTTCAGTAGTCAAAGATATAAATGGTGGAGATGAAAGCTAATGAGTTCCTTGTTGATAGAAACAATGGTTGCTGATTTTAGGGTATTTGTAGTCCT
TGACCTTTAAGCCAGAGGCTGAGGGGTTCTGTTCAAATGTTAGTTCTGAGGGACTCCTAGCTGTGAAGCCTATCCCATTTCCTTAGTTGTGCTAAATGGAAATCTGGTAT
AATTTAATAAAATATTTCAAGTTTATTAACTAATAAACTATTTTAATATAATATATTAGGAGTTGTTTGGGCTGAGTAGATTATAATAATATGAGTTCTAATAATTTGTG
GGTTATTATAATCTGTGGAATCAAGTAATATTATTTAAAATACATAGTAGTATAGTCTGAGATTATAATAGCCTGTGTGTGATGTAGAGTATTCCAAGTTTGTAATAGAT
CCTTATTGATTGTACTAAAATAATTTGTAAATTTTCTTTGATCATGCCAGAAATAGAAAAGGCCATCTTTTATTGTTTTTTTTTTTAGGTGTAACAACACTTATTCAATA
ATTTTGTATTAGTCAACCTTGTATTTTTGTTGCTAAATAATTGTTGTATAAAGTATAATGTATTAAGGCGTC
Protein sequenceShow/hide protein sequence
MLTTNSGAMLPLLPPVGQHWGRMKVLDFHLRRDSFRVQIPNERRRPVTGNAGPPFLKPIPATGTRGGVLRSSRKNNAFICFAALNARCAAEQTQTVTREAPTITVLPGKE
KSPQLDDGDSGFPPRDDGDGGGGGGGGGGNWSGGFFFFGFLAFLGFLKDKESEGPYQNDRRR