CuGenDBv2

Sed0003642 (gene) of Chayote v1 genome

Gene ID: Sed0003642
Organism: Sechium edule (Chayote v1)
Description: Beta-1,3-N-Acetylglucosaminyltransferase family protein
Genome location: LG13:24950244..24954399
RNA-Seq Expression: Sed0003642
Synteny: Sed0003642
Gene Ontology terms: GO:0001709 - cell fate determination (biological process)
InterPro domains: IPR040361 - Tapetum determinant 1


Homology
GenBank top hits (e value, % identity, alignment)
KAA0063321.1 TPD1 protein-like protein 1A-like [Cucumis melo var. makuwa] | e value: 1.2e-53 | % identity: 83.46
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN STMIAAI+IL LS IN+G  A SCSLDTINIGTQRSGREIGGQPEWNVQ+INNCDCPQKQI+LSC GFQT EPVD SILS+Q D CLLINGGIVQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCPT
        GSSVSFSYAWDPP  M PRFSV+ CPT
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCPT

XP_022141679.1 uncharacterized protein LOC111011980 [Momordica charantia] | e value: 8.5e-55 | % identity: 85.71
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN STMIAAILIL LS INKGS   +CSLDTINIGT+RSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQT EPV  SILS+QGDTCLLING  VQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCP
         +SVSFSYAWDPPF MFPRFSV QCP
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCP

XP_022937408.1 uncharacterized protein LOC111443708 [Cucurbita moschata] | e value: 1.1e-54 | % identity: 85.04
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN S MIAAILILFLS IN+G  A SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQKQIVLSCPGFQT EPVD SI+S+QGD C LINGGIVQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCPT
        GSSV FSYAWDPPF +FPRFSV+QC T
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCPT

XP_022976215.1 uncharacterized protein LOC111476671 [Cucurbita maxima] | e value: 2.9e-55 | % identity: 85.83
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN S MIAAILILFLS IN+G  A SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQKQIVLSCPGFQT EPVD SI+S+QGD CLLINGGIVQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCPT
        GSSV FSYAWDPPF +FPRFSV+QC T
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCPT

XP_038899927.1 uncharacterized protein LOC120087115 [Benincasa hispida] | e value: 1.7e-55 | % identity: 86.61
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN  TMIAAI ILFLS IN+GS A SCSLDTINIGTQRSGREIGGQPEWNVQVINNC+CPQKQIVLSCPGFQT EPVD SILS+Q DTCLLINGG VQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCPT
        GSSVSFSYAWDPP  MFPR SV+QCPT
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCPT

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VC88 TPD1 protein-like protein 1A-like | e value: 5.9e-54 | % identity: 83.46
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN STMIAAI+IL LS IN+G  A SCSLDTINIGTQRSGREIGGQPEWNVQ+INNCDCPQKQI+LSC GFQT EPVD SILS+Q D CLLINGGIVQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCPT
        GSSVSFSYAWDPP  M PRFSV+ CPT
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCPT

A0A5D3E811 TPD1 protein-like protein 1A-like | e value: 9.5e-52 | % identity: 84.3
Query:  MIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSF
        MIAAI+IL LS IN+G  A SCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQI+LSC GFQT EPVD SILS+Q D CLLINGGIVQPGSSVSF
Subjt:  MIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSF

Query:  SYAWDPPFTMFPRFSVTQCPT
        SYAWDPP  M PRFSV+ CPT
Subjt:  SYAWDPPFTMFPRFSVTQCPT

A0A6J1CJF7 uncharacterized protein LOC111011980 | e value: 4.1e-55 | % identity: 85.71
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN STMIAAILIL LS INKGS   +CSLDTINIGT+RSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQT EPV  SILS+QGDTCLLING  VQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCP
         +SVSFSYAWDPPF MFPRFSV QCP
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCP

A0A6J1FFZ8 uncharacterized protein LOC111443708 | e value: 5.4e-55 | % identity: 85.04
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN S MIAAILILFLS IN+G  A SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQKQIVLSCPGFQT EPVD SI+S+QGD C LINGGIVQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCPT
        GSSV FSYAWDPPF +FPRFSV+QC T
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCPT

A0A6J1IIV5 uncharacterized protein LOC111476671 | e value: 1.4e-55 | % identity: 85.83
Query:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP
        MIN S MIAAILILFLS IN+G  A SCSLDTINIGTQRSGR IGGQPEWNVQVINNCDCPQKQIVLSCPGFQT EPVD SI+S+QGD CLLINGGIVQP
Subjt:  MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQP

Query:  GSSVSFSYAWDPPFTMFPRFSVTQCPT
        GSSV FSYAWDPPF +FPRFSV+QC T
Subjt:  GSSVSFSYAWDPPFTMFPRFSVTQCPT

SwissProt top hits (e value, % identity, alignment)
Q1G3T1 TPD1 protein homolog 1 | e value: 6.4e-05 | % identity: 30.34
Query:  VAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKQIVLSCPGFQTAEPVDTSILSQ-QGDTCLLINGGIVQPGSSVSFSYA
        + + CS D I +    +     G P + V++ N+C  DC   +I +SC  F +   V+  +  +   D CL+ +G  + PG S+SF YA
Subjt:  VAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKQIVLSCPGFQTAEPVDTSILSQ-QGDTCLLINGGIVQPGSSVSFSYA

Q8S6P9 TPD1 protein homolog 1B | e value: 2.9e-05 | % identity: 32.56
Query:  ESCSLDTINIGTQRSGREIGGQPEWNVQVINNCD-CPQKQIVLSCPGFQTAEPVDTSILSQQG-DTCLLINGGIVQPGSSVSFSYA
        +SCS   + +    +     G P ++V++IN C  C    + +SC  F +AE VD S   + G + CL+  GG + P  +VSF Y+
Subjt:  ESCSLDTINIGTQRSGREIGGQPEWNVQVINNCD-CPQKQIVLSCPGFQTAEPVDTSILSQQG-DTCLLINGGIVQPGSSVSFSYA

Arabidopsis top hits (e value, % identity, alignment)
AT1G32583.1 FUNCTIONS IN: molecular_function unknown | e value: 4.6e-06 | % identity: 30.34
Query:  VAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKQIVLSCPGFQTAEPVDTSILSQ-QGDTCLLINGGIVQPGSSVSFSYA
        + + CS D I +    +     G P + V++ N+C  DC   +I +SC  F +   V+  +  +   D CL+ +G  + PG S+SF YA
Subjt:  VAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC--DCPQKQIVLSCPGFQTAEPVDTSILSQ-QGDTCLLINGGIVQPGSSVSFSYA

AT4G32090.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein | e value: 1.6e-22 | % identity: 47.06
Query:  ILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAW
        +++L LS+++ G     C+   I IG  R+GREIGGQPEW V VIN C+C QK + LSC GF  A+PV   +L  QG+TCL+I G  +  G++  F+YA 
Subjt:  ILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAW

Query:  DP
         P
Subjt:  DP

AT4G32100.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein | e value: 4.9e-16 | % identity: 34.29
Query:  LILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAWD
        L+L  + + +G   +S SL+++++   ++G  +  +PEW V+V+N+  C      LSC  F++  P+D+ +LS+ GDTCLL NG  +     +SF Y WD
Subjt:  LILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAWD

Query:  PPFTM
          F +
Subjt:  PPFTM

AT4G32105.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein | e value: 1.5e-17 | % identity: 37.93
Query:  LILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAW
        L+LFL+ +N+G     C L+ +++   ++G+ +  +PEW V+V N C +C  +   LSC GFQ+  PV TS+LS+ GD CLL  G  + P     F+Y W
Subjt:  LILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAW

Query:  DPPFTMFPRFSVTQCP
        D  F +     V  CP
Subjt:  DPPFTMFPRFSVTQCP

AT4G32110.1 Beta-1,3-N-Acetylglucosaminyltransferase family protein | e value: 9.8e-17 | % identity: 35.85
Query:  LILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAW
        L+LFL+ +N+G     CSL+++++   ++G+ +  +PEW V+V N C +C  +   L C GF +  P+DTS+L + GD CL+  G  + P   + F Y W
Subjt:  LILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNC-DCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAW

Query:  DPPFTM
        D  F +
Subjt:  DPPFTM
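
The % identity values listed for the hits above can be related to the alignments themselves. In BLAST-style output the unlabeled row between Query and Subjt (the midline) shows a letter where the two residues are identical, a `+` for a conservative substitution, and a space otherwise. The following is a minimal Python sketch, not the database's own code; the function name `percent_identity` is hypothetical, and it assumes the midline has been copied without its leading indentation.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters mark identical residues; '+' marks a conservative substitution
    and a space marks a mismatch or gap, so only letters are counted.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / alignment_length, 2)


# First ten columns of the first GenBank alignment above (illustration only).
query = "MINFSTMIAA"
midline = "MIN STMIAA"
print(percent_identity(midline, len(query)))  # 90.0
```

As a rough consistency check, assuming the first GenBank alignment spans 127 columns, a listed identity of 83.46 would correspond to 106 identical positions (106 / 127).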


Sequences
CDS sequence
ATGATCAACTTCTCCACCATGATTGCAGCAATCCTCATCCTGTTCCTAAGTATCATCAACAAAGGGTCTGTTGCTGAAAGCTGCAGTTTGGACACCATTAATATAGGAAC
ACAGAGAAGCGGAAGGGAGATTGGGGGGCAGCCCGAATGGAACGTACAAGTGATCAACAACTGTGATTGTCCTCAGAAGCAGATTGTCCTGTCCTGTCCCGGGTTTCAAA
CGGCTGAGCCCGTTGATACATCGATTCTGTCGCAGCAAGGGGATACTTGCCTTCTCATAAACGGAGGAATTGTGCAGCCTGGTTCTTCAGTTTCGTTTTCCTATGCTTGG
GATCCCCCTTTCACCATGTTCCCTCGTTTCTCTGTTACTCAGTGTCCTACCTGA
mRNA sequence
CCTTCAAACTTTCATGTATATTATATTCCAACGAGCAAAATTTAGTCGAGGTCCATCAATGTAAACGTTTGTCCAATTCTGACAGCTTTCAAATCGGTCCAAGACAACTA
GTGGGATGATGGGGTGGTTCATATTAGATAACATGTGATTCCATTTCAAGACCAATTGACAATGAGCAGGAACTTATGCATCTTATAAATGTTGGAAGGCCTCTCCTTCC
CAATATAGAATCATCGACGTGTCTCATCAAGATGGTGCATCTTTGGATTCACAAGAATCCCAATTTATTTTTTTGGAACGTATACGCATTTGGTATTTATGAGCTGGGAC
CATATTAAATAACAATAAGTTTCATCTCAAAATTAATTGGCACTGAGTTGAGTTACTCGTGCATCTTATAAATGTTGGGAGGTCTCTCATCTTTCCAACGCGGTATCATC
GACCGCCTCTTGGGTGTTAGAAAGAGATTGTGGTTTAGCTGTTGACAGGTTATAGGAGAGAAGGAGATATGCTTTGTGTGAAAGAGAGAAAGAGATGGAAAAATAGCAGA
GTCATGCATGAAAGAAAGAGAAAGGTGGGTCTTGCATGTTAAAAGAGAGGAGAAAGATAGAAAAATAAATGAGTGGAGGGCCATGGGAAAAAAAGAGGAATGGATTTGGT
TGGCAAGATTGCATGTGAGCAAGAGGAGAAGAGAAAGATGGATTAGATTGGTGGATGATAAAAAGGAATGAAGAGCACTGAAAACTAATAAATGGGAAGGAAAAAGAGAG
TGGGGGAGGAAAGAAGAATAACAAAAGGTTGACAGATAAACGCTATAATGAAAGATAGTACGAAGAGAAATAGAGTTTTAAATATAGGGTCGGTAACAGGGAGTGTGCAT
TCATTCTAACCATAGGACAGGGTCGTTTTGAATCTTCAACCATCGGTTCGACAGCGAACTAACCGAGGTCAATTTGGTGGCAGACACCTACACGAAGTGTTTGAGATAAT
GAGAATTAGAATGATTATAGGGAAGGAAAATACGGAAAAGAATGATTAACGAGAAGAATTATTATCTTTATTTGGGATGAAGAAAATTATTATAAAAAATAAAATAAAAT
AAAAGTTATTCCTGTCTTTGGAGTACATAGAATGATCATGAGGAACTGTTACCAGTATTTGGAGGGTATAGAATGATTAAGAGGAACAAAATATTAATTATGGAGAGAAC
GAGATTACAGAAGAAAAGGAAATTATTCTACATCTCCCCAGTTTTGGTCCCCTTCCACATACACCCCTAGGTGGGGAAAAGTTTACATTGAAATAAAATAAAATTAATTA
TAAGATAAGATAGGATGCAACAAAAAAAAATGGCTATCAATTAGGTCATGGAATCGGATTTAAACTAAAGTATTAATTTGAAGTAAAAGAGAGGGAGAAAAATAGGAAAC
ATCGGGAGAAATATTGGTAAATCAGGTCACGATTAGCATTTTATTATTTTTGTTATCTTACATGGGATGTTCTGTGTATAAAAACAGTCAACGCAAATGAAATATTATGT
TGACAGCACATCACACTAAAGACTTGCGTGAGAGTTTCAAATTATTTCCTTTTCTTCTAGACTATTAATAATGTTAATTATTAACATAACCAATGAGAAACCCCTTTCAA
AATCCAGATATGAAAACAAAAACATACCATAATTTCCAAGGAAAAAGATAATCATAAAAACAGAGAGAGTTTGGTTTTTCTAAACTCACTTATCCCAAAAGCACTGCCAG
CCAGCAAAAGAAAGCTTCCCATCAGCTATGATCAACTTCTCCACCATGATTGCAGCAATCCTCATCCTGTTCCTAAGTATCATCAACAAAGGGTCTGTTGCTGAAAGCTG
CAGTTTGGACACCATTAATATAGGAACACAGAGAAGCGGAAGGGAGATTGGGGGGCAGCCCGAATGGAACGTACAAGTGATCAACAACTGTGATTGTCCTCAGAAGCAGA
TTGTCCTGTCCTGTCCCGGGTTTCAAACGGCTGAGCCCGTTGATACATCGATTCTGTCGCAGCAAGGGGATACTTGCCTTCTCATAAACGGAGGAATTGTGCAGCCTGGT
TCTTCAGTTTCGTTTTCCTATGCTTGGGATCCCCCTTTCACCATGTTCCCTCGTTTCTCTGTTACTCAGTGTCCTACCTGAACAACCAAAAAGAAATTGGCTATAAATAA
GTCTTGAGTTCTTTTTTATATTGAGCGAAAATTTGGAAAATGATTGTTCTTTTAGTTTGAAAAGAGTTATAACTGATTGTTGATGATGTCTTCAGCTCATATATCTAGTG
GTCTGTGGTTGTTTTGTTTTCTTCACCTGATTGATCTACTAAAGATTGGAATATACAGCTTATAGTTTTATATCAAAACATGGTTTGGAACTTTGGTATAAGTCTGCACA
AATGTTGCCATCTTCACTTTAGTTTGACTATAGAACCTGCCTTTTGAAAGATGAATATGATATAATAAGCAACTTGTGTAGCTTGTACAGCATGATTTCTCATTCAGGGT
CTGGTGTTGTTGTGTGTAAAAGAAGCTTTCATCAGACTATATCACGTTATCCGAGTGAACTCGTTTGAACCTGACTGCAATCAGCCTCATGAGAAGCTCCAAAGTTTGCT
AATACTATTCAGATATCCTAAAACGTGGGAGAAAAGCTCCTCCCACTCTACCCTATATGATTATCATAAAGATCATCATCTTCATTACATAACTTTTACAGGCTTCAAAA
TCTTCTTCAGCATATCCATCGTTGAAATATGGTAGCTGCTTCTAGCTCATGCGTCAATCCCTTTATTCTCATCTTGATTGTGATCGTTTAGTTCCTACAATCACTTCGAC
AGATGGACATTTATTCATACCAATCCTTTCCTTCATTCTTTCCTCATTGAGAATCGGAGTTTCATGCCACTACATACCTGGAACATGAACACATGTGCTCGTTCCAAGAC
TCCGGACTCGAGAGACAATACGGGATTTTACAAGCGAGCTCTTGGGAAACTTGAGAACTCCAAAGAAGCATATCTTGGAGACCCCAAGCTTCTACCAACGAGGTGATCTC
TTGGGGACATGTGACATGTCCTAAGAAATTGCATCATCCTAAGCTTCGTACAGTCATTTTGACTCGCATGAACATTTAGTTTTTCATTCAGCTTAGTGAATAGAGAATTA
TAAAACAAAACTGGGTTGTTATCATGCTCCCAAAGAAAGCAAAACACACGCACAAACATAAATATAAATTTTACAGCACTGCAAGTTGCAACTAACCCTTAACCTCGTCA
ATTGTCGTGTAAAATAGGCAGTGCACCAATCTATAAAATCCAAACCATAATTTAGACGTAAAAATAAGTAAAAAGAGAACAACTTTATAAATAGACGCTAATCTGAATAA
TAATATCGAACTACAAAAGTAACAGAAAAAGCTACATAAGAACTTTTAACCTCTATTCATAGCATACGGCTGTGTGCATATAAACAGGAAGATAACCGAAGAATGGATAA
ACAGTCTCCAAGGGGTAGCACAGTGGTTGAAGACTTGGGCTTTGAGGGTATGCTCCCCTCAAAGTCCTAGGTTCGAGACTCAGCTGTGACATTATGATAGAATCTACCAC
ACTATATTATTAAACTACGATAATCTCTCGAGAAAATCTCTCAAAGAGCCAAGAACAAAGACGACAATTCAATTCGATGATTGACAATGACCTAACAACTCCTATTTATA
GCCCTAAACCCTAATTTGACCTAATAAATAAAAGACTAATAATAGGCCCAAGCCCAATACATTTAAAATAGCAATTAACTAATAATTAATAAATACTAATATTCCTAAAA
TGCCAAGCGCGGATCGTATCAATACTTTCCCCCCAAGAGCCACCTTGTCCTCAAGGTGTTCCATCTCCGATGCCATCTTTTACAGCCCCGTCTTCTGGCAAAAATTCTGT
CACATCAGCATCCTGGCACTCGTGCGCATCCCGAACGACCCACACATCCAAGTTCTCGGAGCTCCTTCTTCCGACAAC
Protein sequence
MINFSTMIAAILILFLSIINKGSVAESCSLDTINIGTQRSGREIGGQPEWNVQVINNCDCPQKQIVLSCPGFQTAEPVDTSILSQQGDTCLLINGGIVQPGSSVSFSYAW
DPPFTMFPRFSVTQCPT
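
The CDS and protein sequences above can be cross-checked against each other by translating the CDS with the standard genetic code. The following is a minimal Python sketch under that assumption (standard code, reading frame 1); the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only short excerpts of the CDS are used here; the full sequence can be pasted in the same way.

```python
# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven and last five codons of the CDS listed above.
print(translate("ATGATCAACTTCTCCACCATG"))  # MINFSTM
print(translate("CAGTGTCCTACCTGA"))        # QCPT
```

Both excerpts agree with the start (MINFSTM) and end (QCPT) of the listed protein sequence.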