; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007035 (gene) of Snake gourd v1 genome

Gene IDTan0007035
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein canopy-1-like isoform X1
Genome locationLG01:116622225..116625927
RNA-Seq ExpressionTan0007035
SyntenyTan0007035
Gene Ontology termsNA
InterPro domainsIPR021852 - Domain of unknown function DUF3456


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008452994.1 PREDICTED: protein canopy-1 isoform X1 [Cucumis melo]2.6e-8992.86Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MK NAWVLLLLVIYS  VDCIDDKCAACN VAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK GSTGQQW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLSRHC+ AS VQLN+DDDE DGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

XP_022936575.1 protein canopy-1-like isoform X1 [Cucurbita moschata]2.6e-8991.76Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MKSNAWVLLL+ IY   V+CIDDKCAACN VAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  STG+QW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR G VSKVLCHDLSRHCSKAS VQLN+DDDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

XP_022975788.1 protein canopy-1-like isoform X1 [Cucurbita maxima]1.4e-9092.86Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MKSNAWVLLL+VIY   V+CIDDKCAACN VAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  STG+QW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR GDVSKVLCHDLSRHCSKAS VQLN+DDDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

XP_038896985.1 protein canopy-1 isoform X1 [Benincasa hispida]3.6e-9193.41Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MKSNAWVLLLLVIYS  V CIDDKCAACN VAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  STGQQW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR GDVSKVLCHDLSRHCSKAS V LN++DDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

XP_038896987.1 protein canopy-1 isoform X2 [Benincasa hispida]6.8e-9093.41Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MKSNAWVLLLLVIYS  V CIDDKCAACN VAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  STGQQW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR GDVSKVLCHDLSRHCSKAS V LN++DDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

TrEMBL top hitse value%identityAlignment
A0A1S3BWB0 protein canopy-1 isoform X11.3e-8992.86Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MK NAWVLLLLVIYS  VDCIDDKCAACN VAEEIE GLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK GSTGQQW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSV  GDVSKVLCHDLSRHC+ AS VQLN+DDDE DGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

A0A6J1CJM8 protein seele isoform X11.3e-8991.76Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MK NAW LLLLVIYS AV+CIDDKCAACN VAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  S+GQQW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVD+WDNLTN+QEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR GDVSKVLCHDLSRHCS AS VQ+N+DDDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

A0A6J1FE32 protein canopy-1-like isoform X11.3e-8991.76Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MKSNAWVLLL+ IY   V+CIDDKCAACN VAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  STG+QW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR G VSKVLCHDLSRHCSKAS VQLN+DDDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

A0A6J1IHQ2 protein canopy-1-like isoform X21.3e-8992.86Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MKSNAWVLLL+VIY   V+CIDDKCAACN VAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  STG+QW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR GDVSKVLCHDLSRHCSKAS VQLN+DDDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

A0A6J1IKA3 protein canopy-1-like isoform X16.7e-9192.86Show/hide
Query:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW
        MKSNAWVLLL+VIY   V+CIDDKCAACN VAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEK  STG+QW
Subjt:  MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQW

Query:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL
        IKVD+WDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVR GDVSKVLCHDLSRHCSKAS VQLN+DDDEADGEL
Subjt:  IKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL

SwissProt top hitse value%identityAlignment
Q5HZV5 Protein canopy homolog 31.2e-0424.85Show/hide
Query:  WVLLLLVIYSAAVDCID-------DKCAACNTVAEEIERGL-SNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTG
        W+LLLL ++    +  D        KC  C  VA E++       + R  +D R+    +  +K K I Y  S++R++E+ +GLC ++ +Y + K  S  
Subjt:  WVLLLLVIYSAAVDCID-------DKCAACNTVAEEIERGL-SNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTG

Query:  QQWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKA
         ++ K  +    T         K +      L  ET  ++A++ K+  V +     V+  D  RH  +A
Subjt:  QQWIKVDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKA

Q5M7D4 Protein canopy homolog 11.1e-0525Show/hide
Query:  CAACNTVAEEIERGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWIK----VDNWD----NLTNKQEA
        C AC  + +E+   +    P+  +D+   R+   G+++   + +  SEL + ++L+ +CEKM DY +    +T ++  K     DN      +  N Q  
Subjt:  CAACNTVAEEIERGLSNEKPRNHLDM-RHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWIK----VDNWD----NLTNKQEA

Query:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQL
           S  +   C R++EE ED++  +I K +  + D    LC + +  C +  H +L
Subjt:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQL

Q7JXF7 Protein seele2.2e-0624.22Show/hide
Query:  KCAACNTVAEEIERGLSNEKPRNHLDMR-HRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWI-------KVDNWDNLTNKQEA
        KC  C  V  E+E  ++ E P    D+   RLD++G    K +    SE+ + EL++ +CEKM DY      S G+  +       +++   +L +  + 
Subjt:  KCAACNTVAEEIERGLSNEKPRNHLDMR-HRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWI-------KVDNWDNLTNKQEA

Query:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDD
           +K +  +C  +LE+ ++   +  +   +   D+   +C + + +C + S VQ   D D
Subjt:  RAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDD

Arabidopsis top hitse value%identityAlignment
AT1G42480.1 unknown protein2.1e-6062.15Show/hide
Query:  LVIYSAAV-------DCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWIKV
        LV+++A +         +DDKCAACN VAEE+E  L  EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K+    +QW+KV
Subjt:  LVIYSAAV-------DCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWIKV

Query:  DNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQL-NEDDDEAD
         N+DNLTNKQEA+A++ DISTYCGRLLEETED+L E+IK GS++ G+V KVLC  LS HC ++S     +E+DD+AD
Subjt:  DNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQL-NEDDDEAD

AT1G42480.2 unknown protein2.5e-3462.04Show/hide
Query:  LVIYSAAV-------DCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWIKV
        LV+++A +         +DDKCAACN VAEE+E  L  EKPRNHLDMR+RL+SKGQR+GKVIDYR+S+LRVV+LLDGLC++MQDYT++K+    +QW+KV
Subjt:  LVIYSAAV-------DCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWIKV

Query:  DNWDNLTN
         N+DNLT+
Subjt:  DNWDNLTN

AT1G42480.3 unknown protein2.3e-4350.56Show/hide
Query:  LVIYSAAV-------DCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRV-SELRVVELLDGLCEKMQDYTIEKIGSTGQQWIK
        LV+++A +         +DDKCAACN VAEE+E  L     +    +   + +KG+   ++   RV S+LRVV+LLDGLC++MQDYT++K+    +QW+K
Subjt:  LVIYSAAV-------DCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRV-SELRVVELLDGLCEKMQDYTIEKIGSTGQQWIK

Query:  VDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQL-NEDDDEAD
        V N+DNLTNKQEA+A++ DISTYCGRLLEETED+L E+IK GS++ G+V KVLC  LS HC ++S     +E+DD+AD
Subjt:  VDNWDNLTNKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQL-NEDDDEAD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATCCAACGCGTGGGTTCTATTGTTATTAGTGATCTACTCCGCCGCTGTCGATTGCATTGATGACAAATGTGCCGCTTGCAATACTGTTGCCGAGGAGATAGAACG
TGGACTTTCCAATGAAAAACCGAGGAATCATTTAGATATGAGACATCGGTTGGATTCTAAAGGTCAGCGTAAGGGGAAGGTGATTGATTACAGGGTCAGTGAGCTAAGAG
TTGTCGAACTCCTGGATGGGCTTTGTGAAAAGATGCAAGACTACACTATTGAGAAGATAGGTTCAACTGGACAACAGTGGATCAAGGTGGATAACTGGGACAACTTGACA
AATAAACAAGAAGCTCGGGCTTATTCAAAAGACATATCAACTTATTGTGGGAGATTATTAGAGGAAACAGAAGATGATTTGGCAGAGTTAATTAAGAAAGGATCTGTCAG
AGTAGGTGATGTTAGCAAAGTCCTATGCCATGATTTGAGCAGGCATTGCAGCAAGGCGAGTCATGTTCAGCTGAATGAAGATGATGACGAAGCAGATGGAGAACTATGA
mRNA sequenceShow/hide mRNA sequence
TCTTCAGACAGGCCCATACATTCGTCAAGGAAGAGATTGCCATTTTCTTGCTTCGTTTCGTTGACGCTTCAGATTCGCAAACGGAAGAGAAGCTCTGAAGGCGTTCGGAG
AGAGGTGAAAGTCAGGACTCGGATTGGACCAGACGCCAACGACGATGAAATCCAACGCGTGGGTTCTATTGTTATTAGTGATCTACTCCGCCGCTGTCGATTGCATTGAT
GACAAATGTGCCGCTTGCAATACTGTTGCCGAGGAGATAGAACGTGGACTTTCCAATGAAAAACCGAGGAATCATTTAGATATGAGACATCGGTTGGATTCTAAAGGTCA
GCGTAAGGGGAAGGTGATTGATTACAGGGTCAGTGAGCTAAGAGTTGTCGAACTCCTGGATGGGCTTTGTGAAAAGATGCAAGACTACACTATTGAGAAGATAGGTTCAA
CTGGACAACAGTGGATCAAGGTGGATAACTGGGACAACTTGACAAATAAACAAGAAGCTCGGGCTTATTCAAAAGACATATCAACTTATTGTGGGAGATTATTAGAGGAA
ACAGAAGATGATTTGGCAGAGTTAATTAAGAAAGGATCTGTCAGAGTAGGTGATGTTAGCAAAGTCCTATGCCATGATTTGAGCAGGCATTGCAGCAAGGCGAGTCATGT
TCAGCTGAATGAAGATGATGACGAAGCAGATGGAGAACTATGAAATATTATTAAGATAGACTGGAGGGAAAGCTATAAACCATGCAACAGTTAGCTTGTTTCAAATCAGG
AGATATGTGTTTACGTGCTTATTTGCTTGTATAACAGTAAAACTCTTGATTTCACCCTCAATTTATAATTTTGATAGAAGGCTCAGAGGATGGGGTTATTACCGACTTAA
AAGCCTCATATTGATGTTTTTGTAGTAGAGCCTTCATTATGAGACTTCTTTCTAAGGAGGCAGCCAGACCCCGAACAACATTAAGCTAAAATTCTAAATGATCCAAAACT
AACCATTGTTGATGGGATTTTCTGCATTAGGTGACAAAAGACGATCAGGCGAATTAGTTTTCCAAGTGCTTTTTGCTCATAAGCTAGATACTATGAAAGGTCATGTCACT
GTCAATTTTTAAGTTTTTACTTATGGTCAGTCACATCAAATTGGCATGCTTCATTGAAATTAGTGTGTGATTACTACGCAGCCCTTTAGCATATGTGAAAGTGTAATCTT
GTCTTCATTGTTCTCAGGG
Protein sequenceShow/hide protein sequence
MKSNAWVLLLLVIYSAAVDCIDDKCAACNTVAEEIERGLSNEKPRNHLDMRHRLDSKGQRKGKVIDYRVSELRVVELLDGLCEKMQDYTIEKIGSTGQQWIKVDNWDNLT
NKQEARAYSKDISTYCGRLLEETEDDLAELIKKGSVRVGDVSKVLCHDLSRHCSKASHVQLNEDDDEADGEL