; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003457 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003457
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionDIOX_N domain-containing protein
Genome locationscaffold4:48050819..48055200
RNA-Seq ExpressionSpg003457
SyntenySpg003457
Gene Ontology termsGO:0046872 - metal ion binding (molecular function)
GO:0051213 - dioxygenase activity (molecular function)
InterPro domainsIPR026992 - Non-haem dioxygenase N-terminal domain
IPR027443 - Isopenicillin N synthase-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6586418.1 hypothetical protein SDJN03_19151, partial [Cucurbita argyrosperma subsp. sororia]6.5e-12384.53Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        MSSS QSPFPA+RTVTISY ELQDR VDLS KIEEGFGPKGLGILSV+DVP FPSLRQDLLRLSSRLARLPED KKELEDP SRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILDSPTTD SLIQRYPSYCGSNIWP++ELPELEL                       AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PA QST  ED+DNLSSWCGWHTDHGSLTGLTCAMFTRDG EIPCPDSAAGLYVRTRADEIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

XP_022157594.1 uncharacterized protein LOC111024250 isoform X1 [Momordica charantia]1.4e-12585.28Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        MSS FQSPFPAIRTVTISYSELQDR VDLSTKIEEGFGPKGLGILSVTDVP FPSLR DLLRLSSRLARLPE+ KKELEDPHSRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANP LDSPTTD SLI+RYPSYCGSNIWPSRELPELEL                       A+KMMKLHEDE LEK LLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PAQQSTCAED +NLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLY+RTRADEIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

XP_022937450.1 uncharacterized protein LOC111443853 [Cucurbita moschata]6.5e-12384.53Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        MSSS QSPFPA+RTVTISY ELQDR VDLS KIEEGFGPKGLGILSV+DVP FPSLRQDLLRLSSRLARLPED KKELEDP SRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILDSPTTD SLIQRYPSYCGSNIWP++ELPELEL                       AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PA QST  ED+DNLSSWCGWHTDHGSLTGLTCAMFTRDG EIPCPDSAAGLYVRTRADEIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

XP_038889556.1 uncharacterized protein LOC120079448 isoform X1 [Benincasa hispida]3.7e-12684.13Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        M+SS QS  PAIRTVTISYSELQDR VDLS KIEEGFGPKGLGILSVTDVP FPSLRQDLLRLSSR  +LPED KKELEDPHSRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILD+PTTDTS IQRYPSYCGSNIWPSRELPELEL                       AAKMMK+HEDE LEK LLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEYYSLSLM
        PAQQSTC+EDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPD+AAGLY+RTRADEIVKVEY SLSLM
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEYYSLSLM

XP_038889557.1 uncharacterized protein LOC120079448 isoform X2 [Benincasa hispida]2.9e-12383.77Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        M+SS QS  PAIRTVTISYSELQDR VDLS KIEEGFGPKGLGILSVTDVP FPSLRQDLLRLSSR  +LPED KKELEDPHSRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILD+PTTDTS IQRYPSYCGSNIWPSRELPELEL                       AAKMMK+HEDE LEK LLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PAQQSTC+EDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPD+AAGLY+RTRADEIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

TrEMBL top hitse value%identityAlignment
A0A1S3C6L2 uncharacterized protein LOC103497625 isoform X23.8e-12182.26Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        M+SSF S  PAIRTVTIS+SELQDR VDLS KIEEGFGPKGLGILSVTDVP FPSLR+DLLRLSSRLA+LPEDVKKELEDPH+RYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILD+PTTD SLIQRYPSYCGSNIWPSR+LPELEL                       AAKMMKL+EDE LEKI+LNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PAQQSTC+EDSD LSSWCGWHTDHGSLTGLTCA FTRDG+EIPCPDSAAGLY+RTRA EIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

A0A5D3CFM7 Oxoglutarate/iron-dependent dioxygenase3.8e-12182.26Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        M+SSF S  PAIRTVTIS+SELQDR VDLS KIEEGFGPKGLGILSVTDVP FPSLR+DLLRLSSRLA+LPEDVKKELEDPH+RYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILD+PTTD SLIQRYPSYCGSNIWPSR+LPELEL                       AAKMMKL+EDE LEKI+LNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PAQQSTC+EDSD LSSWCGWHTDHGSLTGLTCA FTRDG+EIPCPDSAAGLY+RTRA EIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

A0A6J1DUW6 uncharacterized protein LOC111024250 isoform X16.8e-12685.28Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        MSS FQSPFPAIRTVTISYSELQDR VDLSTKIEEGFGPKGLGILSVTDVP FPSLR DLLRLSSRLARLPE+ KKELEDPHSRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANP LDSPTTD SLI+RYPSYCGSNIWPSRELPELEL                       A+KMMKLHEDE LEK LLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PAQQSTCAED +NLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLY+RTRADEIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

A0A6J1FGP0 uncharacterized protein LOC1114438533.1e-12384.53Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        MSSS QSPFPA+RTVTISY ELQDR VDLS KIEEGFGPKGLGILSV+DVP FPSLRQDLLRLSSRLARLPED KKELEDP SRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILDSPTTD SLIQRYPSYCGSNIWP++ELPELEL                       AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PA QST  ED+DNLSSWCGWHTDHGSLTGLTCAMFTRDG EIPCPDSAAGLYVRTRADEIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

A0A6J1HPT0 uncharacterized protein LOC1114655871.2e-12284.53Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        MSSS QSPFPAIRTVTISY ELQDR VDLS KIEEGFGPKGLGILSV+DVP FPSLRQDLLRLSSRLARLPED KKELEDP SRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
        PDLLKGSFYANPILD PTTD SLIQRYPSYCGSNIWP++ELPELEL                       AAKMMKLHEDETLEKILLNSRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELEL-----------------------AAKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PA QST  ED+DNLSSWCGWHTDHGSLTGLTCAMFTRDG EIPCPDSAAGLYVRTRADEIVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G13400.1 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily protein4.2e-9664.15Show/hide
Query:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK
        MS S  S  P + TVTISYSEL++  +DLS +IEEGFGP GLGILSV DVP + +LRQ+LL+L+ RLA LPE+VK+ELEDPHSRYNFGWSHGKEKLESGK
Subjt:  MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGK

Query:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELELA-----------------------AKMMKLHEDETLEKILLNSRCHKGRLLYYF
         D+LKGS+YANP+ D PT+++  IQRYPSYCGSNIWP   LPELE A                       +K +K HE + LEKILL SRCHKGRLLYYF
Subjt:  PDLLKGSFYANPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELELA-----------------------AKMMKLHEDETLEKILLNSRCHKGRLLYYF

Query:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY
        PAQ+S+   D+D++SSWCGWHTDHGSLTGLT A+F+RD VE+PCPD A+GLY++TR+ +IVKV Y
Subjt:  PAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIPCPDSAAGLYVRTRADEIVKVEY


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTTCCTCCTTCCAGTCCCCTTTTCCGGCCATCAGAACCGTCACCATATCATACTCAGAGCTCCAAGATCGGAGAGTTGACTTGTCGACGAAGATTGAAGAAGGATT
TGGACCTAAGGGGCTGGGAATTCTGTCGGTCACTGATGTCCCTCGATTTCCTTCGTTGAGGCAGGATCTTCTACGTCTTTCATCGAGATTAGCTAGGCTACCAGAAGATG
TGAAGAAGGAGCTTGAAGACCCTCATAGTAGGTATAATTTTGGATGGAGTCATGGGAAAGAGAAACTTGAATCTGGGAAGCCTGATTTGTTGAAAGGCTCGTTCTATGCC
AATCCAATATTGGATAGCCCAACAACAGACACATCTTTAATTCAAAGGTATCCATCATATTGTGGTTCAAATATTTGGCCTTCTAGAGAACTGCCAGAACTTGAATTAGC
TGCTAAAATGATGAAACTGCATGAGGATGAAACCCTCGAAAAGATACTTCTCAACTCCCGGTGTCATAAAGGGCGCTTGCTTTATTATTTTCCAGCACAGCAGAGCACTT
GTGCTGAAGATAGTGATAATCTATCCTCTTGGTGTGGATGGCATACAGATCATGGTTCCTTAACAGGTCTGACCTGTGCAATGTTTACAAGAGATGGTGTGGAGATACCT
TGCCCTGATAGTGCTGCTGGCCTTTATGTTAGGACACGAGCTGATGAAATTGTTAAAGTAGAATATTATTCTCTATCATTGATGACATGCCTTTATTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTTCCTCCTTCCAGTCCCCTTTTCCGGCCATCAGAACCGTCACCATATCATACTCAGAGCTCCAAGATCGGAGAGTTGACTTGTCGACGAAGATTGAAGAAGGATT
TGGACCTAAGGGGCTGGGAATTCTGTCGGTCACTGATGTCCCTCGATTTCCTTCGTTGAGGCAGGATCTTCTACGTCTTTCATCGAGATTAGCTAGGCTACCAGAAGATG
TGAAGAAGGAGCTTGAAGACCCTCATAGTAGGTATAATTTTGGATGGAGTCATGGGAAAGAGAAACTTGAATCTGGGAAGCCTGATTTGTTGAAAGGCTCGTTCTATGCC
AATCCAATATTGGATAGCCCAACAACAGACACATCTTTAATTCAAAGGTATCCATCATATTGTGGTTCAAATATTTGGCCTTCTAGAGAACTGCCAGAACTTGAATTAGC
TGCTAAAATGATGAAACTGCATGAGGATGAAACCCTCGAAAAGATACTTCTCAACTCCCGGTGTCATAAAGGGCGCTTGCTTTATTATTTTCCAGCACAGCAGAGCACTT
GTGCTGAAGATAGTGATAATCTATCCTCTTGGTGTGGATGGCATACAGATCATGGTTCCTTAACAGGTCTGACCTGTGCAATGTTTACAAGAGATGGTGTGGAGATACCT
TGCCCTGATAGTGCTGCTGGCCTTTATGTTAGGACACGAGCTGATGAAATTGTTAAAGTAGAATATTATTCTCTATCATTGATGACATGCCTTTATTAA
Protein sequenceShow/hide protein sequence
MSSSFQSPFPAIRTVTISYSELQDRRVDLSTKIEEGFGPKGLGILSVTDVPRFPSLRQDLLRLSSRLARLPEDVKKELEDPHSRYNFGWSHGKEKLESGKPDLLKGSFYA
NPILDSPTTDTSLIQRYPSYCGSNIWPSRELPELELAAKMMKLHEDETLEKILLNSRCHKGRLLYYFPAQQSTCAEDSDNLSSWCGWHTDHGSLTGLTCAMFTRDGVEIP
CPDSAAGLYVRTRADEIVKVEYYSLSLMTCLY