; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

PI0027781 (gene) of Melon (PI 482460) v1 genome

Gene IDPI0027781
OrganismCucumis metuliferus PI 482460 (Melon (PI 482460) v1)
DescriptionEndoglucanase
Genome locationchr01:5749707..5753070
RNA-Seq ExpressionPI0027781
SyntenyPI0027781
Gene Ontology termsGO:0008152 - metabolic process (biological process)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004148538.1 uncharacterized protein LOC101213547 [Cucumis sativus]1.0e-12390.8Show/hide
Query:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS
        MVEGSISS LDS PESKEQVDGLTPED+AWVDSCLIKE+PDISDGNWN IKDALLEI+DLYPQGFESSLALSDNVPGA NGDIDVDMLPSNNVKE TFSS
Subjt:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS

Query:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQ-KHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELV
        R+S DLMNETRMV EDHPM DTGIASEDPQ  HD+IDTSLP TL+KNPFLPTYKEEVEGNDENNQAG+GHEL EIGS+SPIN+IFHVWDLNFPPVEDELV
Subjt:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQ-KHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELV

Query:  EQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
        EQLNKALTENSVE VPSMDSNLGV KDLKEDLLDDLINSISDLSLEQTKY
Subjt:  EQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY

XP_008444927.1 PREDICTED: uncharacterized protein LOC103488123 [Cucumis melo]4.0e-12893.17Show/hide
Query:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS
        MVEGSISS LD  PESKEQVDGLT ED+AWVDSCLIKEIPDISDGNWNH+KDALLEILDLYPQGFESSLALSDNVPGA NGDIDVDMLPSNNVKEPTFSS
Subjt:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS

Query:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE
        R+S DLMNETR  LEDHPM DTGIASEDPQKHD+IDTSLPLTLIKNPFLPTYKEEVEGNDEN+QAG+GHEL EIGSESPINDIFHVWDLNFPPVEDEL+E
Subjt:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE

Query:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
        QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
Subjt:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY

XP_022135637.1 uncharacterized protein LOC111007546 [Momordica charantia]1.1e-8067.89Show/hide
Query:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS
        MVE SISS LDSRPESKE VD LTPED+AWVDSCLI E PDISDGNWN++K ALLEILDLYP+ F SS  LSDN PG    DI  DMLPSNN KE  F  
Subjt:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS

Query:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE
           GD         +D PM + GIASEDP+ HD+IDTSL L+  KNPFLPTYKE+V G  EN Q G   +L EIG E P+NDIF VWDLN PPVEDEL+E
Subjt:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE

Query:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQ
        QLNKAL+ENS +S+PSM  N  VLKD KE+ LDDLINSISDLSLEQ
Subjt:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQ

XP_022925089.1 uncharacterized protein LOC111432435 isoform X1 [Cucurbita moschata]2.0e-7956.86Show/hide
Query:  LSMVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTF
        +SM+EGS SS L+S  ESKEQ+D LTPED+AW DSCLIK+IPDI DGNWNHIKDALLE+LDLYPQGFES LA+SD VPG IN DIDVD+L   NVK+PT 
Subjt:  LSMVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTF

Query:  SSRNSGDLMNETRMVL------------------------------------------------EDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLP
          R+S D MN+    L                                                 D  M +TG+ASED Q HD+IDTSL L+  KNPFLP
Subjt:  SSRNSGDLMNETRMVL------------------------------------------------EDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLP

Query:  TYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
        TYKEEV+G  E+ Q    H+L EIG E PINDIF VWDLN PP+E++LVEQLNKAL+ENS ESV   DSN  V+KD  +DLLD LI+SISDLSLE  KY
Subjt:  TYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY

XP_038887167.1 uncharacterized protein LOC120077354 [Benincasa hispida]3.7e-10284.19Show/hide
Query:  ESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSSRNSGDLMNETRMVL
        ESKEQVD LTPED+AWVDSCLIKE PDISDGNWNHIKDALLEILDLYPQ FESS A+S N PG  NGDIDVDML  NNVKEPTFSSR+S D MNE     
Subjt:  ESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSSRNSGDLMNETRMVL

Query:  EDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESV
               TGIASEDPQKHD+IDTSL  TLIKNPFLPTYKEEVEGNDENNQAG  HEL EIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTEN VES 
Subjt:  EDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESV

Query:  PSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTK
        PSMD NL VLKDLKEDLLDDLINSISDLSLEQTK
Subjt:  PSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTK

TrEMBL top hitse value%identityAlignment
A0A0A0K2C8 Uncharacterized protein4.9e-12490.8Show/hide
Query:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS
        MVEGSISS LDS PESKEQVDGLTPED+AWVDSCLIKE+PDISDGNWN IKDALLEI+DLYPQGFESSLALSDNVPGA NGDIDVDMLPSNNVKE TFSS
Subjt:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS

Query:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQ-KHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELV
        R+S DLMNETRMV EDHPM DTGIASEDPQ  HD+IDTSLP TL+KNPFLPTYKEEVEGNDENNQAG+GHEL EIGS+SPIN+IFHVWDLNFPPVEDELV
Subjt:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQ-KHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELV

Query:  EQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
        EQLNKALTENSVE VPSMDSNLGV KDLKEDLLDDLINSISDLSLEQTKY
Subjt:  EQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY

A0A1S3BBI4 uncharacterized protein LOC1034881231.9e-12893.17Show/hide
Query:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS
        MVEGSISS LD  PESKEQVDGLT ED+AWVDSCLIKEIPDISDGNWNH+KDALLEILDLYPQGFESSLALSDNVPGA NGDIDVDMLPSNNVKEPTFSS
Subjt:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS

Query:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE
        R+S DLMNETR  LEDHPM DTGIASEDPQKHD+IDTSLPLTLIKNPFLPTYKEEVEGNDEN+QAG+GHEL EIGSESPINDIFHVWDLNFPPVEDEL+E
Subjt:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE

Query:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
        QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
Subjt:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY

A0A5D3BCZ2 Uncharacterized protein1.9e-12893.17Show/hide
Query:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS
        MVEGSISS LD  PESKEQVDGLT ED+AWVDSCLIKEIPDISDGNWNH+KDALLEILDLYPQGFESSLALSDNVPGA NGDIDVDMLPSNNVKEPTFSS
Subjt:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS

Query:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE
        R+S DLMNETR  LEDHPM DTGIASEDPQKHD+IDTSLPLTLIKNPFLPTYKEEVEGNDEN+QAG+GHEL EIGSESPINDIFHVWDLNFPPVEDEL+E
Subjt:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE

Query:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
        QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
Subjt:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY

A0A6J1C3A0 uncharacterized protein LOC1110075465.1e-8167.89Show/hide
Query:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS
        MVE SISS LDSRPESKE VD LTPED+AWVDSCLI E PDISDGNWN++K ALLEILDLYP+ F SS  LSDN PG    DI  DMLPSNN KE  F  
Subjt:  MVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSS

Query:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE
           GD         +D PM + GIASEDP+ HD+IDTSL L+  KNPFLPTYKE+V G  EN Q G   +L EIG E P+NDIF VWDLN PPVEDEL+E
Subjt:  RNSGDLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVE

Query:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQ
        QLNKAL+ENS +S+PSM  N  VLKD KE+ LDDLINSISDLSLEQ
Subjt:  QLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQ

A0A6J1EB39 uncharacterized protein LOC111432435 isoform X19.7e-8056.86Show/hide
Query:  LSMVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTF
        +SM+EGS SS L+S  ESKEQ+D LTPED+AW DSCLIK+IPDI DGNWNHIKDALLE+LDLYPQGFES LA+SD VPG IN DIDVD+L   NVK+PT 
Subjt:  LSMVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTF

Query:  SSRNSGDLMNETRMVL------------------------------------------------EDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLP
          R+S D MN+    L                                                 D  M +TG+ASED Q HD+IDTSL L+  KNPFLP
Subjt:  SSRNSGDLMNETRMVL------------------------------------------------EDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLP

Query:  TYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY
        TYKEEV+G  E+ Q    H+L EIG E PINDIF VWDLN PP+E++LVEQLNKAL+ENS ESV   DSN  V+KD  +DLLD LI+SISDLSLE  KY
Subjt:  TYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G38980.1 unknown protein3.5e-1327.74Show/hide
Query:  DGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESS----------------------LALSDNVPGAINGDID---------VDML-
        + L+PE VAW DSC+I  + D  + NW   +DAL EI+D++P+ F  S                      L  S+   G+ N   +         + ML 
Subjt:  DGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESS----------------------LALSDNVPGAINGDID---------VDML-

Query:  ----PSNNVKEPTFSSRNSGDLMNETRMVLEDHPMYDTGIAS-------EDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSE
            PS N  E  +   +  +  N  R   +       G+ S        + +  +    S+   + K+ F+ TY E     D      V  + +++ S+
Subjt:  ----PSNNVKEPTFSSRNSGDLMNETRMVLEDHPMYDTGIAS-------EDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSE

Query:  SPINDIFHVWDLNF---PPVEDELVEQLNKALTENS-VESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQT
            +IF VWDL        ED LV QL KAL E+S V+ +P   ++  V+ +  +  +DDLI+ ISDLSL +T
Subjt:  SPINDIFHVWDLNF---PPVEDELVEQLNKALTENS-VESVPSMDSNLGVLKDLKEDLLDDLINSISDLSLEQT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAACTTTTTTGAGCATGGTCGAAGGGTCTATTTCTTCTCGTCTTGACAGTCGACCAGAATCTAAAGAGCAAGTCGATGGTCTTACTCCTGAAGACGTTGCTTGGGT
TGATTCTTGTCTGATTAAAGAGATACCAGATATTTCAGATGGCAATTGGAACCACATAAAGGATGCCTTACTAGAAATCCTTGACCTTTATCCTCAAGGCTTTGAATCTT
CTCTTGCTCTGAGCGATAATGTTCCAGGAGCTATTAATGGCGACATCGACGTAGACATGCTTCCCTCTAACAATGTGAAGGAGCCTACATTTTCCTCAAGAAATAGTGGT
GATCTTATGAACGAAACAAGAATGGTGTTGGAAGATCATCCTATGTATGATACAGGAATAGCTTCGGAAGATCCCCAAAAGCACGACAACATCGATACTTCTCTGCCTTT
AACTCTTATCAAGAATCCATTTTTACCCACTTACAAAGAGGAAGTAGAAGGGAATGATGAGAATAATCAAGCTGGAGTTGGGCATGAATTATTAGAAATTGGATCTGAGT
CCCCAATCAATGATATTTTCCATGTCTGGGATTTGAACTTTCCTCCAGTCGAGGACGAGCTCGTGGAGCAGCTGAACAAAGCCCTTACCGAAAATTCTGTTGAATCAGTC
CCTTCAATGGACAGTAATCTTGGTGTGTTGAAAGACTTGAAGGAAGATTTACTTGATGACTTGATCAATAGCATTTCTGATCTATCTTTGGAACAGACTAAATATTAG
mRNA sequenceShow/hide mRNA sequence
CCCGGCCTCTCTATTTGAACAAGAGTCCTTCTGGCTTCTACCGTTACCTCATTTTCTACTAGCAACATCAAGCACTTCATCTGGATCGTCACTCATCTTCAACCAAACTA
GGGTTTCAATTATATACGAAGTGTATGGAACAGCATGTTGAAGGATCATGGGAACTTTTTTGAGCATGGTCGAAGGGTCTATTTCTTCTCGTCTTGACAGTCGACCAGAA
TCTAAAGAGCAAGTCGATGGTCTTACTCCTGAAGACGTTGCTTGGGTTGATTCTTGTCTGATTAAAGAGATACCAGATATTTCAGATGGCAATTGGAACCACATAAAGGA
TGCCTTACTAGAAATCCTTGACCTTTATCCTCAAGGCTTTGAATCTTCTCTTGCTCTGAGCGATAATGTTCCAGGAGCTATTAATGGCGACATCGACGTAGACATGCTTC
CCTCTAACAATGTGAAGGAGCCTACATTTTCCTCAAGAAATAGTGGTGATCTTATGAACGAAACAAGAATGGTGTTGGAAGATCATCCTATGTATGATACAGGAATAGCT
TCGGAAGATCCCCAAAAGCACGACAACATCGATACTTCTCTGCCTTTAACTCTTATCAAGAATCCATTTTTACCCACTTACAAAGAGGAAGTAGAAGGGAATGATGAGAA
TAATCAAGCTGGAGTTGGGCATGAATTATTAGAAATTGGATCTGAGTCCCCAATCAATGATATTTTCCATGTCTGGGATTTGAACTTTCCTCCAGTCGAGGACGAGCTCG
TGGAGCAGCTGAACAAAGCCCTTACCGAAAATTCTGTTGAATCAGTCCCTTCAATGGACAGTAATCTTGGTGTGTTGAAAGACTTGAAGGAAGATTTACTTGATGACTTG
ATCAATAGCATTTCTGATCTATCTTTGGAACAGACTAAATATTAGGGAAATAGTTGTTTTGGAGCCACTGATTTCTGAAAGTTCATGTTTTCGAATCTCAACACGTGGCA
CTCTGTTACTCCATAATTTTAAATGTCTCAAATTGGAATAGGGGTGATTCTGGAGATAGTAGGCTCAGACAATATTGTTCTACAATTTTTATAGGCAAACTAGTGGGATA
TGTGAATACCTTCCCCCCCCCCAAAAAAAAAATCTGATTGACAGGCTTTGTTAACAATGATGTATTAGTAATTAGTACATTTAAGAGGTAATTTCATGGACAGTTGAATG
CTTGCTTAATGACTGATTATGTTTTTACAGAAGTAATTAATAAATCTAGTAGATTGTTGAAA
Protein sequenceShow/hide protein sequence
MGTFLSMVEGSISSRLDSRPESKEQVDGLTPEDVAWVDSCLIKEIPDISDGNWNHIKDALLEILDLYPQGFESSLALSDNVPGAINGDIDVDMLPSNNVKEPTFSSRNSG
DLMNETRMVLEDHPMYDTGIASEDPQKHDNIDTSLPLTLIKNPFLPTYKEEVEGNDENNQAGVGHELLEIGSESPINDIFHVWDLNFPPVEDELVEQLNKALTENSVESV
PSMDSNLGVLKDLKEDLLDDLINSISDLSLEQTKY