; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc07G17110 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc07G17110
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein of unknown function (DUF1218)
Genome locationClcChr07:31583921..31586782
RNA-Seq ExpressionClc07G17110
SyntenyClc07G17110
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576843.1 Protein MODIFYING WALL LIGNIN-2, partial [Cucurbita argyrosperma subsp. sororia]1.9e-9069.71Show/hide
Query:  SSYATSNFSACH---------SPLNNI-FKNSSLIRSSTQNSQGF----SYSSGDSPDMERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPT
        SS ATS+FSAC          SP   +  + S+LIRSST+  + F    S     + +MERK LAVCS+V  LG+L+VATGFAAEGTR K NQV+QVTPT
Subjt:  SSYATSNFSACH---------SPLNNI-FKNSSLIRSSTQNSQGF----SYSSGDSPDMERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPT

Query:  TCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAF
         CKYPQSPA  LGLTAALSLLLAQI INVSTGCICC RGPRPPAS              T+V AFLLLL GAALNDGR EQS+YF+YY CYVLKPGVFA 
Subjt:  TCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAF

Query:  ATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTADPVFVHEDTYMRRQFT
        AT+VGAASLALGLFY+LILNSAKNDP VWGNPS+PP ANIAMAQPQFPPPPP  P TADPVFVHEDTY RRQFT
Subjt:  ATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTADPVFVHEDTYMRRQFT

KAG7014863.1 hypothetical protein SDJN02_22493 [Cucurbita argyrosperma subsp. argyrosperma]6.7e-8879.17Show/hide
Query:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------
        MERK LAVCS+V  LG+L+VATGFAAEGTR K NQV+QVTPT CKYPQSPA  LGLTAALSLLLAQI INVSTGCICC RGPRPPAS             
Subjt:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------

Query:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA
         T+V AFLLLL GAALNDGR EQS+YF+YY CYVLKPGVFA AT+VGAASLALGLFY+LILNSAKNDP VWGNPS+PP ANIAMAQPQFPPPPP  P TA
Subjt:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

XP_022140830.1 uncharacterized protein LOC111011403 [Momordica charantia]9.1e-8575.46Show/hide
Query:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------
        MERK +AVCS+V  LG+L+VATGFAAEGTR K +QVIQVTP TC YP+SPALGLGL AALSLL+AQ+TINVSTGCICC RGPRPPAS             
Subjt:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------

Query:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPP--RTA
         T++ AFLLLL GAALND RGE+S YF YYECYVLKPGVFA ATI+  AS+ LGLFY+LILNSAKN+PTVWGNPSVPPQANIAM QPQFPPPPPP  R+ 
Subjt:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPP--RTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQ+T
Subjt:  DPVFVHEDTYMRRQFT

XP_022922463.1 uncharacterized protein LOC111430466 [Cucurbita moschata]8.8e-8879.17Show/hide
Query:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------
        MERK LAVCS+V  LG+L+VATGFAAEGTR K NQV+QVTPT CKYPQSPA  LGLTAALSLLLAQI INVSTGCICC RGPRPPAS             
Subjt:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------

Query:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA
         T+V AFLLLL GAALNDGR EQS+YF YY CYVLKPGVFA AT+VGAASLALGLFY+LILNSAKNDP VWGNPS+PP ANIAMAQPQFPPPPP  P TA
Subjt:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

XP_023551924.1 uncharacterized protein LOC111809750 [Cucurbita pepo subsp. pepo]2.0e-8779.17Show/hide
Query:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------
        MERK LAVCS+V ILG+L+VATGFAAEGTR K NQV+QVTPT CKYPQSPA  LGLTAALSLLLAQI INVSTGCICC RGPRPPAS             
Subjt:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------

Query:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA
         T+V AFLLLL GAALN GR EQS+YF YY CYVLKPGVFA AT+VGAASLALGLFY+LILNSAKNDP VWGNPS+PP ANIAMAQPQFPPPPP  P TA
Subjt:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

TrEMBL top hitse value%identityAlignment
A0A0A0KU80 Uncharacterized protein9.5e-8075.23Show/hide
Query:  MERK-VLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS------------
        MERK  L V  +V ILGI+M+ATGFAAE TRTK NQV +V P  CKYP+SPALGLGLTAALSLL AQITI  STGC+CC RGPRPPAS            
Subjt:  MERK-VLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS------------

Query:  -VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDP-TVWGNPSVPPQANIAMAQPQF--PPPPPPR
         VTYV AFLL L GAALN+GRGEQ NYF  Y+CYVLKPGVF+FATIVG ASL LG+ YFLILNSAKNDP TVWG+PSVPPQ NIAMAQPQF  PPPPP R
Subjt:  -VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDP-TVWGNPSVPPQANIAMAQPQF--PPPPPPR

Query:  TADPVFVHEDTYMRRQFT
        TADPVFVHEDTYMRRQFT
Subjt:  TADPVFVHEDTYMRRQFT

A0A1S3BX72 uncharacterized protein LOC1034944353.2e-7570.64Show/hide
Query:  MERK-VLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPP-----------ASV
        MERK  L V  +V  LGI+M+ATGFAAE TRTK  QV +V P  CKYP+SPA+GLG TAALSLL AQITI  STGC+CC RGPRPP           + +
Subjt:  MERK-VLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPP-----------ASV

Query:  TYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDP-TVWGNPSVPPQANIAMAQPQF----PPPPPPR
        TYV AFLL L GAALN+GR +Q NY   YECYVLKPGVF+FATIVG ASL LG+ YFLILNSAKNDP TVWG+PSVPPQ NIAMAQPQF    PPPPP R
Subjt:  TYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDP-TVWGNPSVPPQANIAMAQPQF----PPPPPPR

Query:  TADPVFVHEDTYMRRQFT
        T DPVFVHEDTYMRRQFT
Subjt:  TADPVFVHEDTYMRRQFT

A0A5D3D069 DUF1218 domain-containing protein3.2e-7570.64Show/hide
Query:  MERK-VLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPP-----------ASV
        MERK  L V  +V  LGI+M+ATGFAAE TRTK  QV +V P  CKYP+SPA+GLG TAALSLL AQITI  STGC+CC RGPRPP           + +
Subjt:  MERK-VLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPP-----------ASV

Query:  TYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDP-TVWGNPSVPPQANIAMAQPQF----PPPPPPR
        TYV AFLL L GAALN+GR +Q NY   YECYVLKPGVF+FATIVG ASL LG+ YFLILNSAKNDP TVWG+PSVPPQ NIAMAQPQF    PPPPP R
Subjt:  TYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDP-TVWGNPSVPPQANIAMAQPQF----PPPPPPR

Query:  TADPVFVHEDTYMRRQFT
        T DPVFVHEDTYMRRQFT
Subjt:  TADPVFVHEDTYMRRQFT

A0A6J1CG96 uncharacterized protein LOC1110114034.4e-8575.46Show/hide
Query:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------
        MERK +AVCS+V  LG+L+VATGFAAEGTR K +QVIQVTP TC YP+SPALGLGL AALSLL+AQ+TINVSTGCICC RGPRPPAS             
Subjt:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------

Query:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPP--RTA
         T++ AFLLLL GAALND RGE+S YF YYECYVLKPGVFA ATI+  AS+ LGLFY+LILNSAKN+PTVWGNPSVPPQANIAM QPQFPPPPPP  R+ 
Subjt:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPP--RTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQ+T
Subjt:  DPVFVHEDTYMRRQFT

A0A6J1E6P1 uncharacterized protein LOC1114304664.3e-8879.17Show/hide
Query:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------
        MERK LAVCS+V  LG+L+VATGFAAEGTR K NQV+QVTPT CKYPQSPA  LGLTAALSLLLAQI INVSTGCICC RGPRPPAS             
Subjt:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------

Query:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA
         T+V AFLLLL GAALNDGR EQS+YF YY CYVLKPGVFA AT+VGAASLALGLFY+LILNSAKNDP VWGNPS+PP ANIAMAQPQFPPPPP  P TA
Subjt:  VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPP--PRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-18.2e-0428.93Show/hide
Query:  TCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPASVTYVFAFLLLL--------------AGAALNDGRGEQSNYFSYYECYVLKPGVFA
        +C  P++ A GLG+ A + + +AQI  NV    + CR   +   + T +F  +LLL               GA++N  +     + +  ECY++K GVFA
Subjt:  TCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPASVTYVFAFLLLL--------------AGAALNDGRGEQSNYFSYYECYVLKPGVFA

Query:  FATIVGAASLA--LGLFYFLI
         +  +   ++A  LG F F +
Subjt:  FATIVGAASLA--LGLFYFLI

Arabidopsis top hitse value%identityAlignment
AT1G31720.1 Protein of unknown function (DUF1218)5.8e-0528.93Show/hide
Query:  TCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPASVTYVFAFLLLL--------------AGAALNDGRGEQSNYFSYYECYVLKPGVFA
        +C  P++ A GLG+ A + + +AQI  NV    + CR   +   + T +F  +LLL               GA++N  +     + +  ECY++K GVFA
Subjt:  TCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPASVTYVFAFLLLL--------------AGAALNDGRGEQSNYFSYYECYVLKPGVFA

Query:  FATIVGAASLA--LGLFYFLI
         +  +   ++A  LG F F +
Subjt:  FATIVGAASLA--LGLFYFLI

AT5G17210.1 Protein of unknown function (DUF1218)2.9e-4445.16Show/hide
Query:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQV---IQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS----------
        MER+ + +C ++ +LG+L   T F AE TR K +QV   +  + T C YP+SPA  LG T+AL L++AQI ++VS+GC CCR+GP P  S          
Subjt:  MERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQV---IQVTPTTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS----------

Query:  ---VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPPRT
            T+V AFL+LL+GAALND   E+S     Y CY++KPGVF+   ++   ++ALG+ Y+L L S K         +      IAM QPQ     P R 
Subjt:  ---VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPPRT

Query:  ADPVFVHEDTYMRRQFT
         DPVFVHEDTYMRRQFT
Subjt:  ADPVFVHEDTYMRRQFT

AT5G17210.2 Protein of unknown function (DUF1218)1.3e-3647.4Show/hide
Query:  TTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFA
        T C YP+SPA  LG T+AL L++AQI ++VS+GC CCR+GP P  S              T+V AFL+LL+GAALND   E+S     Y CY++KPGVF+
Subjt:  TTCKYPQSPALGLGLTAALSLLLAQITINVSTGCICCRRGPRPPAS-------------VTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFA

Query:  FATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPPRTADPVFVHEDTYMRRQFT
           ++   ++ALG+ Y+L L S K         +      IAM QPQ     P R  DPVFVHEDTYMRRQFT
Subjt:  FATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIAMAQPQFPPPPPPRTADPVFVHEDTYMRRQFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACTTTCTCTACTCTTTCATCGTACGCAACGTCAAATTTCAGCGCATGTCATTCTCCCCTCAACAACATCTTCAAAAACTCCTCCTTAATCCGGAGCTCCACCCAAAA
CTCGCAGGGTTTTTCTTATTCTTCCGGGGACTCGCCGGACATGGAGAGGAAGGTTCTGGCGGTGTGTTCTATGGTTACTATTTTGGGAATTTTGATGGTCGCCACTGGCT
TCGCCGCTGAAGGCACGAGAACTAAGTTTAATCAAGTCATTCAAGTCACTCCTACTACATGCAAATATCCTCAAAGTCCCGCATTGGGCCTTGGTTTGACAGCCGCTCTA
TCGCTTTTGCTTGCTCAAATAACGATAAATGTTTCGACGGGATGCATTTGCTGCAGACGGGGTCCTCGGCCTCCTGCTTCGGTTACATATGTGTTTGCGTTCCTCCTGTT
GCTCGCCGGTGCTGCACTGAACGATGGACGGGGTGAACAAAGCAACTATTTCAGTTACTACGAGTGCTATGTTCTCAAACCGGGAGTTTTTGCTTTCGCTACTATTGTGG
GTGCTGCAAGTTTAGCTCTAGGATTGTTCTACTTCCTCATACTGAACTCCGCAAAGAATGACCCTACTGTGTGGGGCAATCCTTCCGTTCCTCCTCAAGCAAACATTGCA
ATGGCGCAGCCCCAATTCCCGCCCCCACCTCCACCGAGAACTGCAGACCCCGTGTTTGTTCACGAAGACACGTACATGAGACGACAATTCACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGACTTTCTCTACTCTTTCATCGTACGCAACGTCAAATTTCAGCGCATGTCATTCTCCCCTCAACAACATCTTCAAAAACTCCTCCTTAATCCGGAGCTCCACCCAAAA
CTCGCAGGGTTTTTCTTATTCTTCCGGGGACTCGCCGGACATGGAGAGGAAGGTTCTGGCGGTGTGTTCTATGGTTACTATTTTGGGAATTTTGATGGTCGCCACTGGCT
TCGCCGCTGAAGGCACGAGAACTAAGTTTAATCAAGTCATTCAAGTCACTCCTACTACATGCAAATATCCTCAAAGTCCCGCATTGGGCCTTGGTTTGACAGCCGCTCTA
TCGCTTTTGCTTGCTCAAATAACGATAAATGTTTCGACGGGATGCATTTGCTGCAGACGGGGTCCTCGGCCTCCTGCTTCGGTTACATATGTGTTTGCGTTCCTCCTGTT
GCTCGCCGGTGCTGCACTGAACGATGGACGGGGTGAACAAAGCAACTATTTCAGTTACTACGAGTGCTATGTTCTCAAACCGGGAGTTTTTGCTTTCGCTACTATTGTGG
GTGCTGCAAGTTTAGCTCTAGGATTGTTCTACTTCCTCATACTGAACTCCGCAAAGAATGACCCTACTGTGTGGGGCAATCCTTCCGTTCCTCCTCAAGCAAACATTGCA
ATGGCGCAGCCCCAATTCCCGCCCCCACCTCCACCGAGAACTGCAGACCCCGTGTTTGTTCACGAAGACACGTACATGAGACGACAATTCACGTGATTGGTAAATGTAGG
TCGATACCAAATGCGTTAAAGAAAACCATATAGCTCATACCACATGATTGATGTATCTTTTGAAACTAGGGAATTGAAGGTGAACAAATATGTTGGCTCTATAGAAAATA
GCTGTTTTGGAGTTGTATAGGCTCTCTCTCATGGAGGAACTGATTGAATTTGTGTAAGGAGGTGTTATTTTATATTTCCCTTTGATCCAAGGGGAGTGAATAAACTGTAA
TGTAATAAAATGTATATGCACCGTTGATTTGTAGTGGGAAGCAGTTGTTTAATAATACTGTTGAGGCTCAC
Protein sequenceShow/hide protein sequence
MTFSTLSSYATSNFSACHSPLNNIFKNSSLIRSSTQNSQGFSYSSGDSPDMERKVLAVCSMVTILGILMVATGFAAEGTRTKFNQVIQVTPTTCKYPQSPALGLGLTAAL
SLLLAQITINVSTGCICCRRGPRPPASVTYVFAFLLLLAGAALNDGRGEQSNYFSYYECYVLKPGVFAFATIVGAASLALGLFYFLILNSAKNDPTVWGNPSVPPQANIA
MAQPQFPPPPPPRTADPVFVHEDTYMRRQFT