; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0003318 (gene) of Snake gourd v1 genome

Gene IDTan0003318
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function (DUF1218)
Genome locationLG01:4370896..4374206
RNA-Seq ExpressionTan0003318
SyntenyTan0003318
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR009606 - Modifying wall lignin-1/2


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6576843.1 Protein MODIFYING WALL LIGNIN-2, partial [Cucurbita argyrosperma subsp. sororia]2.0e-9683.8Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        ME+KAL VCSVVAFLGLLLVATGFAAEGTRVK +QV++V+PT CKYP+SPA  LGLTAALSLLLAQ +INVSTGCICCTRGPRPP SKWRT V+CFV+SW
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA
        FTFVIAFLLL+ GAALND R EQ  YF YY CYVLKPGVFAVAT+VGAASLALGLFYYLILNSAKNDP VWGNPSIPP ANIAM QPQFPPPP  +  TA
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

KAG7014863.1 hypothetical protein SDJN02_22493 [Cucurbita argyrosperma subsp. argyrosperma]2.0e-9683.8Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        ME+KAL VCSVVAFLGLLLVATGFAAEGTRVK +QV++V+PT CKYP+SPA  LGLTAALSLLLAQ +INVSTGCICCTRGPRPP SKWRT V+CFV+SW
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA
        FTFVIAFLLL+ GAALND R EQ  YF YY CYVLKPGVFAVAT+VGAASLALGLFYYLILNSAKNDP VWGNPSIPP ANIAM QPQFPPPP  +  TA
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

XP_022140830.1 uncharacterized protein LOC111011403 [Momordica charantia]5.0e-10083.8Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        ME+KA+ VCSVVAFLGLL+VATGFAAEGTR+KLSQVI+V+P TC YPRSPA+GLGL AALSLL+AQ  INVSTGCICC RGPRPP SKWRTTV+CFVISW
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQF--PPPPTQRTA
        FTF+IAFLLL+ GAALNDRRGE+ YYFGYYECYVLKPGVFAVATI+  AS+ LGLFYYLILNSAKN+PTVWGNPS+PPQANIAMGQPQF  PPPP QR+ 
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQF--PPPPTQRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQ+T
Subjt:  DPVFVHEDTYMRRQFT

XP_022922463.1 uncharacterized protein LOC111430466 [Cucurbita moschata]2.6e-9683.8Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        ME+KAL VCSVVAFLGLLLVATGFAAEGTRVK +QV++V+PT CKYP+SPA  LGLTAALSLLLAQ +INVSTGCICCTRGPRPP SKWRT V+CFV+SW
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA
        FTFVIAFLLL+ GAALND R EQ  YF YY CYVLKPGVFAVAT+VGAASLALGLFYYLILNSAKNDP VWGNPSIPP ANIAM QPQFPPPP  +  TA
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

XP_023551924.1 uncharacterized protein LOC111809750 [Cucurbita pepo subsp. pepo]8.3e-9582.87Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        ME+KAL VCSVVA LGLLLVATGFAAEGTRVK +QV++V+PT CKYP+SPA  LGLTAALSLLLAQ +INVSTGCICCTRGPRPP SKWRT V+CFV+SW
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA
        FTFVIAFLLL+ GAALN  R EQ  YF YY CYVLKPGVFAVAT+VGAASLALGLFYYLILNSAKNDP VWGNPSIPP ANIAM QPQFPPPP  +  TA
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

TrEMBL top hitse value%identityAlignment
A0A0A0KU80 Uncharacterized protein2.7e-8374.42Show/hide
Query:  KKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFT
        K AL+V  VVA LG++++ATGFAAE TR K +QV +V+P  CKYPRSPA+GLGLTAALSLL AQ  I  STGC+CC RGPRPP SKWRT VICF ISW T
Subjt:  KKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFT

Query:  FVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP-TVWGNPSIPPQANIAMGQPQF--PPPPTQRTAD
        +VIAFLL + GAALN+ RGEQR YF  Y+CYVLKPGVF+ ATIVG ASL LG+ Y+LILNSAKNDP TVWG+PS+PPQ NIAM QPQF  PPPP QRTAD
Subjt:  FVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP-TVWGNPSIPPQANIAMGQPQF--PPPPTQRTAD

Query:  PVFVHEDTYMRRQFT
        PVFVHEDTYMRRQFT
Subjt:  PVFVHEDTYMRRQFT

A0A1S3BX72 uncharacterized protein LOC1034944355.6e-8172.73Show/hide
Query:  MEKK-ALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVIS
        ME+K ALVV  VVAFLG++++ATGFAAE TR K  QV  V+P  CKYPRSPAMGLG TAALSLL AQ  I  STGC+CC RGPRPP  KWRT VICF +S
Subjt:  MEKK-ALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVIS

Query:  WFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP-TVWGNPSIPPQANIAMGQPQF----PPPPT
        W T+VIAFLL + GAALN+ R +QR Y G YECYVLKPGVF+ ATIVG ASL LG+ Y+LILNSAKNDP TVWG+PS+PPQ NIAM QPQF    PPPP 
Subjt:  WFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP-TVWGNPSIPPQANIAMGQPQF----PPPPT

Query:  QRTADPVFVHEDTYMRRQFT
        QRT DPVFVHEDTYMRRQFT
Subjt:  QRTADPVFVHEDTYMRRQFT

A0A5D3D069 DUF1218 domain-containing protein5.6e-8172.73Show/hide
Query:  MEKK-ALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVIS
        ME+K ALVV  VVAFLG++++ATGFAAE TR K  QV  V+P  CKYPRSPAMGLG TAALSLL AQ  I  STGC+CC RGPRPP  KWRT VICF +S
Subjt:  MEKK-ALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVIS

Query:  WFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP-TVWGNPSIPPQANIAMGQPQF----PPPPT
        W T+VIAFLL + GAALN+ R +QR Y G YECYVLKPGVF+ ATIVG ASL LG+ Y+LILNSAKNDP TVWG+PS+PPQ NIAM QPQF    PPPP 
Subjt:  WFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP-TVWGNPSIPPQANIAMGQPQF----PPPPT

Query:  QRTADPVFVHEDTYMRRQFT
        QRT DPVFVHEDTYMRRQFT
Subjt:  QRTADPVFVHEDTYMRRQFT

A0A6J1CG96 uncharacterized protein LOC1110114032.4e-10083.8Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        ME+KA+ VCSVVAFLGLL+VATGFAAEGTR+KLSQVI+V+P TC YPRSPA+GLGL AALSLL+AQ  INVSTGCICC RGPRPP SKWRTTV+CFVISW
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQF--PPPPTQRTA
        FTF+IAFLLL+ GAALNDRRGE+ YYFGYYECYVLKPGVFAVATI+  AS+ LGLFYYLILNSAKN+PTVWGNPS+PPQANIAMGQPQF  PPPP QR+ 
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQF--PPPPTQRTA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTYMRRQ+T
Subjt:  DPVFVHEDTYMRRQFT

A0A6J1E6P1 uncharacterized protein LOC1114304661.2e-9683.8Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        ME+KAL VCSVVAFLGLLLVATGFAAEGTRVK +QV++V+PT CKYP+SPA  LGLTAALSLLLAQ +INVSTGCICCTRGPRPP SKWRT V+CFV+SW
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA
        FTFVIAFLLL+ GAALND R EQ  YF YY CYVLKPGVFAVAT+VGAASLALGLFYYLILNSAKNDP VWGNPSIPP ANIAM QPQFPPPP  +  TA
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQR--TA

Query:  DPVFVHEDTYMRRQFT
        DPVFVHEDTY RRQFT
Subjt:  DPVFVHEDTYMRRQFT

SwissProt top hitse value%identityAlignment
A2RVU1 Protein MODIFYING WALL LIGNIN-12.3e-0735.2Show/hide
Query:  TCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVI---SWFTFVIAFLLLIIGAALNDRRGEQRYYFGYY--ECYVLKP
        +C  P + A GLG+ A + + +AQ + NV    IC  RG      K RTT+ C ++   SW  F +A  L+ +GA++N    EQ Y  G+   ECY++K 
Subjt:  TCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVI---SWFTFVIAFLLLIIGAALNDRRGEQRYYFGYY--ECYVLKP

Query:  GVFAVATIVGAASLA--LGLFYYLI
        GVFA +  +   ++A  LG F + +
Subjt:  GVFAVATIVGAASLA--LGLFYYLI

Arabidopsis top hitse value%identityAlignment
AT1G13380.1 Protein of unknown function (DUF1218)2.5e-0928.9Show/hide
Query:  KKALVVCSVVAFLGLLLVATGFAAEGTRV--KLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW
        K + +V  +V  L L+      AAE  R   K  Q    + T C Y    A G G+ A L LL +++++   T C+C  R P  P S    ++I F+ SW
Subjt:  KKALVVCSVVAFLGLLLVATGFAAEGTRV--KLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISW

Query:  FTFVIAFLLLIIGAALNDRRGEQRYYFGY-----YECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP
         TF++A   +I GA  N       Y+  Y     + C  L+ G+F    +   A++ L ++YY+    + + P
Subjt:  FTFVIAFLLLIIGAALNDRRGEQRYYFGY-----YECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDP

AT1G68220.1 Protein of unknown function (DUF1218)2.2e-0831.06Show/hide
Query:  VCSVVAFLGLLLVATGFAAEGTRVKLSQVIEV--SPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFTFVI
        + +VV  L LL     F AE  R     V +     T CKY    +   G++A   LL++Q ++N  T C+C  +G     S     ++ FV+SW +F+ 
Subjt:  VCSVVAFLGLLLVATGFAAEGTRVKLSQVIEV--SPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFTFVI

Query:  AFLLLIIGAALN--DRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSA
        A   L+ G+A N    + E  Y      C VL  GVFA        SL   + YYL  + A
Subjt:  AFLLLIIGAALN--DRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSA

AT4G27435.1 Protein of unknown function (DUF1218)5.7e-0929.01Show/hide
Query:  VVCSVVAFLGLLLVATGFAAEGTR--VKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFTFV
        +V ++V    L+      AAE  R   ++ Q  EV    C Y    A G G+ A L  + +Q +I + + C CC + P  P       +I F++SW  F+
Subjt:  VVCSVVAFLGLLLVATGFAAEGTR--VKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFTFV

Query:  IAFLLLIIGAALNDRRGEQRYYF--GYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSA
        IA + L+ G+  N    + R  F     +C  L+ GVFA        +  +  FYY    SA
Subjt:  IAFLLLIIGAALNDRRGEQRYYF--GYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSA

AT5G17210.1 Protein of unknown function (DUF1218)6.2e-5651.61Show/hide
Query:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQV---IEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFV
        ME++ +V+C V+  LGLL   T F AE TR+K SQV   +  S T C YPRSPA  LG T+AL L++AQ +++VS+GC CC +GP P  S W  ++ICFV
Subjt:  MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQV---IEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFV

Query:  ISWFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQRT
        +SWFTFVIAFL+L+ GAALND   E+    G Y CY++KPGVF+   ++   ++ALG+ YYL L S K         +      IAMGQPQ P    +R 
Subjt:  ISWFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQRT

Query:  ADPVFVHEDTYMRRQFT
         DPVFVHEDTYMRRQFT
Subjt:  ADPVFVHEDTYMRRQFT

AT5G17210.2 Protein of unknown function (DUF1218)4.0e-4753.14Show/hide
Query:  SPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGV
        S T C YPRSPA  LG T+AL L++AQ +++VS+GC CC +GP P  S W  ++ICFV+SWFTFVIAFL+L+ GAALND   E+    G Y CY++KPGV
Subjt:  SPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFTFVIAFLLLIIGAALNDRRGEQRYYFGYYECYVLKPGV

Query:  FAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQRTADPVFVHEDTYMRRQFT
        F+   ++   ++ALG+ YYL L S K         +      IAMGQPQ P    +R  DPVFVHEDTYMRRQFT
Subjt:  FAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQRTADPVFVHEDTYMRRQFT


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGAAGAAGGCTCTGGTTGTTTGCTCTGTGGTCGCCTTCTTAGGGCTTTTGTTGGTCGCCACTGGCTTCGCCGCTGAGGGCACAAGAGTTAAGCTTAGTCAAGTTAT
TGAAGTCAGTCCTACTACATGCAAATATCCCCGAAGTCCTGCGATGGGCCTTGGTTTGACTGCAGCTCTATCACTTTTGCTTGCTCAGACAATGATAAATGTTTCGACGG
GATGCATTTGCTGCACGAGGGGCCCTCGGCCTCCTCCTTCTAAATGGCGAACGACCGTGATCTGCTTCGTCATTTCCTGGTTTACATTTGTGATAGCATTCCTCCTGTTG
ATCATCGGTGCTGCACTAAACGACCGACGAGGCGAACAACGCTACTATTTTGGTTACTACGAGTGCTATGTTCTGAAACCGGGAGTTTTTGCTGTTGCTACCATTGTGGG
TGCTGCAAGTTTAGCACTGGGATTGTTCTATTACCTCATATTGAACTCTGCAAAGAACGACCCAACTGTGTGGGGCAATCCTTCCATTCCTCCTCAAGCAAACATTGCAA
TGGGGCAGCCCCAATTCCCTCCCCCTCCTACACAGAGAACTGCAGACCCGGTATTCGTTCATGAAGATACGTACATGAGACGACAATTCACGTGA
mRNA sequenceShow/hide mRNA sequence
CTCTCACCCCATCGGACAAAACGCCAAATTTCAGTGCCTGTCATTTTCCTCAACAACTTTTACTATACGCTTCTCCACCACGGTTTCTTCGCCCATCGAAATCTCGACCT
CAATTCGAAACTCCACAAAATCTCGCCGGTTTTTTTCTCCCCGGAGACTCACCGGAGGGCGTCGGCGGCGGTGGAGATGGAGAAGAAGGCTCTGGTTGTTTGCTCTGTGG
TCGCCTTCTTAGGGCTTTTGTTGGTCGCCACTGGCTTCGCCGCTGAGGGCACAAGAGTTAAGCTTAGTCAAGTTATTGAAGTCAGTCCTACTACATGCAAATATCCCCGA
AGTCCTGCGATGGGCCTTGGTTTGACTGCAGCTCTATCACTTTTGCTTGCTCAGACAATGATAAATGTTTCGACGGGATGCATTTGCTGCACGAGGGGCCCTCGGCCTCC
TCCTTCTAAATGGCGAACGACCGTGATCTGCTTCGTCATTTCCTGGTTTACATTTGTGATAGCATTCCTCCTGTTGATCATCGGTGCTGCACTAAACGACCGACGAGGCG
AACAACGCTACTATTTTGGTTACTACGAGTGCTATGTTCTGAAACCGGGAGTTTTTGCTGTTGCTACCATTGTGGGTGCTGCAAGTTTAGCACTGGGATTGTTCTATTAC
CTCATATTGAACTCTGCAAAGAACGACCCAACTGTGTGGGGCAATCCTTCCATTCCTCCTCAAGCAAACATTGCAATGGGGCAGCCCCAATTCCCTCCCCCTCCTACACA
GAGAACTGCAGACCCGGTATTCGTTCATGAAGATACGTACATGAGACGACAATTCACGTGATCACTGATCGGTAAATGTAGGTCGATATCGAATGCGTTTTAAGAAAACC
ATGTAACTCATACACTTGATTTATGTATCTTTGAAACCAGGCATTGAAGGTGAACAAATTAAATAGCTGTTTTGGAGTTGTATAGGATCTCTCTTAGAGGAACTGAATTG
AATTTATAAATGGAGGTGACATTTTATTTCCCTTTGATCCAAGGGGAAGTAAATAAAGCTGTTGTCATGTAATAAAATGTATATCCACTGTTGATTTGTAGTGGGAAGCA
GTTGTTAAATTGTTTGGAGGCTTACAGAATAACTGAAGAATATTGGGCTTGGATTTGAGGTTGTTAGATTAAAAATATTTATACTTTTGTTTGTAATAAAGGACATTTCT
ATCATTGTAATTTATTTATGTGAG
Protein sequenceShow/hide protein sequence
MEKKALVVCSVVAFLGLLLVATGFAAEGTRVKLSQVIEVSPTTCKYPRSPAMGLGLTAALSLLLAQTMINVSTGCICCTRGPRPPPSKWRTTVICFVISWFTFVIAFLLL
IIGAALNDRRGEQRYYFGYYECYVLKPGVFAVATIVGAASLALGLFYYLILNSAKNDPTVWGNPSIPPQANIAMGQPQFPPPPTQRTADPVFVHEDTYMRRQFT