; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0012468 (gene) of Snake gourd v1 genome

Gene IDTan0012468
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function, DUF538
Genome locationLG02:56337325..56339954
RNA-Seq ExpressionTan0012468
SyntenyTan0012468
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007493 - Protein of unknown function DUF538
IPR036758 - At5g01610-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008441719.1 PREDICTED: uncharacterized protein LOC103485796 [Cucumis melo]4.3e-8286.02Show/hide
Query:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS
        M L L SPF    F +FFASPL+AA DPSTIYDHLHLHGLPIGLLPKNIT+FSIDSSTGRF VFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS
Subjt:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS

Query:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        QELFLWFPVKGIRVDL +SGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFA + AS+D D  FS  EAQ+L LEDSELRATS
Subjt:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

XP_022946011.1 uncharacterized protein LOC111450229 [Cucurbita moschata]5.3e-8090.48Show/hide
Query:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS
        MILS+SSPFSLL+F+LFFASPL+AA +P+TIYDHLH HGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRL+YGQIAELAGISS
Subjt:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS

Query:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFS
        QELFLWFPVKGIRVDL TSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQS AL G S+ LD AFS
Subjt:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFS

XP_022949506.1 uncharacterized protein LOC111452833 [Cucurbita moschata]4.2e-8592.05Show/hide
Query:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
        LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
Subjt:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK

Query:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        GI VDL +SG+IHFDVGVVDKQFSLSLFESPPDCTAADPVD SF L+GASVDL+VA SM+EAQ+L LED ELRA S
Subjt:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

XP_022998326.1 uncharacterized protein LOC111492997 [Cucurbita maxima]1.6e-8491.48Show/hide
Query:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
        LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPK+ITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
Subjt:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK

Query:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        GI VDL +SG+IHFDVGVVDKQFSLSLFESPPDCTAADPVD SF L+GASVDL+VA SM+EAQ+L LED ELRA S
Subjt:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

XP_038891231.1 uncharacterized protein LOC120080586 [Benincasa hispida]8.4e-8689.84Show/hide
Query:  MILSLSSPFSLLAFLL-FFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGIS
        M LSLSS FSLL F+L FFASPL+AA DPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGIS
Subjt:  MILSLSSPFSLLAFLL-FFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGIS

Query:  SQELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        SQELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSF  +GAS+D    FS  EAQ+L LEDSELRATS
Subjt:  SQELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

TrEMBL top hitse value%identityAlignment
A0A1S3B435 uncharacterized protein LOC1034857962.1e-8286.02Show/hide
Query:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS
        M L L SPF    F +FFASPL+AA DPSTIYDHLHLHGLPIGLLPKNIT+FSIDSSTGRF VFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS
Subjt:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS

Query:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        QELFLWFPVKGIRVDL +SGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFA + AS+D D  FS  EAQ+L LEDSELRATS
Subjt:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

A0A6J1CU10 uncharacterized protein LOC1110142132.4e-7881.91Show/hide
Query:  ILSLSSPFSLLAFLLF---FASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGI
        + SLSSPFSL+  LL    F++   A++DPSTIYDHLHLHGLPIGLLPKNIT FS+DS+TGRFQVFLDQPCNAKFENEVHYD NVSG LSYGQIAELAGI
Subjt:  ILSLSSPFSLLAFLLF---FASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGI

Query:  SSQELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        SSQELFLWFPVKGIR+DL TSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQS A  GA V    AFS NE Q+LPLEDSELRATS
Subjt:  SSQELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

A0A6J1G2H3 uncharacterized protein LOC1114502292.5e-8090.48Show/hide
Query:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS
        MILS+SSPFSLL+F+LFFASPL+AA +P+TIYDHLH HGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRL+YGQIAELAGISS
Subjt:  MILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISS

Query:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFS
        QELFLWFPVKGIRVDL TSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQS AL G S+ LD AFS
Subjt:  QELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFS

A0A6J1GC72 uncharacterized protein LOC1114528332.0e-8592.05Show/hide
Query:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
        LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
Subjt:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK

Query:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        GI VDL +SG+IHFDVGVVDKQFSLSLFESPPDCTAADPVD SF L+GASVDL+VA SM+EAQ+L LED ELRA S
Subjt:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

A0A6J1K9Y4 uncharacterized protein LOC1114929977.7e-8591.48Show/hide
Query:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
        LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPK+ITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK
Subjt:  LLAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVK

Query:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS
        GI VDL +SG+IHFDVGVVDKQFSLSLFESPPDCTAADPVD SF L+GASVDL+VA SM+EAQ+L LED ELRA S
Subjt:  GIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G61667.1 Protein of unknown function, DUF5381.1e-3042.11Show/hide
Query:  LAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVKG
        +  LL    P +  S  S+I + L   GLP GL P N+  +S+D  TG  +V L  PC A+FEN V++D  +   LSYG +  L G++ +ELFLW PVKG
Subjt:  LAFLLFFASPLMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVKG

Query:  IRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDL
        I V+ P+SG++ FD+GV  KQ S SLFE PP C     + +    S   + L
Subjt:  IRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALSGASVDL

AT3G07460.1 Protein of unknown function, DUF5384.5e-3753.62Show/hide
Query:  SLLAFLLFFASPLMAA-SDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFP
        +LL F+L     + A  ++  +I + L  +GLP+GL PK +  F+++  TGRF V+L+Q C AK+E E+HYD  VSG + Y QI +L+GIS+QELFLW  
Subjt:  SLLAFLLFFASPLMAA-SDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFP

Query:  VKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTA
        VKGIRVD+P+SG+I FDVGV+ KQ+SLSLFE+P DC A
Subjt:  VKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTA

AT3G07460.2 Protein of unknown function, DUF5384.5e-3753.62Show/hide
Query:  SLLAFLLFFASPLMAA-SDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFP
        +LL F+L     + A  ++  +I + L  +GLP+GL PK +  F+++  TGRF V+L+Q C AK+E E+HYD  VSG + Y QI +L+GIS+QELFLW  
Subjt:  SLLAFLLFFASPLMAA-SDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFP

Query:  VKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTA
        VKGIRVD+P+SG+I FDVGV+ KQ+SLSLFE+P DC A
Subjt:  VKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTA

AT3G07470.1 Protein of unknown function, DUF5383.9e-4160.33Show/hide
Query:  AASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVKGIRVDLPTSGVIH
        A S+  TIY+ L  +GLP G+ PK +  F+ D  TGRF V+L+Q C AK+E E+HYD N++G +   QI++L+GIS+QELFLWFPVKGIRVD+P+SG+I+
Subjt:  AASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVKGIRVDLPTSGVIH

Query:  FDVGVVDKQFSLSLFESPPDC
        FDVGVV KQ+SLSLFE+P DC
Subjt:  FDVGVVDKQFSLSLFESPPDC

AT5G16380.1 Protein of unknown function, DUF5383.9e-4158.82Show/hide
Query:  LMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVKGIRVDLPTSGV
        L +  DPS  YD+L    LP G++PK +T FSID  TGRF V L  PC+AKFEN+ H+D+N+SG LS G+I  L+G++ +ELFLWF VKGI VD  +SG+
Subjt:  LMAASDPSTIYDHLHLHGLPIGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVKGIRVDLPTSGV

Query:  IHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALS
        IHFDVGV DKQ SLSLFESP DCTAA+   ++  LS
Subjt:  IHFDVGVVDKQFSLSLFESPPDCTAADPVDQSFALS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATACGCCAAAAGAAAAAGAACGAAGAGTTCACGCAAACCTCAGAAAATCCCATTCTTCTGTTCTTCCAGGCTTTGCCTCTGCTCTCGCGCCGTTCTTTTCTGCTCC
CTTCTTCAACCTCAAAATCCTCTCAATCCCAACCCCATTTCTCAAAATCCCAATTCCAATTCCAATTCCAATTCCAATTCCAAATTCCACCATCACCATGATCCTCTCTC
TCTCTTCCCCTTTCTCACTCCTCGCTTTCCTTCTCTTCTTCGCCTCTCCTCTCATGGCCGCTTCTGACCCATCCACCATCTACGACCATCTCCACCTTCATGGCCTCCCT
ATCGGCCTCCTTCCCAAGAACATCACCAGATTCTCAATCGACTCTTCCACCGGCCGATTCCAGGTCTTCCTCGACCAGCCCTGCAATGCCAAGTTCGAGAATGAGGTTCA
CTATGATTTCAACGTCTCCGGCAGGCTCAGCTACGGCCAGATCGCTGAATTGGCCGGAATTTCCTCGCAGGAGCTCTTTCTTTGGTTTCCTGTTAAGGGAATTCGTGTCG
ATTTGCCCACTTCTGGTGTAATTCATTTCGACGTCGGCGTTGTTGACAAGCAATTCTCTCTGTCTCTCTTTGAGTCCCCGCCTGATTGCACTGCGGCTGATCCGGTTGAT
CAATCCTTCGCGTTGAGTGGAGCTTCTGTCGATTTGGATGTGGCTTTTTCGATGAATGAAGCACAGAGCCTTCCGCTTGAAGACAGTGAACTGCGAGCAACATCATAG
mRNA sequenceShow/hide mRNA sequence
ATGAATACGCCAAAAGAAAAAGAACGAAGAGTTCACGCAAACCTCAGAAAATCCCATTCTTCTGTTCTTCCAGGCTTTGCCTCTGCTCTCGCGCCGTTCTTTTCTGCTCC
CTTCTTCAACCTCAAAATCCTCTCAATCCCAACCCCATTTCTCAAAATCCCAATTCCAATTCCAATTCCAATTCCAATTCCAAATTCCACCATCACCATGATCCTCTCTC
TCTCTTCCCCTTTCTCACTCCTCGCTTTCCTTCTCTTCTTCGCCTCTCCTCTCATGGCCGCTTCTGACCCATCCACCATCTACGACCATCTCCACCTTCATGGCCTCCCT
ATCGGCCTCCTTCCCAAGAACATCACCAGATTCTCAATCGACTCTTCCACCGGCCGATTCCAGGTCTTCCTCGACCAGCCCTGCAATGCCAAGTTCGAGAATGAGGTTCA
CTATGATTTCAACGTCTCCGGCAGGCTCAGCTACGGCCAGATCGCTGAATTGGCCGGAATTTCCTCGCAGGAGCTCTTTCTTTGGTTTCCTGTTAAGGGAATTCGTGTCG
ATTTGCCCACTTCTGGTGTAATTCATTTCGACGTCGGCGTTGTTGACAAGCAATTCTCTCTGTCTCTCTTTGAGTCCCCGCCTGATTGCACTGCGGCTGATCCGGTTGAT
CAATCCTTCGCGTTGAGTGGAGCTTCTGTCGATTTGGATGTGGCTTTTTCGATGAATGAAGCACAGAGCCTTCCGCTTGAAGACAGTGAACTGCGAGCAACATCATAGAT
TAACAATTATGAAACATTGTGGATTTCCCGAGGACTTCGTTCAAGAGTCTGGCTTTGCTGGTCTGGTAAATTTGCTTCTACTATTTGTGGATTGCTTTTGTTTTGTATAG
TCTATGTTTCCTGATAGTCCTGTCTGATGGGGATGAGAGGCTATTGTCTTTATATAAAGAGCTTACAGAAATATTTAGCCTCCAATCCTTCAAGTCTATACTATTAGCTG
AATTTAATCGAGGGAGACTGTTTCTTGGGTCGGCTTATAAGAATCCTAGAAGTCTGGTATAATGTTAGCAGTCGTCTCGTCTGTTGTAAGTGCTTTTTATATCTCTCTAT
GAAGTATATTACTGCTTCTGGCTGTGATTGGTTTTGGCTGCTCACAAAAAGAAGTGTTGTTTTTGTTGTGTTGCCCTGTTATAGCCATAAAACTGGGGCTTGTAAATTGT
ACATTGCAAAAGTGATCTTTTAGTAGTCAAAAGAGTATTTACATGTATTTTATAAAGCTCACATTTTCATTTGATATTTAGGATATAGATCCATATTATTCTTCAACTAT
CCTTCACACAAGTGTTTGTGTTATGAAGCTTAAAAACATAGAGTATCCATACAGTATACTTTTGTAGGAAAAAAAGAAAAGGAATTGTATCAGGTACAAAAATGAGGAGG
ATTATCGTATATCTTGTGAAAGAACCAATTGATAAAGATTAAGAAAAAAGGATATTCAACTGACATACATAAGAAATAGAGGATCACAATCATATATACTCAGTAGACAC
TAGTTATAGTTGTTTTCAAATGTCTTGG
Protein sequenceShow/hide protein sequence
MNTPKEKERRVHANLRKSHSSVLPGFASALAPFFSAPFFNLKILSIPTPFLKIPIPIPIPIPIPNSTITMILSLSSPFSLLAFLLFFASPLMAASDPSTIYDHLHLHGLP
IGLLPKNITRFSIDSSTGRFQVFLDQPCNAKFENEVHYDFNVSGRLSYGQIAELAGISSQELFLWFPVKGIRVDLPTSGVIHFDVGVVDKQFSLSLFESPPDCTAADPVD
QSFALSGASVDLDVAFSMNEAQSLPLEDSELRATS