; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0017654 (gene) of Snake gourd v1 genome

Gene IDTan0017654
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionPentatricopeptide repeat-containing protein
Genome locationLG07:34652953..34655593
RNA-Seq ExpressionTan0017654
SyntenyTan0017654
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR002885 - Pentatricopeptide repeat
IPR011990 - Tetratricopeptide-like helical domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0045802.1 pentatricopeptide repeat-containing protein [Cucumis melo var. makuwa]5.0e-5074.31Show/hide
Query:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV
        M FSLKY TVCS LNR++ RF SLPT  +A+  T  L F    HQSLH SLE+C+SMRELKVLHAQIIL GLVS+N+TLGKLISFC+VSQAGDL YA+LV
Subjt:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV

Query:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FDHL QPNKFMFNCLIRGYSTS H INAI LY KMM+SG LPN+
Subjt:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

KAG6571425.1 Pentatricopeptide repeat-containing protein, mitochondrial, partial [Cucurbita argyrosperma subsp. sororia]3.6e-4878.2Show/hide
Query:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM
        S LNR++FRF SLP + A+K+     DFK P HQSLHLSLERCSSMR+LKVLHAQIIL GLVSE LTLGKL+SFCAVSQAGDL+YA+LVFDH  QPNKFM
Subjt:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM

Query:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FNCLIRGYSTSQH I AI LYFKMM+SG LPNQ
Subjt:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

XP_008457803.1 PREDICTED: pentatricopeptide repeat-containing protein At2g22410, mitochondrial-like [Cucumis melo]5.0e-5074.31Show/hide
Query:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV
        M FSLKY TVCS LNR++ RF SLPT  +A+  T  L F    HQSLH SLE+C+SMRELKVLHAQIIL GLVS+N+TLGKLISFC+VSQAGDL YA+LV
Subjt:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV

Query:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FDHL QPNKFMFNCLIRGYSTS H INAI LY KMM+SG LPN+
Subjt:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

XP_022971672.1 pentatricopeptide repeat-containing protein At2g22410, mitochondrial-like [Cucurbita maxima]2.7e-4878.95Show/hide
Query:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM
        S LNR++FRF SLP + A+K    + DFK P HQSLHLSLERCSSMR+LKVLHAQIIL GLVSE LTLGKL+SFCAVSQAGDL YA+LVFDH  QPNKFM
Subjt:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM

Query:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FNCLIRGYSTSQH INAI LYFKMM+SG LPNQ
Subjt:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

XP_038900653.1 pentatricopeptide repeat-containing protein At2g22410, mitochondrial-like [Benincasa hispida]1.5e-5478.47Show/hide
Query:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV
        MFFSLKY + CS LNR++FR  SLPT  +AK  TF LDFK P HQ LH+SLERCSSMRELKVLHAQIIL GLVSE LTLGKL+SFC+VSQAGDL+YA+LV
Subjt:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV

Query:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FDHL QPNKFMFNCLIRGYS SQH INAI LYFKM++SG LPNQ
Subjt:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

TrEMBL top hitse value%identityAlignment
A0A0A0LLZ3 Uncharacterized protein1.1e-4772.22Show/hide
Query:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV
        M FSLK  TVCS L R+ FRF S+PT  +A   T  LDFK   HQSL  SLE+CSSMRELKVLHA+IIL GLVS+N+TLGKLISFC+VSQ GDL+YA LV
Subjt:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV

Query:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FDHL QPNKFMFNCLIRGYSTS H INAI LY +MM+SG LPN+
Subjt:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

A0A1S3C6H9 pentatricopeptide repeat-containing protein At2g22410, mitochondrial-like2.4e-5074.31Show/hide
Query:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV
        M FSLKY TVCS LNR++ RF SLPT  +A+  T  L F    HQSLH SLE+C+SMRELKVLHAQIIL GLVS+N+TLGKLISFC+VSQAGDL YA+LV
Subjt:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV

Query:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FDHL QPNKFMFNCLIRGYSTS H INAI LY KMM+SG LPN+
Subjt:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

A0A5A7TQY7 Pentatricopeptide repeat-containing protein2.4e-5074.31Show/hide
Query:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV
        M FSLKY TVCS LNR++ RF SLPT  +A+  T  L F    HQSLH SLE+C+SMRELKVLHAQIIL GLVS+N+TLGKLISFC+VSQAGDL YA+LV
Subjt:  MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLV

Query:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FDHL QPNKFMFNCLIRGYSTS H INAI LY KMM+SG LPN+
Subjt:  FDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

A0A6J1EJQ7 pentatricopeptide repeat-containing protein At2g22410, mitochondrial-like5.0e-4877.44Show/hide
Query:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM
        S LNR++FRF S P + A+K+     DFK P HQSLHLSLERCSSMR+LKVLHAQIIL GLVSE LTLGKL+SFCAVSQAGDL+YA+LVFDH  QPNKFM
Subjt:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM

Query:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FNCLIRGYSTSQH I AI LYFKMM+SG LPNQ
Subjt:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

A0A6J1I2L8 pentatricopeptide repeat-containing protein At2g22410, mitochondrial-like1.3e-4878.95Show/hide
Query:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM
        S LNR++FRF SLP + A+K    + DFK P HQSLHLSLERCSSMR+LKVLHAQIIL GLVSE LTLGKL+SFCAVSQAGDL YA+LVFDH  QPNKFM
Subjt:  SCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFM

Query:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ
        FNCLIRGYSTSQH INAI LYFKMM+SG LPNQ
Subjt:  FNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQ

SwissProt top hitse value%identityAlignment
O82380 Pentatricopeptide repeat-containing protein At2g29760, chloroplastic1.8e-1036.54Show/hide
Query:  DHQSLHLSL-ERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMM-QSGV
        + +S H+SL ERC S+R+LK  H  +I  G  S+  +  KL +  A+S    L YA+ VFD + +PN F +N LIR Y++    + +I  +  M+ +S  
Subjt:  DHQSLHLSL-ERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMM-QSGV

Query:  LPNQ
         PN+
Subjt:  LPNQ

Q9CA54 Pentatricopeptide repeat-containing protein At1g746302.4e-1036.46Show/hide
Query:  HQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSG
        H  L L L  C ++R L  +H   I +G+ +++   GKLI  CA+S +  L YA+ +     +P+ FMFN L+RGYS S    N+++++ +MM+ G
Subjt:  HQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSG

Q9FJY7 Pentatricopeptide repeat-containing protein At5g665209.6e-1238.3Show/hide
Query:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGD-LYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPN
        L+RCS   ELK +HA+++  GL+ ++  + K +SFC  S + D L YA++VFD   +P+ F++N +IRG+S S     ++ LY +M+ S    N
Subjt:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGD-LYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPN

Q9LXY5 Pentatricopeptide repeat-containing protein At3g565501.3e-1139.56Show/hide
Query:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHL-CQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGV
        L+ C+SM++L+ +H+ +I++GL         L+ FCAVS  G L +A+L+FDH    P+   +N LIRG+S S   +N+I  Y +M+ S V
Subjt:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHL-CQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGV

Q9SJG6 Pentatricopeptide repeat-containing protein At2g42920, chloroplastic4.3e-1240.59Show/hide
Query:  LHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQS--GVLPNQ
        L L   +CS+MRELK +HA +I  GL+S+ +T  ++++FC  S + D+ YA LVF  +   N F++N +IRG+S S     AIS++  M+ S   V P +
Subjt:  LHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQS--GVLPNQ

Query:  L
        L
Subjt:  L

Arabidopsis top hitse value%identityAlignment
AT1G74630.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.7e-1136.46Show/hide
Query:  HQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSG
        H  L L L  C ++R L  +H   I +G+ +++   GKLI  CA+S +  L YA+ +     +P+ FMFN L+RGYS S    N+++++ +MM+ G
Subjt:  HQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSG

AT2G29760.1 Tetratricopeptide repeat (TPR)-like superfamily protein1.3e-1136.54Show/hide
Query:  DHQSLHLSL-ERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMM-QSGV
        + +S H+SL ERC S+R+LK  H  +I  G  S+  +  KL +  A+S    L YA+ VFD + +PN F +N LIR Y++    + +I  +  M+ +S  
Subjt:  DHQSLHLSL-ERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMM-QSGV

Query:  LPNQ
         PN+
Subjt:  LPNQ

AT2G42920.1 Pentatricopeptide repeat (PPR-like) superfamily protein3.1e-1340.59Show/hide
Query:  LHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQS--GVLPNQ
        L L   +CS+MRELK +HA +I  GL+S+ +T  ++++FC  S + D+ YA LVF  +   N F++N +IRG+S S     AIS++  M+ S   V P +
Subjt:  LHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQS--GVLPNQ

Query:  L
        L
Subjt:  L

AT3G56550.1 Pentatricopeptide repeat (PPR) superfamily protein8.9e-1339.56Show/hide
Query:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHL-CQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGV
        L+ C+SM++L+ +H+ +I++GL         L+ FCAVS  G L +A+L+FDH    P+   +N LIRG+S S   +N+I  Y +M+ S V
Subjt:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHL-CQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGV

AT5G66520.1 Tetratricopeptide repeat (TPR)-like superfamily protein6.8e-1338.3Show/hide
Query:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGD-LYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPN
        L+RCS   ELK +HA+++  GL+ ++  + K +SFC  S + D L YA++VFD   +P+ F++N +IRG+S S     ++ LY +M+ S    N
Subjt:  LERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGD-LYYAKLVFDHLCQPNKFMFNCLIRGYSTSQHSINAISLYFKMMQSGVLPN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTTTTTCTCTCAAATACTATACTGTTTGTTCCTGCCTTAATCGCACTATCTTTCGTTTCCCCTCTTTACCAACTCTTCTTGCTGCAAAAACTGTCACTTTTGCCCT
TGACTTCAAATTCCCTGATCATCAATCCCTTCACCTTTCATTAGAGAGATGCTCCTCCATGAGAGAGCTCAAGGTCCTCCATGCCCAAATCATCCTCCATGGATTGGTTT
CTGAAAATTTGACCCTTGGAAAATTGATTTCTTTCTGTGCTGTTTCTCAAGCTGGGGATCTTTACTATGCCAAACTTGTTTTTGACCACCTTTGCCAACCCAACAAGTTT
ATGTTCAATTGCTTGATAAGGGGATACTCCACTTCTCAACATTCAATAAATGCTATTTCTCTCTACTTTAAAATGATGCAATCTGGGGTTTTACCCAATCAATTACTCTC
CCATTTGTACTCAAGGCTTGTGCTTCTCAGTATGCATACTGGGAAGCTCTTGTTGTCCATTGCCATGCTATTCGACTAG
mRNA sequenceShow/hide mRNA sequence
TGGATTTAGGAGTGCTTTAGTGTAAGTAATTCTTACATGCGGTTTATGTATTAACTCTTAATATTACCTGTGTTGTTGTAAGTTCTCTTCGTTATCATGCTGTATGCTGA
TATTATATACTTATAACTTCTTGAATATAGTTACATATGCGATGAGAATTTGTAGGCATTGTGCTTTGTCCTGATTTGTGTTAATAAGCATGTTGGAAATTTTATAGGAC
GCATGGTTGGGTGACTTAAATTTAGTCGATAGCATGTTGGAAAATTGTGTTTGACTTTGATGTAACTTAAGCTTGCAACGTGGCATGGCCATGATGTGTTTGTAAGGTTT
GCTGTAGGTCTTCGAAGAACATTCAAACGTGTTCATGGAATGGGATTGGGACGTACTCCAATTTAGATATTTAAAACTAAATCTCATCGATTTGCTTGGTCAGAGATCAA
TTCTGAGACGGGTATAGAAGACCATCAAACTGCAAAAGGAAGATACTCATTGGATCTTGAAATAGATACTCTTTTTGATCTCAAATTTGTACTCCTTTCTCTTTCGAGCT
ATGCTGAGATTGGTCCATATCGACAGTTTTATTTCAAGATCGATACATATTTTCAATCTGCAGTGCGTCAAGAAGTCATTGATCCTTTAGATCAGAATCAACAATATATT
TTCTTTATCATCCACTCCACGTCAAGTGCGTCAAGTGCGTGAGCATCATTTTCTTCATGTTTTTTTCTCTCAAATACTATACTGTTTGTTCCTGCCTTAATCGCACTATC
TTTCGTTTCCCCTCTTTACCAACTCTTCTTGCTGCAAAAACTGTCACTTTTGCCCTTGACTTCAAATTCCCTGATCATCAATCCCTTCACCTTTCATTAGAGAGATGCTC
CTCCATGAGAGAGCTCAAGGTCCTCCATGCCCAAATCATCCTCCATGGATTGGTTTCTGAAAATTTGACCCTTGGAAAATTGATTTCTTTCTGTGCTGTTTCTCAAGCTG
GGGATCTTTACTATGCCAAACTTGTTTTTGACCACCTTTGCCAACCCAACAAGTTTATGTTCAATTGCTTGATAAGGGGATACTCCACTTCTCAACATTCAATAAATGCT
ATTTCTCTCTACTTTAAAATGATGCAATCTGGGGTTTTACCCAATCAATTACTCTCCCATTTGTACTCAAGGCTTGTGCTTCTCAGTATGCATACTGGGAAGCTCTTGTT
GTCCATTGCCATGCTATTCGACTAGGAATTTTGTCTCATGTGTGCGTACAGAATTCTCTAATTAATGTTTATACTGTTTGTGGTTTGGTCCAATGCGCACACCAACTGTT
TGATGAAATGTCACATAGAACTCTGGTGTCCTCGAATTCGATGATTGGTGGGTATTCTAGGAATATGAAGCCTTTTTGTTGTTTCGAGAGATGAAAGAGTTGGGATTTCA
GCCGGATCAATTCACATTGGTTCATCTACTTTCCATTTGTTCAAGAAGTTATAGCTTAGATATTGGTAAATGTTTGCATCTCTATATTGAGATTACTGGGATTGAGGTTG
ATCAAATTTTAAGAAATGCTCAGAGTGATTCATCCTCTAACTAAACTCTTGGTTGGAATCAATTCGTTAAGTTAGCCTTGTTCTCGAAGTTTACAAAGATTCTAGAAGGA
CCTCTAGCTTTGTTAAACTAACAAGTTCGACGCTTATCATGTTTTATCTCAACGTTTTTTGCATTTTTGTAGATCTGGATATAATCTAAAACTTTGTCTTCTTTGAGATT
TTCTATTTTAGCTTTGTTGATGGCAAAAACTGTATTTTTACAACTTATTCAAAAAACAACTCTACCCGAGAACATGTGTTCTTGATTTGTTCTTGTTGTTTTCTCGCCTT
TAAAGCATGTTTAATACGAAAATTTCAAATTTAATCTGATCTTTATGTTGAAATTTAACCATTGGGAATTCAATCTATGATGTTACATTTTTAAAGTTTACAAAGTGATT
GTTACACATTAAATATGGGCAGTTACATTTCTCGCACATTGAATTTTCGAGCCATTAGTTTTCAAATCAGTAACTCATATATTTTAAAAATCCAGCCATTAGTTTTAAAT
TGTAACTCCATGTTTAAAAGTCCAACCATTGTCCTTAATAGAAGAGTTACATTTCTTGTTCTTAATTTTTTTCTACCCTTAGAATTTCCATAAGTTCATCATTTCATAAT
CCTAGTTTTTCTTGTTTGTATTCAAAGTGTGTTACTATCTATGGGATTGCAAATCTTGGCTTAGATACTTCAAAAAAATTGAGAGGTAAGAATGAGAGCTTATATAGATT
GATTTGTGAAAATGAGTGCAAACACAAGTGATCTACATTTCCTTGAGTGTAAACATGTGAGAGAGTGTTGTGAGGTCTTTTGTGTTTTCTTAACATGGTTTTGTAGCAAT
GTGTTAAGTAACTCCCAAGAATTAGACCTAACAACCTTGTTGAGAAAACAAGACCTTTAGATCTAAAACTTAAAAACCCCAAAACGACATGGAATCAACCCAAGGAAAAG
TTTGGAACGGATTACCTTGATGATCAAGATCAAGAGTAAAGAAAACTCGTTTCTAAACTCGTGATTCGAATCACTCCACAAGAGGTACGATCAACACCACTTGAATGACT
C
Protein sequenceShow/hide protein sequence
MFFSLKYYTVCSCLNRTIFRFPSLPTLLAAKTVTFALDFKFPDHQSLHLSLERCSSMRELKVLHAQIILHGLVSENLTLGKLISFCAVSQAGDLYYAKLVFDHLCQPNKF
MFNCLIRGYSTSQHSINAISLYFKMMQSGVLPNQLLSHLYSRLVLLSMHTGKLLLSIAMLFD