; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0000384 (gene) of Snake gourd v1 genome

Gene IDTan0000384
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionProtein of unknown function, DUF599
Genome locationLG08:73964914..73965945
RNA-Seq ExpressionTan0000384
SyntenyTan0000384
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR006747 - Protein of unknown function DUF599


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044233.1 DUF599 domain-containing protein [Cucumis melo var. makuwa]4.8e-5863.3Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+SVLLV GYH +LWQCLK+ PEKTS GIQ  GRRAW+E+ LQ    SMQ VQ+LRNNLMIIILRASISIT+S SVAAL NNAY++   
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV
            L SS       +F VKY AAFVVSVSSF+CSSFGVGFLVD+ LL++          P T   HI RL+D GFA AFVGNRLMW +F++L+W LGP+
Subjt:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV

Query:  AVALCSLALVWGFSIMDF
         VAL S ALVWGFS++DF
Subjt:  AVALCSLALVWGFSIMDF

KAG7017182.1 hypothetical protein SDJN02_19044, partial [Cucurbita argyrosperma subsp. argyrosperma]7.8e-6163.8Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+S+LLV GYHA+LWQCLK+ PEKT+ GIQR GRRAWLE  LQ    SMQ VQ LRNNLMIIILRASISI VS SVAAL NNAY+    
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL
            LF SG         +F VKYAAAFVVSVSSF+ SSFGVGFL+D+ +LVS +          T   HIQRLVD GFALAF+GNRLMW++F +L+W L
Subjt:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL

Query:  GPVAVALCSLALVWGFSIMDF
        GP+ VALCS A VWGFS +DF
Subjt:  GPVAVALCSLALVWGFSIMDF

XP_016899668.1 PREDICTED: uncharacterized protein LOC107990610 [Cucumis melo]1.3e-5863.76Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+SVLLV GYH +LWQCLK+ PEKTS GIQ  GRRAW+E+ LQ    SMQ VQ+LRNNLMIIILRASISIT+S SVAAL NNAY++   
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV
            L SS       +F VKYAAAFVVSVSSF+CSSFGVGFLVD+ LL++          P T   HI RL+D GFA AFVGNRLMW +F++L+W LGP+
Subjt:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV

Query:  AVALCSLALVWGFSIMDF
         VAL S ALVWGFS++DF
Subjt:  AVALCSLALVWGFSIMDF

XP_022982646.1 uncharacterized protein LOC111481460 [Cucurbita maxima]4.6e-6163.8Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+S+LLV GYHA+LWQCLK+ PEKT+ GIQR GRRAWLE TLQ    SMQ VQ LRNNLMIIILRASISI VS SVAAL NNAY+    
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL
            LF SG         +F VKYAAAFVVSVSSF+ SSFGVGFL+D+ +LVS +          T   HIQRLVD GFALAF+GNRLMW++F +L+W L
Subjt:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL

Query:  GPVAVALCSLALVWGFSIMDF
        GP+ VALCS A +WGFS +DF
Subjt:  GPVAVALCSLALVWGFSIMDF

XP_023528701.1 uncharacterized protein LOC111791548 [Cucurbita pepo subsp. pepo]7.8e-6163.8Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+S+LLV GYHA+LWQCLK+ PEKT+ GIQR GRRAWLE  LQ    SMQ VQ LRNNLMIIILRASISI VS SVAAL NNAY+    
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL
            LF SG         +F VKYAAAFVVSVSSF+ SSFGVGFL+D+ +LVS +          T   HIQRLVD GFALAF+GNRLMW++F +L+W L
Subjt:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL

Query:  GPVAVALCSLALVWGFSIMDF
        GP+ VALCS A VWGFS +DF
Subjt:  GPVAVALCSLALVWGFSIMDF

TrEMBL top hitse value%identityAlignment
A0A1S4DUM5 uncharacterized protein LOC1079906106.1e-5963.76Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+SVLLV GYH +LWQCLK+ PEKTS GIQ  GRRAW+E+ LQ    SMQ VQ+LRNNLMIIILRASISIT+S SVAAL NNAY++   
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV
            L SS       +F VKYAAAFVVSVSSF+CSSFGVGFLVD+ LL++          P T   HI RL+D GFA AFVGNRLMW +F++L+W LGP+
Subjt:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV

Query:  AVALCSLALVWGFSIMDF
         VAL S ALVWGFS++DF
Subjt:  AVALCSLALVWGFSIMDF

A0A5A7TL41 DUF599 domain-containing protein2.3e-5863.3Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+SVLLV GYH +LWQCLK+ PEKTS GIQ  GRRAW+E+ LQ    SMQ VQ+LRNNLMIIILRASISIT+S SVAAL NNAY++   
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV
            L SS       +F VKY AAFVVSVSSF+CSSFGVGFLVD+ LL++          P T   HI RL+D GFA AFVGNRLMW +F++L+W LGP+
Subjt:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV

Query:  AVALCSLALVWGFSIMDF
         VAL S ALVWGFS++DF
Subjt:  AVALCSLALVWGFSIMDF

A0A5D3DMR3 DUF599 domain-containing protein6.1e-5963.76Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+SVLLV GYH +LWQCLK+ PEKTS GIQ  GRRAW+E+ LQ    SMQ VQ+LRNNLMIIILRASISIT+S SVAAL NNAY++   
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV
            L SS       +F VKYAAAFVVSVSSF+CSSFGVGFLVD+ LL++          P T   HI RL+D GFA AFVGNRLMW +F++L+W LGP+
Subjt:  RELLLFSSGA-----MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPV

Query:  AVALCSLALVWGFSIMDF
         VAL S ALVWGFS++DF
Subjt:  AVALCSLALVWGFSIMDF

A0A6J1F3D7 uncharacterized protein LOC1114417781.6e-5663.38Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+S+LLV GYHA+LWQCLK+ PEKT+ GIQR GRRAWLE  LQ    SMQ VQ LRNNLMIIILRASISI VS SVAAL NNAY+    
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL
            LF SG         +F VKYAAAFVVSVSSF+ SSFGVGFL+D+ +LVS +          T   HIQRLVD GFALAF+GNRLMW++F +L+W L
Subjt:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL

Query:  GPVAVALCSLALV
        GP+ VALCS A V
Subjt:  GPVAVALCSLALV

A0A6J1IZX2 uncharacterized protein LOC1114814602.2e-6163.8Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        MEE Y+D  LMS+S+LLV GYHA+LWQCLK+ PEKT+ GIQR GRRAWLE TLQ    SMQ VQ LRNNLMIIILRASISI VS SVAAL NNAY+    
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL
            LF SG         +F VKYAAAFVVSVSSF+ SSFGVGFL+D+ +LVS +          T   HIQRLVD GFALAF+GNRLMW++F +L+W L
Subjt:  RELLLFSSG--------AMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFL

Query:  GPVAVALCSLALVWGFSIMDF
        GP+ VALCS A +WGFS +DF
Subjt:  GPVAVALCSLALVWGFSIMDF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT4G31330.1 Protein of unknown function, DUF5991.0e-1830.56Show/hide
Query:  EFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQP-PVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATAR
        E YLD IL+ + +++ A YH YLW  L+  P  T IG     RR W+   ++     ++  VQTLRN +M   L A+ SI +   +AA+ ++ Y      
Subjt:  EFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQP-PVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATAR

Query:  ELLLFSSGAMFVV--KYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPP--VTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPVAV
           +F +   F+V  KY     + + SF   S  + F+    +L++     +       +T   ++  L++RGF L  VGNRL +    +++W  GPV V
Subjt:  ELLLFSSGAMFVV--KYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPP--VTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPVAV

Query:  ALCSLALVWGFSIMDF
         LCS+ +V     +DF
Subjt:  ALCSLALVWGFSIMDF

AT5G10580.1 Protein of unknown function, DUF5992.0e-1427.27Show/hide
Query:  EEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQP-PVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA
        E++YLD +L+  ++L++ GYH YLW  ++ +P  T +G     RR+W+   ++     ++  VQTLRN +M   L A+  I +   +AA+ ++ Y     
Subjt:  EEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQP-PVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATA

Query:  RELLLFSSGAMFVV--KYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSV-----SDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLG
            ++ +   F V  KY     + + +F   S  + F+    +L++      SD        VT   ++  L+++ F L  VGNRL ++   +++W  G
Subjt:  RELLLFSSGAMFVV--KYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSV-----SDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLG

Query:  PVAVALCSLALVWGFSIMDF
        PV V L S  ++     +DF
Subjt:  PVAVALCSLALVWGFSIMDF

AT5G24600.1 Protein of unknown function, DUF5992.2e-0823.21Show/hide
Query:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQ-PPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAAT
        M+  YLD  L+ + + L+  YH +L   +   P  T +G+    RR W++  ++    + +  VQTLRNN+M   L AS +I +   +A L  +A    +
Subjt:  MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQ-PPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAAT

Query:  ARELLLFSSGAMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGG-----------HIQRLVDRGFALAFVGNRLMWVTFLVL
           +    S   F +K+ A  V  + +F+ +   + +   + +L++V  +  +    V+ GG           ++   V+RG     +G R  + +  + 
Subjt:  ARELLLFSSGAMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGG-----------HIQRLVDRGFALAFVGNRLMWVTFLVL

Query:  VWFLGPVAVALCSLALVWGFSIMD
        +W  GP+ + +    LV     +D
Subjt:  VWFLGPVAVALCSLALVWGFSIMD

AT5G24790.1 Protein of unknown function, DUF5993.7e-1628.25Show/hide
Query:  EEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQT----VQTLRNNLMIIILRASISITVSISVAALANNAYRA
        +++YLD IL+ ++++++  YH YL   ++ NP  T +GI   GRR W+   ++   D+ +T    VQTLRN +M   L A+  + +   +AA+ ++ Y  
Subjt:  EEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQT----VQTLRNNLMIIILRASISITVSISVAALANNAYRA

Query:  ATARELLLFSSGAMFV--VKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTG-----GGHIQRLVDRGFALAFVGNRLMWVTFLVLVW
               +F +   F   +KY     + + SF   S  + FL    +LV++ +      P  +G       H+  + ++G  L  VGNRL +  F +++W
Subjt:  ATARELLLFSSGAMFV--VKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTG-----GGHIQRLVDRGFALAFVGNRLMWVTFLVLVW

Query:  FLGPVAVALCSLALVWGFSIMDF
          GP+ V    L +V   S +DF
Subjt:  FLGPVAVALCSLALVWGFSIMDF

AT5G43180.1 Protein of unknown function, DUF5991.9e-2835.75Show/hide
Query:  DCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQ-PPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAA---TARE
        D I++ +S+L+  GYH +LW   K NP +TS+GI    R++W     +      M  VQ+LRN  M+ IL A+I+I + +S+AA+ NNA++A+   TA +
Subjt:  DCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQ-PPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAA---TARE

Query:  LLLFSS--GAMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQ----ICRPPVTGGG----HIQRLVDRGFALAFVGNRLMWVTFLVLVWFL
         + F S    +FV+KYA+A ++  +SF  SS  + +L+D+  L++   +       C   +TG      + + +++RGF +A VGNR+M V+  +L+W  
Subjt:  LLLFSS--GAMFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQ----ICRPPVTGGG----HIQRLVDRGFALAFVGNRLMWVTFLVLVWFL

Query:  GPVAVALCSLALVWGFSIMDF
        GP+ V   SL LVW     DF
Subjt:  GPVAVALCSLALVWGFSIMDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAATTTTACTTAGATTGCATATTGATGAGCATGAGCGTGTTGCTTGTGGCCGGGTACCACGCGTATCTGTGGCAATGCCTGAAGCAGAACCCCGAGAAGACGAG
CATCGGAATCCAACGGCTGGGTCGGCGAGCCTGGCTCGAGAAGACGCTGCAGCCGCCGGTCGACAGCATGCAGACGGTGCAGACGCTGAGAAACAATCTCATGATCATAA
TTCTCAGAGCTTCCATATCAATTACTGTGAGCATTTCCGTAGCGGCCCTCGCCAACAATGCTTACAGAGCTGCAACGGCTCGAGAATTGTTATTATTCAGTAGTGGGGCG
ATGTTCGTTGTGAAATATGCGGCTGCGTTTGTGGTGTCGGTGTCGAGCTTCGTGTGCAGTTCTTTTGGGGTGGGATTTTTGGTCGACAGTGGCTTGTTGGTTAGTGTTAG
TGATCGGGTTCAAATTTGTCGTCCGCCGGTCACCGGCGGTGGTCATATTCAGAGGCTGGTCGACAGAGGATTTGCATTGGCTTTTGTAGGGAACCGTTTGATGTGGGTTA
CTTTTCTTGTATTGGTATGGTTCCTTGGTCCTGTTGCTGTGGCTCTCTGTTCCTTGGCTCTGGTTTGGGGGTTTTCTATCATGGATTTTAATTAA
mRNA sequenceShow/hide mRNA sequence
ATTGGGTTTAATTGAATTTGCCAAATATTTCTCAATCAGTAAATAATAATCCATTATGGAAGAATTTTACTTAGATTGCATATTGATGAGCATGAGCGTGTTGCTTGTGG
CCGGGTACCACGCGTATCTGTGGCAATGCCTGAAGCAGAACCCCGAGAAGACGAGCATCGGAATCCAACGGCTGGGTCGGCGAGCCTGGCTCGAGAAGACGCTGCAGCCG
CCGGTCGACAGCATGCAGACGGTGCAGACGCTGAGAAACAATCTCATGATCATAATTCTCAGAGCTTCCATATCAATTACTGTGAGCATTTCCGTAGCGGCCCTCGCCAA
CAATGCTTACAGAGCTGCAACGGCTCGAGAATTGTTATTATTCAGTAGTGGGGCGATGTTCGTTGTGAAATATGCGGCTGCGTTTGTGGTGTCGGTGTCGAGCTTCGTGT
GCAGTTCTTTTGGGGTGGGATTTTTGGTCGACAGTGGCTTGTTGGTTAGTGTTAGTGATCGGGTTCAAATTTGTCGTCCGCCGGTCACCGGCGGTGGTCATATTCAGAGG
CTGGTCGACAGAGGATTTGCATTGGCTTTTGTAGGGAACCGTTTGATGTGGGTTACTTTTCTTGTATTGGTATGGTTCCTTGGTCCTGTTGCTGTGGCTCTCTGTTCCTT
GGCTCTGGTTTGGGGGTTTTCTATCATGGATTTTAATTAATTTGCTGTTAAATCACACTCACGAGTGTAAAAGTTGTTTGAAGCGGTAAGTGAATTATAAAATATGATAG
ATTATAATAAACTTTATGTTTATAATGGATAGTTATGAGTTATAATATAGTTTGTATTTAGAATGAAAAGTAACCTACGTCCAAAGAGTTTCGTTGTGATTTATTGATGC
AAAAACTAGGTGTTATTAGTATATCAAGCCTTTTTTTCAGTTAGAGACCCAATTTCAAATTCTTAGACTACCCAAACTTGTCCCACATTTTATTTCTTTTATTTTTTTTT
CTCTTTTAGGTTAGATTTTATCATTCAAGTTGTACTTAGGTG
Protein sequenceShow/hide protein sequence
MEEFYLDCILMSMSVLLVAGYHAYLWQCLKQNPEKTSIGIQRLGRRAWLEKTLQPPVDSMQTVQTLRNNLMIIILRASISITVSISVAALANNAYRAATARELLLFSSGA
MFVVKYAAAFVVSVSSFVCSSFGVGFLVDSGLLVSVSDRVQICRPPVTGGGHIQRLVDRGFALAFVGNRLMWVTFLVLVWFLGPVAVALCSLALVWGFSIMDFN