; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Carg12434 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene IDCarg12434
OrganismCucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
DescriptionProtein of unknown function, DUF599
Genome locationCarg_Chr14:48349..49005
RNA-Seq ExpressionCarg12434
SyntenyCarg12434
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006747 - Protein of unknown function DUF599


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7017182.1 hypothetical protein SDJN02_19044, partial [Cucurbita argyrosperma subsp. argyrosperma]1.1e-110100Show/hide
Query:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
        MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
Subjt:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK

Query:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
        TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
Subjt:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA

Query:  SVWGFSFVDFVGKSVKHM
        SVWGFSFVDFVGKSVKHM
Subjt:  SVWGFSFVDFVGKSVKHM

KGN58953.2 hypothetical protein Csa_002405 [Cucumis sativus]1.7e-7976.5Show/hide
Query:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
        ++ +MEELYID TLMSLS+LLVVGYH HLWQCLKKKPEKTT GIQREGRRAW+E ALQ+EGGSMQVVQ LRNNLMIIILRASISI +SSSVAALTNNAYK
Subjt:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK

Query:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
        ++  F   T  S   S LFAVKYAAAFVVSVSSFL SSFGVGFL+DTCML++T T +THI RL+DTGFA AF+G+RLMW S  +LLWSLGPIPVAL SFA
Subjt:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA

Query:  SVWGFSFVDFVGKSVKH
         VWGFS  DFV KS  +
Subjt:  SVWGFSFVDFVGKSVKH

XP_022934642.1 uncharacterized protein LOC111441778 [Cucurbita moschata]8.7e-10099.5Show/hide
Query:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
        MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
Subjt:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK

Query:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
        TQQLFRSGTQSSAA+SGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
Subjt:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA

Query:  SV
        SV
Subjt:  SV

XP_022982646.1 uncharacterized protein LOC111481460 [Cucurbita maxima]9.3e-11098.62Show/hide
Query:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
        MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLEL LQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
Subjt:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK

Query:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
        TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
Subjt:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA

Query:  SVWGFSFVDFVGKSVKHM
        S+WGFSF+DFVGKSVKHM
Subjt:  SVWGFSFVDFVGKSVKHM

XP_023528701.1 uncharacterized protein LOC111791548 [Cucurbita pepo subsp. pepo]1.4e-11099.54Show/hide
Query:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
        MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
Subjt:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK

Query:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
        TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
Subjt:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA

Query:  SVWGFSFVDFVGKSVKHM
        SVWGFSF+DFVGKSVKHM
Subjt:  SVWGFSFVDFVGKSVKHM

TrEMBL top hitse value%identityAlignment
A0A1S4DUM5 uncharacterized protein LOC1079906104.1e-7977.46Show/hide
Query:  MEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL
        MEELYID TLMSLS+LLVVGYH HLWQCLKKKPEKT+ GIQ EGRRAW+E ALQ+EGGSMQVVQ LRNNLMIIILRASISI +SSSVAALTNNAYK++  
Subjt:  MEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL

Query:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFASVWG
        F  G+   +  S LFAVKYAAAFVVSVSSFL SSFGVGFL+DTC+L++T T +THI RL+DTGFA AF+GNRLMW SF +LLWSLGPIPVAL SFA VWG
Subjt:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFASVWG

Query:  FSFVDFVGKSVKH
        FS VDFV KS  +
Subjt:  FSFVDFVGKSVKH

A0A5A7TL41 DUF599 domain-containing protein5.9e-7876.53Show/hide
Query:  MEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL
        MEELYID TLMSLS+LLVVGYH HLWQCLKKKPEKT+ GIQ EGRRAW+E ALQ+EGGSMQVVQ LRNNLMIIILRASISI +SSSVAALTNNAYK++  
Subjt:  MEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL

Query:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFASVWG
        F  G+   +  S LFAVKY AAFVVSVSSFL SSFGVGFL+DTC+L++T T +THI RL+D GFA AF+GNRLMW SF +LLWSLGPIPVAL SFA VWG
Subjt:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFASVWG

Query:  FSFVDFVGKSVKH
        FS VDFV KS  +
Subjt:  FSFVDFVGKSVKH

A0A5D3DMR3 DUF599 domain-containing protein4.1e-7977.46Show/hide
Query:  MEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL
        MEELYID TLMSLS+LLVVGYH HLWQCLKKKPEKT+ GIQ EGRRAW+E ALQ+EGGSMQVVQ LRNNLMIIILRASISI +SSSVAALTNNAYK++  
Subjt:  MEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL

Query:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFASVWG
        F  G+   +  S LFAVKYAAAFVVSVSSFL SSFGVGFL+DTC+L++T T +THI RL+DTGFA AF+GNRLMW SF +LLWSLGPIPVAL SFA VWG
Subjt:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFASVWG

Query:  FSFVDFVGKSVKH
        FS VDFV KS  +
Subjt:  FSFVDFVGKSVKH

A0A6J1F3D7 uncharacterized protein LOC1114417784.2e-10099.5Show/hide
Query:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
        MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
Subjt:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK

Query:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
        TQQLFRSGTQSSAA+SGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
Subjt:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA

Query:  SV
        SV
Subjt:  SV

A0A6J1IZX2 uncharacterized protein LOC1114814604.5e-11098.62Show/hide
Query:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
        MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLEL LQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK
Subjt:  MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYK

Query:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
        TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA
Subjt:  TQQLFRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFA

Query:  SVWGFSFVDFVGKSVKHM
        S+WGFSF+DFVGKSVKHM
Subjt:  SVWGFSFVDFVGKSVKHM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G18215.1 Protein of unknown function, DUF5992.9e-0824.42Show/hide
Query:  IDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWL--ELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQ----Q
        +D  L+   L+++V YH  L   +  +P+ T   +  E RR W+   +   L+ G++  VQ +RNN+M   L A+ +I + S +    +N+  ++     
Subjt:  IDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWL--ELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQ----Q

Query:  LFRSGTQSSAASSGLFAVK--YAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFAS
        L         AS   FA+   +  AF+ ++ S  + +  V FL+   +         ++ R ++       +G R  + SF L LW+ GPIP+ +C    
Subjt:  LFRSGTQSSAASSGLFAVK--YAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFAS

Query:  VWGFSFVDFVGKSVKHM
             F+D      +H+
Subjt:  VWGFSFVDFVGKSVKHM

AT4G31330.1 Protein of unknown function, DUF5998.3e-1628.71Show/hide
Query:  ELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWL-ELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQLF
        E Y+D  L+ L L++   YH +LW  L+ +P  T  G     RR W+  +    +  ++  VQ LRN +M   L A+ SI + + +AA+ ++ Y  ++  
Subjt:  ELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWL-ELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQLF

Query:  RSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVST------------ATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIP
               A    + A+KY     + + SF   S  + F+    +L++T             TA  ++  L++ GF L  +GNRL + +  L+LW  GP+ 
Subjt:  RSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVST------------ATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIP

Query:  VALCSFASV
        V LCS   V
Subjt:  VALCSFASV

AT5G10580.1 Protein of unknown function, DUF5994.3e-1225.34Show/hide
Query:  EELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQ-LEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL
        E+ Y+D+ L+  +LL++ GYH +LW  ++  P  T  G     RR+W+   ++  E  ++  VQ LRN +M   L A+  I + + +AA+ ++ Y  ++ 
Subjt:  EELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQ-LEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQL

Query:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVS--------------TATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLG
                A      A+KY     + + +F   S  + F+    +L++              +     ++  L++  F L  +GNRL ++   L+LW  G
Subjt:  FRSGTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVS--------------TATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLG

Query:  PIPVALCSFASVWGFSFVDFV
        P+ V L S   +     +DFV
Subjt:  PIPVALCSFASVWGFSFVDFV

AT5G24790.1 Protein of unknown function, DUF5991.4e-1527.06Show/hide
Query:  YIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQ-LEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQLFRS
        Y+D+ L+ L+L++++ YH +L   ++  P  T  GI   GRR W+   ++  +  ++  VQ LRN +M   L A+  + + + +AA+ ++ Y  ++    
Subjt:  YIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQ-LEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQLFRS

Query:  GTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTAT-----------ASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVAL
             A      ++KY     + + SF F S  + FL    +LV+               S H+  + + G  L  +GNRL +  F+L+LW  GPI V  
Subjt:  GTQSSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTAT-----------ASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVAL

Query:  CSFASVWGFSFVDFVGKS
             V   S +DFV ++
Subjt:  CSFASVWGFSFVDFVGKS

AT5G43180.1 Protein of unknown function, DUF5993.1e-2633.94Show/hide
Query:  DSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWL-ELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQLFRSGT
        DS ++ LSLL+ VGYH  LW   K  P +T+ GI    R++W  ++    +   M  VQ LRN  M+ IL A+I+I +  S+AA+TNNA+K   L  +  
Subjt:  DSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWL-ELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQLFRSGT

Query:  Q--SSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVS------------------TATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSL
             + ++ +F +KYA+A ++  +SF FSS  + +L+D   L++                  T++   + + +++ GF +A +GNR+M +S  LLLW  
Subjt:  Q--SSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVS------------------TATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSL

Query:  GPIPVALCSFASVWGFSFVDF
        GP+PV   S   VW     DF
Subjt:  GPIPVALCSFASVWGFSFVDF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTTTAAGATGGAAGAATTATACATTGATAGCACATTGATGAGCCTGAGCCTGTTGCTTGTGGTGGGGTATCACGCGCATCTGTGGCAATGCTTGAAGAAGAAACC
TGAGAAGACGACCTGGGGAATCCAGCGGGAGGGCCGGAGAGCGTGGCTAGAGCTGGCGCTGCAGCTGGAGGGTGGCAGCATGCAGGTGGTGCAGATCTTGAGAAACAATC
TGATGATCATAATCCTCAGAGCTTCCATATCAATCGCTGTAAGCTCCTCCGTGGCAGCCCTCACCAACAATGCTTACAAAACCCAACAACTATTCCGAAGCGGAACTCAA
TCGAGCGCCGCCAGTAGTGGGCTGTTCGCTGTGAAATATGCGGCCGCCTTTGTGGTGTCAGTGTCGAGCTTCCTATTCAGCTCATTTGGGGTGGGGTTTCTGATCGACAC
CTGCATGTTGGTCAGCACTGCAACTGCAAGCACCCACATACAGAGACTGGTGGACACAGGGTTCGCCTTGGCTTTTATAGGCAACCGCTTGATGTGGCTCAGTTTTGCCC
TCTTGTTATGGTCCCTTGGTCCTATTCCGGTCGCCCTCTGCTCCTTCGCATCGGTTTGGGGTTTTTCTTTTGTTGATTTTGTTGGCAAATCAGTCAAACACATGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTTTAAGATGGAAGAATTATACATTGATAGCACATTGATGAGCCTGAGCCTGTTGCTTGTGGTGGGGTATCACGCGCATCTGTGGCAATGCTTGAAGAAGAAACC
TGAGAAGACGACCTGGGGAATCCAGCGGGAGGGCCGGAGAGCGTGGCTAGAGCTGGCGCTGCAGCTGGAGGGTGGCAGCATGCAGGTGGTGCAGATCTTGAGAAACAATC
TGATGATCATAATCCTCAGAGCTTCCATATCAATCGCTGTAAGCTCCTCCGTGGCAGCCCTCACCAACAATGCTTACAAAACCCAACAACTATTCCGAAGCGGAACTCAA
TCGAGCGCCGCCAGTAGTGGGCTGTTCGCTGTGAAATATGCGGCCGCCTTTGTGGTGTCAGTGTCGAGCTTCCTATTCAGCTCATTTGGGGTGGGGTTTCTGATCGACAC
CTGCATGTTGGTCAGCACTGCAACTGCAAGCACCCACATACAGAGACTGGTGGACACAGGGTTCGCCTTGGCTTTTATAGGCAACCGCTTGATGTGGCTCAGTTTTGCCC
TCTTGTTATGGTCCCTTGGTCCTATTCCGGTCGCCCTCTGCTCCTTCGCATCGGTTTGGGGTTTTTCTTTTGTTGATTTTGTTGGCAAATCAGTCAAACACATGTAG
Protein sequenceShow/hide protein sequence
MSFKMEELYIDSTLMSLSLLLVVGYHAHLWQCLKKKPEKTTWGIQREGRRAWLELALQLEGGSMQVVQILRNNLMIIILRASISIAVSSSVAALTNNAYKTQQLFRSGTQ
SSAASSGLFAVKYAAAFVVSVSSFLFSSFGVGFLIDTCMLVSTATASTHIQRLVDTGFALAFIGNRLMWLSFALLLWSLGPIPVALCSFASVWGFSFVDFVGKSVKHM