CuGenDBv2

Spg008601 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg008601
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein FAF-like
Genome location: scaffold10:34164435..34167686
RNA-Seq Expression: Spg008601
Synteny: Spg008601
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: IPR021410 - The fantastic four family
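
The genome location string can be split into a sequence name and a coordinate pair. The sketch below is illustrative only: `Location` and `parse_location` are hypothetical helper names, and the length calculation assumes 1-based, inclusive coordinates, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string such as the one shown on this page."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Location(match.group(1), int(match.group(2)), int(match.group(3)))


loc = parse_location("scaffold10:34164435..34167686")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the gene spans 3252 bp on scaffold10.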


Homology
GenBank top hits (hit; e-value; % identity; alignment)
KAG6579018.1 hypothetical protein SDJN03_23466, partial [Cucurbita argyrosperma subsp. sororia]; e-value 1.4e-114; identity 69.8%
Query:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS
        MDYFS +R LFC++KIIVSSFFGFLL ASSFRFPMQ   Q QGPVVEA+RFSISGLK LISS E+ QE G+E EDRVIRS G+GII SKLLT SSS  SS
Subjt:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS

Query:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH
        I SCNLLMDDLIGTESGV LT NTEE EEK T + FD   N TN + FTEQN RC  KKQFPPPIP LA QAG RTR PWILTR   + RLILKLERV +
Subjt:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH

Query:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV
        HQS+ESHRENGRLILNLVP+PV GV    D+QDLQF+EE EG E++ +S+E EEG D   PEIS +SFTYG     GG+F DR+ FCGVNGN EERH VV
Subjt:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV

Query:  HGHFGSAPLRPMADSKLVAARSLEMVSSISILLGDMMQKDKDGAFGVKFEL
        H HF S PLRP+ DS LVA RSLE V S+ I+LG MMQK KD AF   F+L
Subjt:  HGHFGSAPLRPMADSKLVAARSLEMVSSISILLGDMMQKDKDGAFGVKFEL

KAG6602047.1 hypothetical protein SDJN03_07280, partial [Cucurbita argyrosperma subsp. sororia]; e-value 1.4e-111; identity 72.35%
Query:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSISGLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTH--SSSFCSSI
        M YFSGERS F +VK+IVSSF GFLLCA+SFRFPMQ QNQ PV+E +R SISGLK LIS +E    GK  E+RVIR VGVGIIRSKL T+  SSSFC+SI
Subjt:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSISGLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTH--SSSFCSSI

Query:  RSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHH
        RS N LMDDLIGTESGVCL SN  EIEEK TRSD D+  NR NR D          KK+FPPPIPFL A AGHR RPPWILTRNCI+GRLIL LERVRHH
Subjt:  RSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHH

Query:  QSLESHRENGRLILNLVPAPVAGVGTED-DDQDLQFMEEVEGEEKMEESMECEEGTDPEISFKSFTY-------GGGMFSDREPFCGVNGNFEERHVVVH
        QSLESHRENGRLILNLVPA +AGVGTED DDQDLQF+E+ E EEKM+ES+ECEEG DPE  FKSF +       GG M  DR PFCGVNGN  ERH VVH
Subjt:  QSLESHRENGRLILNLVPAPVAGVGTED-DDQDLQFMEEVEGEEKMEESMECEEGTDPEISFKSFTY-------GGGMFSDREPFCGVNGNFEERHVVVH

Query:  GHFGSAPLRPM
        GHFGSAPLR M
Subjt:  GHFGSAPLRPM

KAG7016541.1 hypothetical protein SDJN02_21650, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 1.9e-108; identity 71.3%
Query:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS
        MDYFS +R LFC++KIIVSSFFGFLL ASSFRFPMQ   Q QGPVVEA+RFSISGLK LISS E+ QE G+E EDRVIRS G+GII SKLLT SSS  SS
Subjt:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS

Query:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH
        I SCNLLMDDLIGTESGV LT NTEE EEK T + FD   N TN + FTEQN RC  KKQFPPPIP LA QAG RTR PWILTR   + RLILKLERVR+
Subjt:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH

Query:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV
        HQS+ESHRENGRLILNLVP+PV GV    D+QDLQF+EE EG E++ +S+E EEG D   PEIS +SFTYG     GG+F DR+ FCGVNGN EERH VV
Subjt:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV

Query:  HGHFGSAPLRPMADSKLVAARSLE
        H HF S PLRP+ DS LVA RSLE
Subjt:  HGHFGSAPLRPMADSKLVAARSLE

KAG7032741.1 hypothetical protein SDJN02_06791, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 2.8e-112; identity 73.14%
Query:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSISGLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTH--SSSFCSSI
        M YFSGERS F +VK+IVSSF GFLLCA+SFRFPMQ QNQ PV+E +R SISGLK LIS +E    GK  E+RVIR VGVGIIRSKL T+  SSSFC+SI
Subjt:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSISGLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTH--SSSFCSSI

Query:  RSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHH
        RS N LMDDLIGTESGVCLTSN  EIEEK TRSD DY  NR NR D          KK+FPPPIPFL A AGHR RPPWILTRNCI+GRLIL LERVRHH
Subjt:  RSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHH

Query:  QSLESHRENGRLILNLVPAPVAGVGTED-DDQDLQFMEEVEGEEKMEESMECEEGTDPEISFKSFTY-----GGGMFSDREPFCGVNGNFEERHVVVHGH
        QSLESHRENGRLILNLVPA +AGVGTED DDQ+LQF+E+ E EEKM+ES+ECEEG DPE  FKSF +     GG M  DR PFCGVNGN  ERH VVHGH
Subjt:  QSLESHRENGRLILNLVPAPVAGVGTED-DDQDLQFMEEVEGEEKMEESMECEEGTDPEISFKSFTY-----GGGMFSDREPFCGVNGNFEERHVVVHGH

Query:  FGSAPLRPM
        FGSAPLR M
Subjt:  FGSAPLRPM

XP_022939162.1 uncharacterized protein LOC111445156 [Cucurbita moschata]; e-value 1.8e-103; identity 70.48%
Query:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS
        MDYFS +R LFC++KIIVSSFFGFLL ASSFRFPMQ   Q QGPVVEA+RFSISGLK LISS E+ QE G+E EDRVIRS G+GII SKLLT SSS  SS
Subjt:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS

Query:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH
        I SCNLLMDDLIGTESGV LT NTEE EEK T + FD   N TN + FTEQN RC  KKQFPPPIP LA QAG RTR PWILTR   + RLILKLERV +
Subjt:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH

Query:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV
        HQS+ESHRENGRLILNLVP+PV GV    D+QDLQF+EE EG E++ +S+E EEG D   PEIS +SFTYG     GG+F DR+ FCGVNGN EERH VV
Subjt:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV

Query:  HGHFGSAPLRPMADS
        H HF S PLRP+  S
Subjt:  HGHFGSAPLRPMADS
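
In the alignment blocks on this page, the unlabelled middle row carries the match information: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. As a rough sketch, identity can be tallied from a Query row and its midline as below. `midline_stats` is a hypothetical helper, and dividing by the full aligned width (gaps included) is an assumption about how the reported % identity is defined.

```python
from typing import Tuple


def midline_stats(query_row: str, midline: str) -> Tuple[int, int, int]:
    """Return (identical, positives, aligned_width) for one alignment row pair."""
    width = len(query_row)
    # Trailing spaces in the midline are often lost in copy/paste; pad them back.
    midline = midline.ljust(width)[:width]
    identical = sum(
        1 for q, m in zip(query_row, midline) if m.isalpha() and m == q
    )
    positives = identical + midline.count("+")
    return identical, positives, width


# Toy example, not taken from a specific hit on this page.
identical, positives, width = midline_stats("MDYFSGER", "MDYFS +R")
percent_identity = 100.0 * identical / width
print(identical, positives, width, percent_identity)
```

For a full hit, the counts from every row pair would be summed before dividing, since each hit is wrapped over several blocks.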

TrEMBL top hits (hit; e-value; % identity; alignment)
A0A0A0KRM2 Uncharacterized protein; e-value 8.4e-86; identity 61.54%
Query:  MDYFS-GERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSIS-GLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTH--------
        MDYFS G+ +LF K+KIIVSSFF FLLC+S+FRFP+QK N+ P +  L FSIS GLK+LISS +     + TED VIRSVG+ IIRS LLTH        
Subjt:  MDYFS-GERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSIS-GLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTH--------

Query:  ------SSSFC-SSIRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQ-AGHRTRPPWILTRN
              SSSFC SSIRSCN  MDDLIGTESGVCLTSN+EE+E   T SDFD    RT+R+ F  QN RC  KKQFPPPIPF+A Q AG+R R PW+LTR 
Subjt:  ------SSSFC-SSIRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQ-AGHRTRPPWILTRN

Query:  CINGRLILKLERVRHHQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMEC----EEGTDPEISFKSFTYG--GGMFSDREPFC
          N RLILKLERV  HQSLES RENGRLILNLVP     +   D D     +EE EG E++ ES++C    +EGTD EIS +S+TYG  GG       FC
Subjt:  CINGRLILKLERVRHHQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMEC----EEGTDPEISFKSFTYG--GGMFSDREPFC

Query:  GVNGNFEERHVVVHGHFGSAPLRPM
        G NGNFEERH VVHGHFGSAPLRPM
Subjt:  GVNGNFEERHVVVHGHFGSAPLRPM

A0A5D3DS64 Protein FAF-like; e-value 4.9e-78; identity 60.44%
Query:  MDYFS-GERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQ-GPVVEALRFSIS-GLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS
        MDYFS G+ +LF K+KIIVSSFF FLLC+S+FRFP+QK NQ  P++  L FSIS GLK+LISS +     + +ED +I SVG+ I               
Subjt:  MDYFS-GERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQ-GPVVEALRFSIS-GLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS

Query:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQ-AGHRTRPPWILTRNCINGRLILKLERVR
        IRSCN  MDDLIGTESGVCLTSN+EE+E   T SDFD R  RTNR++F  QN RC  KKQ+PPPIPF+A Q AG+R R PWILTR   N RLILKLERV 
Subjt:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQ-AGHRTRPPWILTRNCINGRLILKLERVR

Query:  HHQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQ-FMEEVEGEEKMEESMEC----EEGTDPEISFKSFTY------GGGMFSDREPFCGVNGNFEER
         HQSLES RENGRLILNLVP        ED    LQ  +EE EG+E++ ES+EC    +EGTD EISFKS TY      GGG       FCGVNGNFEER
Subjt:  HHQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQ-FMEEVEGEEKMEESMEC----EEGTDPEISFKSFTY------GGGMFSDREPFCGVNGNFEER

Query:  HVVVHGHFGSAPLRPM
        H VVHGH GSAPLRPM
Subjt:  HVVVHGHFGSAPLRPM

A0A6J1CJZ0 uncharacterized protein LOC111011861; e-value 3.4e-95; identity 65.83%
Query:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSISGLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSSIRS
        MDYFSGER LF K++IIVSSFFGFLLCAS+F  PMQ   + PV E++RFSISGLK LISS+E +E G+E   R+IRS GVGIIRSKLLTHSSS  SSIRS
Subjt:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSISGLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSSIRS

Query:  CNLLMDDLIGTESGVCLTSN-TEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHHQ
        C LLMDDLIGTESGVCLT++  EEI+EKS+    DYR +R +  D  +QN RCAA+KQFPPPI FLAAQAG RTRPPW+LTR+C +GRL L LERVRH Q
Subjt:  CNLLMDDLIGTESGVCLTSN-TEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHHQ

Query:  SLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKME------ESMECEEG------TDPEISFKSFTY-----GGGMFSDREPFCGVNGNF
         +ESHRENGRLIL  VPAP+    +EDDD+DLQF+EE  G EK        ES+ECEEG      +DP  +FKSFTY     GGGMF DR+PFC V    
Subjt:  SLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKME------ESMECEEG------TDPEISFKSFTY-----GGGMFSDREPFCGVNGNF

Query:  EERHVVVHGHFGSAPLRPM
         +RH VV+GHFGSAPLRPM
Subjt:  EERHVVVHGHFGSAPLRPM

A0A6J1FG13 uncharacterized protein LOC111445156; e-value 8.9e-104; identity 70.48%
Query:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS
        MDYFS +R LFC++KIIVSSFFGFLL ASSFRFPMQ   Q QGPVVEA+RFSISGLK LISS E+ QE G+E EDRVIRS G+GII SKLLT SSS  SS
Subjt:  MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQK--QNQGPVVEALRFSISGLKTLISSDED-QEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSS

Query:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH
        I SCNLLMDDLIGTESGV LT NTEE EEK T + FD   N TN + FTEQN RC  KKQFPPPIP LA QAG RTR PWILTR   + RLILKLERV +
Subjt:  IRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRH

Query:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV
        HQS+ESHRENGRLILNLVP+PV GV    D+QDLQF+EE EG E++ +S+E EEG D   PEIS +SFTYG     GG+F DR+ FCGVNGN EERH VV
Subjt:  HQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYG-----GGMFSDREPFCGVNGNFEERHVVV

Query:  HGHFGSAPLRPMADS
        H HF S PLRP+  S
Subjt:  HGHFGSAPLRPMADS

A0A6J1JWW3 uncharacterized protein LOC111489065; e-value 6.7e-59; identity 65.37%
Query:  MDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHHQSLESH
        MDDLIGTE+ V LTSNTEE EEK T + FD   N  N + FTE N RC  KKQFPPPIP LA QAG +TR PWILTR   + RLILKLERV +HQS+ESH
Subjt:  MDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHHQSLESH

Query:  RENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYGG-----GMFSDREPFCGVNGNFEERHVVVHGHFGSA
        RENGRLILNLVP+PV GV     +QDLQF+EE EG E++ +S+E E G D   PEIS +SFTYGG     G+F DR+ FCGVNGN EERH VVHGHF S 
Subjt:  RENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTD---PEISFKSFTYGG-----GMFSDREPFCGVNGNFEERHVVVHGHFGSA

Query:  PLRPM
        PLRP+
Subjt:  PLRPM

SwissProt top hits (hit; e-value; % identity; alignment)
No hits found
Arabidopsis top hits (hit; e-value; % identity; alignment)
AT1G22110.1 structural constituent of ribosome; e-value 2.9e-14; identity 33.52%
Query:  IRSKLLTHSSSFCSSIRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRN
        + S +L+ +SS  SS       + D IGTES   + S  E  +  S  S+    R R       E+  R AA ++FPPPIP LA         PW+L R 
Subjt:  IRSKLLTHSSSFCSSIRSCNLLMDDLIGTESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRN

Query:  CI-NGRLILKLERVRHHQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTDPEI
           +GRLIL+ E+VRHH+   ++R NGRL L+LVP          +    Q  +E E +++ ++  EC++    E+
Subjt:  CI-NGRLILKLERVRHHQSLESHRENGRLILNLVPAPVAGVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTDPEI


Sequences
CDS sequence
ATGGATTATTTCTCTGGCGAACGCAGCTTGTTTTGCAAGGTTAAGATAATCGTTTCGTCTTTCTTCGGCTTCCTTCTCTGCGCATCGAGTTTTCGCTTCCCCATGCAAAA
GCAGAACCAAGGTCCAGTTGTCGAAGCACTACGATTCTCAATTTCTGGACTCAAAACCCTAATTTCTTCCGATGAAGATCAAGAACCAGGAAAAGAGACGGAGGATCGAG
TCATTCGCAGTGTTGGAGTCGGCATTATCAGGTCAAAGCTGTTGACTCATTCTTCTTCTTTTTGTTCTTCGATCAGATCTTGTAATCTGTTAATGGACGATTTGATCGGA
ACTGAGAGCGGTGTGTGCTTGACTTCGAATACGGAAGAAATCGAAGAGAAATCAACTCGTTCCGATTTCGATTATCGTCGCAATCGCACTAATCGCTACGATTTTACTGA
GCAAAATCTGCGATGTGCGGCGAAAAAGCAGTTTCCACCGCCGATTCCTTTCCTAGCGGCACAGGCAGGGCATCGAACTCGTCCGCCGTGGATTTTAACAAGAAATTGCA
TCAATGGAAGGTTGATTTTGAAACTGGAGAGAGTGAGGCACCATCAGTCCTTGGAATCGCACCGCGAAAATGGCCGTCTAATTCTCAATCTCGTTCCGGCGCCGGTCGCG
GGCGTCGGTACTGAGGACGACGATCAGGATCTTCAATTCATGGAAGAGGTCGAAGGAGAAGAGAAGATGGAAGAATCGATGGAATGCGAAGAAGGTACAGATCCTGAAAT
TTCATTTAAGAGTTTTACGTACGGCGGCGGAATGTTCAGTGATCGGGAACCGTTTTGTGGTGTAAATGGGAATTTTGAAGAACGACATGTCGTCGTCCATGGACATTTCG
GTTCGGCTCCTCTTCGTCCGATGGCTGACTCTAAACTCGTAGCGGCTCGGTCCTTGGAGATGGTGTCATCTATCAGCATCCTTCTTGGTGATATGATGCAGAAAGACAAA
GATGGAGCCTTTGGTGTCAAATTTGAACTGAATGTGATTTCAACAACTTGTGGAAGAGACAAGGGAGACCATCCTAGGACGCCAAGAAGCTATGAAGGCGAATGCTCCCC
TCGGGATGACCATGAGAGTCGATTATTGGCGTTTAGAAGCTTCCTGGATGCTCGGACGTTCTTGACAGTCATCATGAGTTTCTGTCTCTCCCCTTCTACCTCTGCAAACT
GA
mRNA sequence
ATGGATTATTTCTCTGGCGAACGCAGCTTGTTTTGCAAGGTTAAGATAATCGTTTCGTCTTTCTTCGGCTTCCTTCTCTGCGCATCGAGTTTTCGCTTCCCCATGCAAAA
GCAGAACCAAGGTCCAGTTGTCGAAGCACTACGATTCTCAATTTCTGGACTCAAAACCCTAATTTCTTCCGATGAAGATCAAGAACCAGGAAAAGAGACGGAGGATCGAG
TCATTCGCAGTGTTGGAGTCGGCATTATCAGGTCAAAGCTGTTGACTCATTCTTCTTCTTTTTGTTCTTCGATCAGATCTTGTAATCTGTTAATGGACGATTTGATCGGA
ACTGAGAGCGGTGTGTGCTTGACTTCGAATACGGAAGAAATCGAAGAGAAATCAACTCGTTCCGATTTCGATTATCGTCGCAATCGCACTAATCGCTACGATTTTACTGA
GCAAAATCTGCGATGTGCGGCGAAAAAGCAGTTTCCACCGCCGATTCCTTTCCTAGCGGCACAGGCAGGGCATCGAACTCGTCCGCCGTGGATTTTAACAAGAAATTGCA
TCAATGGAAGGTTGATTTTGAAACTGGAGAGAGTGAGGCACCATCAGTCCTTGGAATCGCACCGCGAAAATGGCCGTCTAATTCTCAATCTCGTTCCGGCGCCGGTCGCG
GGCGTCGGTACTGAGGACGACGATCAGGATCTTCAATTCATGGAAGAGGTCGAAGGAGAAGAGAAGATGGAAGAATCGATGGAATGCGAAGAAGGTACAGATCCTGAAAT
TTCATTTAAGAGTTTTACGTACGGCGGCGGAATGTTCAGTGATCGGGAACCGTTTTGTGGTGTAAATGGGAATTTTGAAGAACGACATGTCGTCGTCCATGGACATTTCG
GTTCGGCTCCTCTTCGTCCGATGGCTGACTCTAAACTCGTAGCGGCTCGGTCCTTGGAGATGGTGTCATCTATCAGCATCCTTCTTGGTGATATGATGCAGAAAGACAAA
GATGGAGCCTTTGGTGTCAAATTTGAACTGAATGTGATTTCAACAACTTGTGGAAGAGACAAGGGAGACCATCCTAGGACGCCAAGAAGCTATGAAGGCGAATGCTCCCC
TCGGGATGACCATGAGAGTCGATTATTGGCGTTTAGAAGCTTCCTGGATGCTCGGACGTTCTTGACAGTCATCATGAGTTTCTGTCTCTCCCCTTCTACCTCTGCAAACT
GA
Protein sequence
MDYFSGERSLFCKVKIIVSSFFGFLLCASSFRFPMQKQNQGPVVEALRFSISGLKTLISSDEDQEPGKETEDRVIRSVGVGIIRSKLLTHSSSFCSSIRSCNLLMDDLIG
TESGVCLTSNTEEIEEKSTRSDFDYRRNRTNRYDFTEQNLRCAAKKQFPPPIPFLAAQAGHRTRPPWILTRNCINGRLILKLERVRHHQSLESHRENGRLILNLVPAPVA
GVGTEDDDQDLQFMEEVEGEEKMEESMECEEGTDPEISFKSFTYGGGMFSDREPFCGVNGNFEERHVVVHGHFGSAPLRPMADSKLVAARSLEMVSSISILLGDMMQKDK
DGAFGVKFELNVISTTCGRDKGDHPRTPRSYEGECSPRDDHESRLLAFRSFLDARTFLTVIMSFCLSPSTSAN
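
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code; `translate` is a hypothetical helper, not a function provided by CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped sequence
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First six codons and last three codons of the CDS shown on this page.
print(translate("ATGGATTATTTCTCTGGC"))
print(translate("GCAAACTGA"))
```

The first six codons translate to MDYFSG and the final codons to AN followed by a stop, which agrees with the start and end of the listed protein sequence.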