; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

HG10012226 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene IDHG10012226
OrganismLagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Descriptionproline-, glutamic acid- and leucine-rich protein 1-like isoform X1
Genome locationChr01:19058614..19059505
RNA-Seq ExpressionHG10012226
SyntenyHG10012226
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6606253.1 hypothetical protein SDJN03_03570, partial [Cucurbita argyrosperma subsp. sororia]6.5e-8367.29Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRLR+IKDRLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEHDE-------VNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH+E          KK +CCKEEED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEHDE-------VNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

XP_022930995.1 uncharacterized protein LOC111437321 isoform X2 [Cucurbita moschata]1.1e-8268.18Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRLR+IKDRLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH     DE  TKK  CCKE+ED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

XP_022995232.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X1 [Cucurbita maxima]2.2e-8367.92Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRLR+IK+RLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEH-----DEVNTKKR-DCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH     DE  TKK+ +CCK+EED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEH-----DEVNTKKR-DCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

XP_022995233.1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X2 [Cucurbita maxima]5.0e-8368.18Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRLR+IK+RLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH     DE  TKK  CCK+EED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

XP_038888901.1 uncharacterized protein LOC120078676 [Benincasa hispida]2.0e-10081.32Show/hide
Query:  MSNIIQESSEPQNQEESFDPF--RFSTLCLNSSAVDPPLCSSCARRQPRLASTPMKRPSPT-PSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHSV
        MSN+IQESSEPQN EE FDPF  RFSTLCLN SAVDP LCSSCARR PR A+TPMKRP+PT P QHP    SK  FLDHQQP+ST FSKIDLPIPFD SV
Subjt:  MSNIIQESSEPQNQEESFDPF--RFSTLCLNSSAVDPPLCSSCARRQPRLASTPMKRPSPT-PSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHSV

Query:  SPLRRSFSDPTEALNFSP----QSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISEEE
         PLRRS SDPTEA NFSP    QSPAKRLCLNSPLPPLPLRRTVSDPNPSPE T DSPIKIGK       DNPESKRLRRIKDRLKEMNQWWNEV+SEE+
Subjt:  SPLRRSFSDPTEALNFSP----QSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISEEE

Query:  HDEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
         DE  TKK DC KEEE+DEETVGVERVGDSL L LKCSCGKGFEILLSGRSCFYKLL
Subjt:  HDEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

TrEMBL top hitse value%identityAlignment
A0A0A0LI25 Uncharacterized protein1.6e-7968.91Show/hide
Query:  QEESFDPFR-FSTLCLN---SSAVDPPLCSSCARRQPRLASTPMKRPSPTP--SQHPST-TTSKKQFLDHQQPNSTPFSKIDLPIPFDHSVSPLRRSFSD
        QE+ +DPF+ FSTLCLN   SSAVDP LCSSC R   R ++TPMKRPSPTP  SQ  ST TTSK   LD QQPNS PFSKI+LPIPF  SVSPLRRS SD
Subjt:  QEESFDPFR-FSTLCLN---SSAVDPPLCSSCARRQPRLASTPMKRPSPTP--SQHPST-TTSKKQFLDHQQPNSTPFSKIDLPIPFDHSVSPLRRSFSD

Query:  PTEALNFSP----QSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISEEE--HDEVNTK
        PT+A NFSP    QSPAKRLCLNSPLPPLPLRRTVSDPNP+PE T DSPIKI K       D+PESKRL+RIKDRLKEMN WWNEV+SEEE  +DE   K
Subjt:  PTEALNFSP----QSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISEEE--HDEVNTK

Query:  K-----------RDCCKEEE------DDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        K           RD  +EEE      DDEETVGVERVGDS+ L+LKCSCGK F+ILLSGR+CFYKLL
Subjt:  K-----------RDCCKEEE------DDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

A0A6J1ET23 proline-, glutamic acid- and leucine-rich protein 1-like isoform X19.2e-8367.92Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRLR+IKDRLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEH-----DEVNTKK-RDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH     DE  TKK  +CCKE+ED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEH-----DEVNTKK-RDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

A0A6J1EYB4 uncharacterized protein LOC111437321 isoform X25.4e-8368.18Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P T T+KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS + T  SP+ IG+ ND I ED+P+SKRLR+IKDRLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH     DE  TKK  CCKE+ED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

A0A6J1JY87 proline-, glutamic acid- and leucine-rich protein 1-like isoform X11.1e-8367.92Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRLR+IK+RLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEH-----DEVNTKKR-DCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH     DE  TKK+ +CCK+EED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEH-----DEVNTKKR-DCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

A0A6J1K7B1 proline-, glutamic acid- and leucine-rich protein 1-like isoform X22.4e-8368.18Show/hide
Query:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-
        MSN+IQES+EPQN E+     RFSTLCLN        PPLCSSC RR PR A+T  KR SPT  Q P+ TT KK  LD +Q N T FSKIDLPIPF  S 
Subjt:  MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVD---PPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHS-

Query:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE
              SPL RS SDPTEA NFSP SPAKRLC NS LPPLPLRRTVSDP PS E T +SP+ IG+ ND I ED+P+SKRLR+IK+RLKEMN+WWNEV+SE
Subjt:  -----VSPLRRSFSDPTEALNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISE

Query:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
        +EH     DE  TKK  CCK+EED+EETVGVERVGDSL LRLKC CGKGFEILLSG SCFYKLL
Subjt:  EEH-----DEVNTKKRDCCKEEEDDEETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G32235.1 unknown protein1.3e-1232.23Show/hide
Query:  STPMKRPSPTPSQHPSTTTSKKQFL----DHQQPNSTPFSKIDLP-IPFDHSV--SPL-RRSFSDP----------TEALNFSPQSPAKRLCLNS----P
        ++P+KRPSP  S+       KK F+    + + PN   +SKI LP + F+ +   SPL +RS SD           +    ++  S A+     S     
Subjt:  STPMKRPSPTPSQHPSTTTSKKQFL----DHQQPNSTPFSKIDLP-IPFDHSV--SPL-RRSFSDP----------TEALNFSPQSPAKRLCLNS----P

Query:  LPPLP--LRRTVSDPNPSPENTFDSPIKIGKSNDLIIED--NPES----KRLRRIKDRLKEMNQWWNEVISEEEHDEVNTKKRDCCKEEEDD--------
        LPP P   RR+VSD +P+P +   S +   +SN +   D  NPES    K L  IKD ++E++QW N+++   E     + K+D   +  D+        
Subjt:  LPPLP--LRRTVSDPNPSPENTFDSPIKIGKSNDLIIED--NPES----KRLRRIKDRLKEMNQWWNEVISEEEHDEVNTKKRDCCKEEEDD--------

Query:  ---EETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL
           +E V V R+G++ V+ + C CG+ ++ L SGR C+YKLL
Subjt:  ---EETVGVERVGDSLVLRLKCSCGKGFEILLSGRSCFYKLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCAATATAATTCAAGAATCTTCAGAACCCCAAAACCAAGAAGAATCTTTCGATCCTTTCCGTTTCTCCACCCTTTGTCTCAACTCCTCCGCCGTCGACCCTCCACT
CTGTTCTTCGTGCGCTCGCCGTCAACCTCGCCTCGCATCCACTCCCATGAAACGCCCTTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCTCCAAGAAGCAATTTC
TTGATCATCAACAACCCAATTCCACCCCTTTCTCCAAGATCGATCTCCCCATTCCTTTTGATCATTCTGTTTCCCCTCTCCGCCGCTCTTTTTCCGACCCCACCGAAGCC
CTGAATTTCTCCCCTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCCAAATCCGTCCCCTGAAAATAC
TTTCGATTCCCCAATTAAAATTGGGAAATCCAACGATTTGATCATAGAAGACAACCCCGAATCAAAGAGACTTAGAAGGATCAAGGATCGATTGAAGGAGATGAATCAGT
GGTGGAACGAAGTGATAAGTGAAGAAGAACACGATGAAGTTAATACAAAAAAGAGAGATTGTTGCAAGGAAGAAGAAGATGATGAAGAAACAGTGGGAGTGGAAAGAGTG
GGAGATTCATTGGTGCTACGTTTAAAGTGTTCATGTGGGAAAGGATTTGAGATTCTTCTTTCTGGGAGAAGCTGTTTCTACAAGCTGCTGTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGCAATATAATTCAAGAATCTTCAGAACCCCAAAACCAAGAAGAATCTTTCGATCCTTTCCGTTTCTCCACCCTTTGTCTCAACTCCTCCGCCGTCGACCCTCCACT
CTGTTCTTCGTGCGCTCGCCGTCAACCTCGCCTCGCATCCACTCCCATGAAACGCCCTTCCCCCACGCCGTCGCAACACCCCTCCACCACCACCTCCAAGAAGCAATTTC
TTGATCATCAACAACCCAATTCCACCCCTTTCTCCAAGATCGATCTCCCCATTCCTTTTGATCATTCTGTTTCCCCTCTCCGCCGCTCTTTTTCCGACCCCACCGAAGCC
CTGAATTTCTCCCCTCAGTCCCCTGCAAAACGGTTATGTCTCAACTCACCCCTGCCGCCTCTGCCTCTCCGGCGTACTGTCTCTGACCCAAATCCGTCCCCTGAAAATAC
TTTCGATTCCCCAATTAAAATTGGGAAATCCAACGATTTGATCATAGAAGACAACCCCGAATCAAAGAGACTTAGAAGGATCAAGGATCGATTGAAGGAGATGAATCAGT
GGTGGAACGAAGTGATAAGTGAAGAAGAACACGATGAAGTTAATACAAAAAAGAGAGATTGTTGCAAGGAAGAAGAAGATGATGAAGAAACAGTGGGAGTGGAAAGAGTG
GGAGATTCATTGGTGCTACGTTTAAAGTGTTCATGTGGGAAAGGATTTGAGATTCTTCTTTCTGGGAGAAGCTGTTTCTACAAGCTGCTGTAG
Protein sequenceShow/hide protein sequence
MSNIIQESSEPQNQEESFDPFRFSTLCLNSSAVDPPLCSSCARRQPRLASTPMKRPSPTPSQHPSTTTSKKQFLDHQQPNSTPFSKIDLPIPFDHSVSPLRRSFSDPTEA
LNFSPQSPAKRLCLNSPLPPLPLRRTVSDPNPSPENTFDSPIKIGKSNDLIIEDNPESKRLRRIKDRLKEMNQWWNEVISEEEHDEVNTKKRDCCKEEEDDEETVGVERV
GDSLVLRLKCSCGKGFEILLSGRSCFYKLL