CuGenDBv2

Lsi09G010840 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi09G010840
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: lysine-rich arabinogalactan protein 19
Genome location: chr09:17211575..17212834
RNA-Seq Expression: Lsi09G010840
Synteny: Lsi09G010840
Gene Ontology terms: GO:0005886 - plasma membrane (cellular component)
InterPro domains: IPR038793 - Lysine-rich arabinogalactan protein 19
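As a small illustration, the genome location string can be split into its parts and the span length computed. This is only a sketch: the function name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr09:17211575..17212834")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the locus spans 1260 bp.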


Homology
GenBank top hits (columns: hit, E-value, % identity, alignment)
KGN61400.1 hypothetical protein Csa_005957 [Cucumis sativus]; E-value 2.3e-79; identity 83.48%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP
        MASKSW FF+ICLAFLF S+  QAPAQPPSLPTVSPPP+TPPPT  T PPLSTPPPT VAPSTSPTIP   PPQSSP+S PSLPPALPPP ATPPPLPPP
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP

Query:  LPPPQVSPTPSQTPPVPAPTLA--PLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSPTP  TPPV APTLA  PLPTPPTP PSPPTPAPVA PE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQTPPVPAPTLA--PLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN
        NGG+P+K KVG W++V  GF  LVATTGFN
Subjt:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN

XP_008457186.2 PREDICTED: lysine-rich arabinogalactan protein 19 [Cucumis melo]; E-value 6.8e-79; identity 83.91%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP
        MASK W F +ICLAFLF SAI QAPAQPPSLPTVSPPPSTPPPT  TPPPLSTPPPT VAPSTSP IP   PPQSSP+S PSLPPA PPP ATPPPLPPP
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP

Query:  LPPPQVSPTPSQTPPVPAPTLAPLPTP--PTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSP P  TPPV APTLAPLP P  PTP PSPPTPAPVA PELSPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQTPPVPAPTLAPLPTP--PTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN
        + G+PMKGKVGTWT+V  GF  LVATTGFN
Subjt:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN

XP_011649058.1 lysine-rich arabinogalactan protein 19 [Cucumis sativus]; E-value 2.3e-79; identity 83.48%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP
        MASKSW FF+ICLAFLF S+  QAPAQPPSLPTVSPPP+TPPPT  T PPLSTPPPT VAPSTSPTIP   PPQSSP+S PSLPPALPPP ATPPPLPPP
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP

Query:  LPPPQVSPTPSQTPPVPAPTLA--PLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSPTP  TPPV APTLA  PLPTPPTP PSPPTPAPVA PE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQTPPVPAPTLA--PLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN
        NGG+P+K KVG W++V  GF  LVATTGFN
Subjt:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN

XP_022158972.1 lysine-rich arabinogalactan protein 19 [Momordica charantia]; E-value 2.5e-73; identity 78.88%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPL-STPPP-TVAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPP
        MASKSWP  +I LAFLF  AIGQAPAQPPSLPT S PPST  PT  TPPPL STPPP  VAPSTSPT+PPPQPPQ +P+S PSLPP LPPP ATPPPLP 
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPL-STPPP-TVAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPP

Query:  PLPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN
          PPPQVSPTP+Q P VPAPTLAPLP PP P+P PPTPAPVA P L  SPAPAPGKH+ +RHHKHKKH APAPAPT+PSPPAPPTVVDSQDSTAPAPSPN
Subjt:  PLPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN

Query:  LNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF
        LNGG+PM+GKV TWTRVV GF +LVATT  NF
Subjt:  LNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF

XP_038875574.1 lysine-rich arabinogalactan protein 19-like [Benincasa hispida]; E-value 8.0e-88; identity 88.65%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP
        MASKSWPF +ICL FLFP AIGQAP QPP+LPTVSPPPSTPPPT TTPPPLSTPPPT VAPSTSPTIPPPQP    PIS PSLPPALPPP ATPPPLPPP
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP

Query:  LPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNG
        LPPPQVSPTPSQ PPVPAPTLAPL TPPTPMPSPPTPAP+AVPEL+PAPAPGKHK RRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNG
Subjt:  LPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNG

Query:  GEPMKGKVGTWTRVVFGFTLLVATTGFNF
        G+PMKGKV T +RVV GF  LVATTGFNF
Subjt:  GEPMKGKVGTWTRVVFGFTLLVATTGFNF

TrEMBL top hits (columns: hit, E-value, % identity, alignment)
A0A0A0LHJ4 Uncharacterized protein; E-value 1.1e-79; identity 83.48%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP
        MASKSW FF+ICLAFLF S+  QAPAQPPSLPTVSPPP+TPPPT  T PPLSTPPPT VAPSTSPTIP   PPQSSP+S PSLPPALPPP ATPPPLPPP
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP

Query:  LPPPQVSPTPSQTPPVPAPTLA--PLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSPTP  TPPV APTLA  PLPTPPTP PSPPTPAPVA PE SPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQTPPVPAPTLA--PLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN
        NGG+P+K KVG W++V  GF  LVATTGFN
Subjt:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN

A0A1S3C506 lysine-rich arabinogalactan protein 19; E-value 3.3e-79; identity 83.91%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP
        MASK W F +ICLAFLF SAI QAPAQPPSLPTVSPPPSTPPPT  TPPPLSTPPPT VAPSTSP IP   PPQSSP+S PSLPPA PPP ATPPPLPPP
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPT-VAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPP

Query:  LPPPQVSPTPSQTPPVPAPTLAPLPTP--PTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL
        LPPPQVSP P  TPPV APTLAPLP P  PTP PSPPTPAPVA PELSPAPAPGKHK RR HKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSP+L
Subjt:  LPPPQVSPTPSQTPPVPAPTLAPLPTP--PTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNL

Query:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN
        + G+PMKGKVGTWT+V  GF  LVATTGFN
Subjt:  NGGEPMKGKVGTWTRVVFGFTLLVATTGFN

A0A6J1DYJ7 lysine-rich arabinogalactan protein 19; E-value 1.2e-73; identity 78.88%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPL-STPPP-TVAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPP
        MASKSWP  +I LAFLF  AIGQAPAQPPSLPT S PPST  PT  TPPPL STPPP  VAPSTSPT+PPPQPPQ +P+S PSLPP LPPP ATPPPLP 
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPL-STPPP-TVAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPP

Query:  PLPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN
          PPPQVSPTP+Q P VPAPTLAPLP PP P+P PPTPAPVA P L  SPAPAPGKH+ +RHHKHKKH APAPAPT+PSPPAPPTVVDSQDSTAPAPSPN
Subjt:  PLPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPEL--SPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPN

Query:  LNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF
        LNGG+PM+GKV TWTRVV GF +LVATT  NF
Subjt:  LNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF

A0A6J1EVZ4 lysine-rich arabinogalactan protein 19-like; E-value 5.4e-66; identity 71.97%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPTVAPSTSPTIPPPQPPQSSPISPPSLPPA----------LPPPR
        MASKS    +ICLAFLFP AI QAPAQPPSLPT +PPPSTPPPT T  P    PP   APS +PTIPPPQPPQ+ P+S PSLPP           LPPP 
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPTVAPSTSPTIPPPQPPQSSPISPPSLPPA----------LPPPR

Query:  ATPPPLPPPLPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ-DST
        ATPPPLPPPLP PQ+SPTP Q        +APLPTPPTP+PSPP  APVAVPEL+PAPAPGKHK RR HKHKKHQAPAPAP VPSPPAPPTVVDSQ DST
Subjt:  ATPPPLPPPLPPPQVSPTPSQTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQ-DST

Query:  APAPSPNLNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF
        APAPSPNLNGG+PMK KVG W RVV GF ++VA+T FNF
Subjt:  APAPSPNLNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF

A0A6J1IGY8 lysine-rich arabinogalactan protein 19-like; E-value 1.4e-66; identity 71.78%
Query:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPTVAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPL----
        MASKS P  +ICLAFLFP AI QAPAQPPSLPT +PPPSTPPPT T  P    PP   APS +PTIPPPQPPQ+ P+S PSLPP LPPP  TPPPL    
Subjt:  MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPTVAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPL----

Query:  --PPPLPPPQVSPTPSQTPPVPAP-------TLAPLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQD
          PPPLPPP  +P P   PP+PAP        +APLPTPPTP+PS P  APVAVPEL+PAPAPGKHK RR HKHKKHQAPAPAP VPSPPAPPTVVDSQD
Subjt:  --PPPLPPPQVSPTPSQTPPVPAP-------TLAPLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQD

Query:  STAPAPSPNLNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF
        STAPAPSPNLNGG+PMK KVGTW R+V GF ++VATT F+F
Subjt:  STAPAPSPNLNGGEPMKGKVGTWTRVVFGFTLLVATTGFNF

SwissProt top hits (columns: hit, E-value, % identity, alignment)
Q9S740 Lysine-rich arabinogalactan protein 19; E-value 5.3e-18; identity 43.48%
Query:  WPFFLICLAFLFPSAIGQAP-AQPPSLPTVSPPPST--------PPPTATTPP------------------PLSTPPPTVAPSTSPTIPPPQPPQSSPIS
        W   L        S   Q P A P +  T +PPP+T        PPPT TTPP                  P S P P VAP  SP  PPPQPPQS P S
Subjt:  WPFFLICLAFLFPSAIGQAP-AQPPSLPTVSPPPST--------PPPTATTPP------------------PLSTPPPTVAPSTSPTIPPPQPPQSSPIS

Query:  PPSLPPALPPPRATPPPLPPPLPPPQVSPTPSQTPPVPA-PTLAPLPTPPTPMPSPP--TPAPVAVPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVP
         P++         +PPP+ PP  P    PTP+  PP PA P  AP   PP P+  PP   P+P+++P  +PAPAP KHKR+  HKHK+ H APAPAP  P
Subjt:  PPSLPPALPPPRATPPPLPPPLPPPQVSPTPSQTPPVPA-PTLAPLPTPPTPMPSPP--TPAPVAVPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVP

Query:  SPPAPPTVVDSQDSTAPAPSPNLNGG---EPMKGKVGTWTRVVFGFTLLVATT
        SPP+PP + D QD TAPAPSPN NGG     +KG+   W         L+A T
Subjt:  SPPAPPTVVDSQDSTAPAPSPNLNGG---EPMKGKVGTWTRVVFGFTLLVATT

Arabidopsis top hits (columns: hit, E-value, % identity, alignment)
AT1G68725.1 arabinogalactan protein 19; E-value 3.8e-19; identity 43.48%
Query:  WPFFLICLAFLFPSAIGQAP-AQPPSLPTVSPPPST--------PPPTATTPP------------------PLSTPPPTVAPSTSPTIPPPQPPQSSPIS
        W   L        S   Q P A P +  T +PPP+T        PPPT TTPP                  P S P P VAP  SP  PPPQPPQS P S
Subjt:  WPFFLICLAFLFPSAIGQAP-AQPPSLPTVSPPPST--------PPPTATTPP------------------PLSTPPPTVAPSTSPTIPPPQPPQSSPIS

Query:  PPSLPPALPPPRATPPPLPPPLPPPQVSPTPSQTPPVPA-PTLAPLPTPPTPMPSPP--TPAPVAVPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVP
         P++         +PPP+ PP  P    PTP+  PP PA P  AP   PP P+  PP   P+P+++P  +PAPAP KHKR+  HKHK+ H APAPAP  P
Subjt:  PPSLPPALPPPRATPPPLPPPLPPPQVSPTPSQTPPVPA-PTLAPLPTPPTPMPSPP--TPAPVAVPELSPAPAPGKHKRRRHHKHKK-HQAPAPAPTVP

Query:  SPPAPPTVVDSQDSTAPAPSPNLNGG---EPMKGKVGTWTRVVFGFTLLVATT
        SPP+PP + D QD TAPAPSPN NGG     +KG+   W         L+A T
Subjt:  SPPAPPTVVDSQDSTAPAPSPNLNGG---EPMKGKVGTWTRVVFGFTLLVATT
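The percent-identity figures reported for each hit can be approximated from the alignment blocks themselves. The sketch below counts identities from the unlabeled midline row of a block, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The function name `midline_identity` is hypothetical. It relies on the midline rather than the Subjt rows because, in this extract, the Subjt rows appear identical to the Query rows.

```python
def midline_identity(query: str, midline: str) -> tuple[int, int]:
    """Return (identities, alignment_length) for one alignment block.

    A column counts as an identity when the midline shows the same
    letter as the query; '+' and spaces are not identities.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identities = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return identities, len(query)

# Last block of the first GenBank hit, copied from the alignment above.
query = "NGGEPMKGKVGTWTRVVFGFTLLVATTGFN"
midline = "NGG+P+K KVG W++V  GF  LVATTGFN"
identities, length = midline_identity(query, midline)
print(f"{identities}/{length} = {100 * identities / length:.2f}%")
```

To estimate identity for a whole hit, the counts from each block would be summed before dividing; leading whitespace on the midline rows has to be preserved when copying them so that the columns stay aligned.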


Sequences
CDS sequence
ATGGCTTCCAAATCATGGCCTTTTTTTCTAATCTGCCTTGCCTTTCTATTCCCTTCCGCGATCGGCCAGGCCCCGGCACAGCCACCCAGCTTGCCAACCGTCTCCCCACC
GCCTTCTACCCCACCTCCGACCGCAACAACTCCTCCACCCTTGTCAACACCACCTCCTACTGTTGCACCTTCTACAAGCCCAACCATTCCACCACCCCAACCACCACAAA
GTTCACCCATCTCGCCCCCGTCGCTACCACCAGCATTGCCACCTCCGCGCGCTACGCCTCCGCCATTGCCGCCTCCTTTGCCACCACCACAAGTCTCTCCAACTCCTTCA
CAGACACCACCGGTGCCAGCTCCCACCCTAGCACCATTACCAACTCCGCCAACACCAATGCCTTCACCACCAACACCAGCACCCGTCGCGGTGCCAGAATTATCCCCTGC
ACCCGCACCGGGAAAGCATAAGCGTAGGAGGCATCACAAACATAAGAAGCATCAAGCACCGGCCCCGGCCCCAACCGTCCCGAGCCCACCGGCACCACCTACGGTGGTGG
ATTCACAAGATAGCACAGCGCCAGCACCCTCACCAAATCTGAATGGAGGAGAGCCAATGAAAGGAAAGGTGGGAACATGGACTAGAGTTGTATTCGGATTCACACTTCTG
GTTGCTACCACGGGCTTCAATTTCTAA
mRNA sequence
TGGAAGTTACCCATCAATATCACTATGAAAGCTACCATTCTCCCTCCACATTACACCAACCAGCTTTCCTTCAAAATTATTCATTCGATAAATGACTCATGCCCTTTCCT
TCTCTTAATTCCCATCTCCAACCATGGCTTCCAAATCATGGCCTTTTTTTCTAATCTGCCTTGCCTTTCTATTCCCTTCCGCGATCGGCCAGGCCCCGGCACAGCCACCC
AGCTTGCCAACCGTCTCCCCACCGCCTTCTACCCCACCTCCGACCGCAACAACTCCTCCACCCTTGTCAACACCACCTCCTACTGTTGCACCTTCTACAAGCCCAACCAT
TCCACCACCCCAACCACCACAAAGTTCACCCATCTCGCCCCCGTCGCTACCACCAGCATTGCCACCTCCGCGCGCTACGCCTCCGCCATTGCCGCCTCCTTTGCCACCAC
CACAAGTCTCTCCAACTCCTTCACAGACACCACCGGTGCCAGCTCCCACCCTAGCACCATTACCAACTCCGCCAACACCAATGCCTTCACCACCAACACCAGCACCCGTC
GCGGTGCCAGAATTATCCCCTGCACCCGCACCGGGAAAGCATAAGCGTAGGAGGCATCACAAACATAAGAAGCATCAAGCACCGGCCCCGGCCCCAACCGTCCCGAGCCC
ACCGGCACCACCTACGGTGGTGGATTCACAAGATAGCACAGCGCCAGCACCCTCACCAAATCTGAATGGAGGAGAGCCAATGAAAGGAAAGGTGGGAACATGGACTAGAG
TTGTATTCGGATTCACACTTCTGGTTGCTACCACGGGCTTCAATTTCTAATTTTGAAGAGAGGATTATTCACAAGGAACCCATTATGGTTCATAATATTATAGAATTTGG
CCCAGAATCTGAGAGAAGTATTTCATATATATCATTTTTTGTTCAATGATTTGTGGATTGGATTGGAATAATAATGAATTTTTTTTTTATTTTGGTAATTAATAATAATC
TTCAAGTTTCAAAGCTGATCCTCATTAATCTCAATCATGTTGAAAATTAAAAGATGATTGGTCCATGAACTATGATTTGGATAACATATATTGGAAACTGGTGAGAATTC
ATATTTGAT
Protein sequence
MASKSWPFFLICLAFLFPSAIGQAPAQPPSLPTVSPPPSTPPPTATTPPPLSTPPPTVAPSTSPTIPPPQPPQSSPISPPSLPPALPPPRATPPPLPPPLPPPQVSPTPS
QTPPVPAPTLAPLPTPPTPMPSPPTPAPVAVPELSPAPAPGKHKRRRHHKHKKHQAPAPAPTVPSPPAPPTVVDSQDSTAPAPSPNLNGGEPMKGKVGTWTRVVFGFTLL
VATTGFNF
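The CDS and protein sequences listed above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) applies to this gene. The names `CODON_TABLE` and `translate` are illustrative and are not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon.

    Whitespace and line breaks in the input are ignored, so the wrapped
    sequence lines can be pasted in directly.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)

# First seven codons and last four codons of the CDS listed above.
print(translate("ATGGCTTCCAAATCATGGCCT"))
print(translate("TTCAATTTCTAA"))
```

Passing the full CDS text to `translate` and comparing the result with the joined protein sequence lines gives a quick consistency check on the record.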