; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg020782 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg020782
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionRetrotransposon gag protein
Genome locationscaffold10:25283064..25299538
RNA-Seq ExpressionSpg020782
SyntenySpg020782
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_030494802.1 uncharacterized protein LOC115710583 [Cannabis sativa]1.2e-3038.4Show/hide
Query:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMD--------
        H +PHCIQLETFY GLN   ++++D+SANGA+L K+Y EA EIL+RIA  NY+W +    R    +   G  EVD +T + A++ + T  +         
Subjt:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMD--------

Query:  ----------------------ELAANQAPIVSHLAALGCGGES-----------------------NASSSNSLEAMLKDFMTSQQEYMAKNDALVRSQ
                              E   +    V ++ A   G E                          S ++SLE++++D       YMAKNDA+++SQ
Subjt:  ----------------------ELAANQAPIVSHLAALGCGGES-----------------------NASSSNSLEAMLKDFMTSQQEYMAKNDALVRSQ

Query:  AASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK
        AASL+NLE+QLGQLA++LKNRP GTLP++TE  R  GKEHC A+TL  GK
Subjt:  AASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK

XP_030495032.1 uncharacterized protein LOC115710819 [Cannabis sativa]6.3e-2933.33Show/hide
Query:  EDESYNPSYD-----------HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTV
        EDES + +++           H +PHCIQ+ETFY GLN + +++VD+SANGALL K+Y EA++I++RI+  NY+W +    R    K   G  EVD +T 
Subjt:  EDESYNPSYD-----------HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTV

Query:  VNAKIDAPTTRMDELAANQ---APIVSHLAALGC--------------------------------------------GGESNAS---------------
        ++A++ + +  +  ++  Q   +P V  L  + C                                             G SN++               
Subjt:  VNAKIDAPTTRMDELAANQ---APIVSHLAALGC--------------------------------------------GGESNAS---------------

Query:  ---------SSNSLEAMLKD-------FMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGKQQWK
                  ++SLE+M++D       FMT  + YMAKND  ++SQA S+R LE Q+GQLA+EL+NRP GTLP++TE  R  GKEHC A+ L  GK + K
Subjt:  ---------SSNSLEAMLKD-------FMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGKQQWK

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]9.8e-3036.67Show/hide
Query:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDEL------
        H +PHCIQLETFY GLN   ++++D+SANGA+L K+Y EA EIL+RIA  NY+W +    R H  +   G  EVD +T + A++ + T  +  +      
Subjt:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDEL------

Query:  ----------------------------------AANQ-------------APIVSHLAALGCGGES--------------------NASSSNSLEAMLK
                                            NQ              P   H      GG+                       S ++SLE++++
Subjt:  ----------------------------------AANQ-------------APIVSHLAALGCGGES--------------------NASSSNSLEAMLK

Query:  DFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK
        D       YMAKND +++SQAASLRNLEVQLGQLA++LKNRP GTLP++TE  R  GKEHC A+TL  GK
Subjt:  DFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK

XP_030503898.1 uncharacterized protein LOC115719117 [Cannabis sativa]2.6e-3038.17Show/hide
Query:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDEL------
        H +PHCIQLETFY GLN   ++++D+SANGA+L K+Y EA EIL+RIA  NY+W +    R    +   G  EVD +T + A++ + T  +  +      
Subjt:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDEL------

Query:  ----------------AANQ-------------APIVSHLAALGCGGESNASS------------------------------SNSLEAMLKDFMTSQQE
                          NQ              P   H      GG+  +SS                              ++SLE++++D       
Subjt:  ----------------AANQ-------------APIVSHLAALGCGGESNASS------------------------------SNSLEAMLKDFMTSQQE

Query:  YMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK
        YMAKNDA+++SQAASLRNLEVQLGQLA++LKNRP GTLP++TE  R  GKEHC A+TL  GK
Subjt:  YMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK

XP_030509265.1 uncharacterized protein LOC115723943 [Cannabis sativa]9.8e-3040Show/hide
Query:  EDESYNPSYD-----------HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTV
        EDES + +++           H +PHCIQ+ETFY GLN   ++++D+SANGA+L K+Y EA EIL+ IA  NY+W +    R    +   G  EVD +T 
Subjt:  EDESYNPSYD-----------HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTV

Query:  VNAKIDAPTTRMDEL-------AANQAPIVSHLAALGCGGESNASSSNSLEAMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGT
        + A++ + T     L       +++ AP     A      +    S ++  A      +  ++YMAKNDA+++SQAASLRNLE+QLG LA+ELK RP G+
Subjt:  VNAKIDAPTTRMDEL-------AANQAPIVSHLAALGCGGESNASSSNSLEAMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGT

Query:  LPNNTEASR--GKEHCHALTLCDGK
        LP++TE  R  GKE C ++ L  GK
Subjt:  LPNNTEASR--GKEHCHALTLCDGK

TrEMBL top hitse value%identityAlignment
A0A5B6VNY6 Gag-asp_proteas domain-containing protein2.8e-2237.93Show/hide
Query:  LPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDELAANQAPIV
        +PHCIQLETFY GLN   +++VD+SANG LL K+Y EA+ I+DRIA KN +W +    R +  +      EVD +  + A + + ++ + +   N    +
Subjt:  LPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDELAANQAPIV

Query:  SHLAALGCGGESNASSSNSLEAMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGKQQWKL
        +H+A                    + F      YMAKNDAL++ Q A+L+NLE ++GQLA+EL  RP G  P++ +  R  GKEHC  + L  GK     
Subjt:  SHLAALGCGGESNASSSNSLEAMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGKQQWKL

Query:  DVV
        DV+
Subjt:  DVV

A0A5B6VWJ0 Retroelement pol polyprotein-like3.5e-2534.91Show/hide
Query:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDA--------PTTRMD
        H +PHCIQLETFY GL  + +++VD+SANGALL K+Y EA+EI++RIA  NY+W ++   R    +   G  EVD +T + +++ +         T   +
Subjt:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDA--------PTTRMD

Query:  ELAANQAPIVSHLAALGCG------------------GESN--------------------------------------------------------ASS
          AA       ++A + CG                  G  N                                                        A +
Subjt:  ELAANQAPIVSHLAALGCG------------------GESN--------------------------------------------------------ASS

Query:  SNSLEAMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTL
        SNSLE++LK        YMAKNDAL++SQAA+L+NLE Q+GQLA+EL+NR  G LP++TE  R  GKEHC ALTL
Subjt:  SNSLEAMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTL

A0A6J1DWK1 uncharacterized protein LOC1110250537.3e-2334.66Show/hide
Query:  EDYTNRVSGHYSIDEDESYNPSYDHRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDP
        +D T +    Y +     +     H +P  IQ+ET+Y GL+   ++++D+S NGALL K Y +A  IL+RI+  N+ W    D R    K+S    E + 
Subjt:  EDYTNRVSGHYSIDEDESYNPSYDHRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDP

Query:  MTVVNAKIDAPT---TRMDELAANQAPIVSHLAALGCGGE--------SNASS-------------------SNSLEAMLKDFMTSQQEYMAKNDALVRS
         T +N+KI+  T    R    +    P + +    G  G         SNA +                    N  +  +       ++YMA NDA V+S
Subjt:  MTVVNAKIDAPT---TRMDELAANQAPIVSHLAALGCGGE--------SNASS-------------------SNSLEAMLKDFMTSQQEYMAKNDALVRS

Query:  QAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK
        QAASLRNLE+Q+GQLA +LK+RPVG LP++TE  +   KE C+ALTL  GK
Subjt:  QAASLRNLEVQLGQLASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK

A0A6J1DXK5 uncharacterized protein LOC1110255001.8e-2133.33Show/hide
Query:  LPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDELAANQAPI-
        +P CIQ++T+Y GL+   ++++D+SANGALL K Y EA  IL+RI+  N  W    D R    K S G  E +  T +N KI+  T  +     +Q+ + 
Subjt:  LPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDELAANQAPI-

Query:  -------VSHLAALGC-----------------------GGESNASSSNSLE-----------AMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQ
               VSH+  + C                         ++N ++  S+              ++    +  +YM  ND  V+SQA SLRNLE+Q+GQ
Subjt:  -------VSHLAALGC-----------------------GGESNASSSNSLE-----------AMLKDFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQ

Query:  LASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK
        LA++LK++P G LP++ +  +  GKE C+ALTL  GK
Subjt:  LASELKNRPVGTLPNNTEASR--GKEHCHALTLCDGK

A0A6J1G7Q6 uncharacterized protein LOC1114515982.4e-2132.72Show/hide
Query:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDELAANQAP
        H LPHCIQ+ETFY GLN   + +VD+SANG +L KTY EA+EIL+RIA  N +W    D R +P K +    EVD ++ +NA++ + T  +  LA  Q  
Subjt:  HRLPHCIQLETFYVGLNKNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDELAANQAP

Query:  IV---SHLA----------ALGCG--------------------------------------------------------------------GESNASSS
        ++   +H A           + CG                                                                    G  N  + 
Subjt:  IV---SHLA----------ALGCG--------------------------------------------------------------------GESNASSS

Query:  NSLEAMLKDFMTSQ-------------QEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTE
        +S +A  +   TSQ             +EYMA+NDA+++SQ  SLRNLEVQ+GQLA+EL+NRP+G LP +TE
Subjt:  NSLEAMLKDFMTSQ-------------QEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTE

SwissProt top hitse value%identityAlignment
Q39010 Shaggy-related protein kinase zeta8.4e-0868.18Show/hide
Query:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ
        D  K++ D +   EMS AV+E N+AVTGHIISTTIGGKNGEPKQ
Subjt:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ

Q39012 Shaggy-related protein kinase iota2.4e-0765.91Show/hide
Query:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ
        D  K++ + +   EMS AV+E N+AVTGHIISTTIGGKNGEPKQ
Subjt:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ

Arabidopsis top hitse value%identityAlignment
AT1G06390.1 GSK3/SHAGGY-like protein kinase 11.7e-0865.91Show/hide
Query:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ
        D  K++ + +   EMS AV+E N+AVTGHIISTTIGGKNGEPKQ
Subjt:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ

AT1G06390.2 GSK3/SHAGGY-like protein kinase 11.7e-0865.91Show/hide
Query:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ
        D  K++ + +   EMS AV+E N+AVTGHIISTTIGGKNGEPKQ
Subjt:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ

AT2G30980.1 SHAGGY-related protein kinase dZeta5.9e-0968.18Show/hide
Query:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ
        D  K++ D +   EMS AV+E N+AVTGHIISTTIGGKNGEPKQ
Subjt:  DGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ

AT4G18710.1 Protein kinase superfamily protein6.1e-0677.42Show/hide
Query:  EMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ
        EM  AVV+ ++ VTGHIISTTIGGKNGEPKQ
Subjt:  EMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ

AT4G18710.2 Protein kinase superfamily protein6.1e-0677.42Show/hide
Query:  EMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ
        EM  AVV+ ++ VTGHIISTTIGGKNGEPKQ
Subjt:  EMSTAVVEKNNAVTGHIISTTIGGKNGEPKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCAACTCCCAATAAAGCTACTAGCAACCAGTCATCAACCTTCTTGGGTATTGGACAATCTACATAGTGATCTTATTTACTCAGAAAATCAAGTTTCTCTCATGGA
TGGAAGTAAAAAGCAACTTGATAGCAATGAATCACTTGAAATGTCAACTGCAGTAGTTGAAAAGAATAATGCAGTAACTGGTCATATCATTTCCACTACAATCGGTGGCA
AAAATGGTGAACCTAAACAGTGGTTCGTTCGGAATTGTATTCCAGACCACTGTCACCTTCACCGAAGCTGCTGCCGTTCACCGTCGTCGGAGCTGCCACCCACCTACAAC
GCACGCCAACGGAGCCGCCGCCCACCTGCAACGAGCACGAATCCGCCGAATTTGCCTTTGCCCGCCTACAACGCTCGCGAATCCGTCGAATCTGCCGGTAGATCTACAAC
TCAGTCGCCAGATCTGCAACACCCACGCGAATCCCTTGCCGGATCTACAACACCTAGATCTGCAACGCCTGATCTCCTTTTCGTTCACGCCGCCGCCGTTCGTCCTCGCG
TTCATGTCCTCCAATGGAAGCTTGATGTTGTAGCAGCTGGCGCATGGAAAGCTTGTTGCTGCAGCAAGGGATGCTTGTTGATGCTACAACAAGTTTCTATCCAAGCAATA
CATGCTTTGGCTCAAAGCAACACATGTTTTGTGCTTTGGCTCAAAGGAACACATGCTTTGGCTCCAAAGCAAGAAAGTTCAACCTGTCGCATCCATATTTTATTACCCCA
GCACAACATCATTAGAATAGGATTTAATTTGAGATATTTTCTCTACTCGCCCCTTCCCTGTGGATTCGACCCTGGAATACTCCAGGTGTGCACCCAAGAAATGGATGCGC
TGAACCCACAAAATCCTCCCGTAAATCCAATACAATTGGCTGATGATAGAGAACGAGGCATTAGAGACTATGCAGCCCCAAGGATGTGCAATTTCAACCCTGGAATTGTA
CGTCCCAAGCTTGACACTGAAAGAAGAAGACGTCGTGAAACCTACAAACTTGATGTTCAATTTTTGAAAGGTGTTCGAGATAATTTGGTCGAGTTACGTGAAGCCATGAA
TGATTATGTTGAAATATGCAAAGAGATGCAACTAAGATACCAAGCCAAGAAGACTCAAGCTGCACAATCCAATCAAATTGTTAGAGATTGTTTAGCACCGAAAGATGATG
GTGACCAACATATTCAAAATGGGGGTGATCATCAAAATCTAGAAGAGAAAGAGACAAATGTTGAGGTTGTTGTGGAAAACAACGACATATGTGAAGTATGTGTTCAGAAT
GAGGTGGAGACAAAATCTCTAAACGATAGTGAGGAGAAGAGTGATGAGAAAACAAAAGAACAAGAGGAAAATGTGAAAAGTAAGAAGAGAGAGAAAAGAGAGGAAAATGT
GGCAAGTAAGAAAAGAGAGGAGAGTGTGAAAAGTGAAACAATAAAGAAAAAAGAGGAAACTAAAAAAGAAATGAGTGTCAGTGAAAAGAAATGTTTAAGTGAAAAGAAAC
ATGAGAGTGAGGATTGTTTTCCAAGTGTTTTTGATGGTTTAATCAATATACGTTGGATAATCAAGTTTGAAATTTCTTATCAACAACAATGGGAGAGGAACGTTCTACGT
AAAATGTTCACATTTGATCCAAGAGAAGACATTCCAATCCAAGTGTTTGAACACGAACGAGACGATAATTACAAAGCTGCCTTGCAAGGAGAGCTTTCAGATTCGAGGAA
GAATCCTTTCGAAGAGGGGGAAGTTGATAAGAATCATGGTGATCTTGAGATTCGGTTTGCTTCATATAGAAAATCGGTCATTTTGATAATAGATTCTGGAAAACCGAGAA
GGCCGAGATTGTTCAACTTCTTTGCTTCTTTGTTGTTCTTTGCTATAAAAGTGAGTTTTAAACTCTCATTTGTGTCAGGATCTGACCGTTGCATGTTTCAAACAAGGAGT
TCAAACAAATCCATGTTTCGTCAAGCACAAAAACGTTGTGAGAAACACGTCCGTGATGATCCATTTCGGAAGGCTTTTCGTGAAAATATGGATGAAATACATAAATCCAT
AAAGTACATGTCCGAATTAGCTCACGAATTGAAATTGCGGGATCAAGCAAGAGAGGCACAAATCACAAAATCCATTCAAGAAGAAAAAGATTGTTCCATGCCTCAAGATG
ATGTTTCAAATGATGACGTTGTTGTTGATGACGTGGTGACCAGCGAAGAAAGTGTGCAAACATATGTTGATCCTTCCTTTGAGACAAAAAGTGAGGAAAGTGAAAAGGAG
AACGTTTGTGAGAAAGAAACATTTGAGACTCATATAGAGACTAAAGAGAAAGAGAATTTAGTAATTAGAGTTTGTGAGAAGAATATTGTCTTGACAATGGTCAAGCGTGA
AGAACTTAAGGAGAGCAAGAGTGTGAGATGCAAAAGATTTCAAGTTTGCACTACAAGGGGATCCTTGGATTCGAGGACGAATCCTCTTGAAGAAGGGGAGGATTATACGA
ACCGAGTTAGTGGTCATTATAGCATTGATGAGGATGAAAGTTACAACCCTTCATATGACCATAGATTGCCTCACTGTATTCAATTGGAGACATTTTATGTGGGGCTGAAC
AAGAACTTTCAAGTGTTGGTTGACTCTTCAGCCAATGGCGCATTACTAAGGAAAACATATGTTGAAGCGCATGAAATCCTTGATCGCATTGCGAGGAAAAATTATGAATG
GGGATCAGCTGAAGACAAAAGGAAGCACCCAATGAAGACAAGTTATGGAAGTTTTGAAGTGGACCCCATGACAGTAGTTAATGCAAAAATCGATGCGCCAACCACACGAA
TGGACGAATTAGCTGCAAATCAAGCTCCCATTGTATCGCATCTAGCTGCGTTAGGATGTGGTGGTGAATCTAACGCATCGTCCTCAAACTCATTGGAGGCCATGCTGAAG
GATTTTATGACCAGTCAACAGGAGTATATGGCAAAGAATGATGCGTTAGTGCGAAGCCAGGCCGCATCACTCAGAAATCTTGAAGTTCAACTAGGGCAATTAGCATCAGA
ACTGAAGAACAGACCAGTAGGAACTCTTCCGAACAACACTGAAGCGTCAAGAGGAAAAGAGCACTGCCATGCGTTAACCTTGTGCGATGGTAAACAACAATGGAAGCTTG
ATGTTGTAGCAGCTGGCGCATGGAAAGCTTGTTGCTGCAGCAAGGGATGCTTGTTGATGCTACAGCAAGTTTCTATCCAAGCAATACATGCTTTGGCTCAAAGCAAGACA
TATGCTTTGGCTCCAAGCAAGAAAAGTTTATAA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCAACTCCCAATAAAGCTACTAGCAACCAGTCATCAACCTTCTTGGGTATTGGACAATCTACATAGTGATCTTATTTACTCAGAAAATCAAGTTTCTCTCATGGA
TGGAAGTAAAAAGCAACTTGATAGCAATGAATCACTTGAAATGTCAACTGCAGTAGTTGAAAAGAATAATGCAGTAACTGGTCATATCATTTCCACTACAATCGGTGGCA
AAAATGGTGAACCTAAACAGTGGTTCGTTCGGAATTGTATTCCAGACCACTGTCACCTTCACCGAAGCTGCTGCCGTTCACCGTCGTCGGAGCTGCCACCCACCTACAAC
GCACGCCAACGGAGCCGCCGCCCACCTGCAACGAGCACGAATCCGCCGAATTTGCCTTTGCCCGCCTACAACGCTCGCGAATCCGTCGAATCTGCCGGTAGATCTACAAC
TCAGTCGCCAGATCTGCAACACCCACGCGAATCCCTTGCCGGATCTACAACACCTAGATCTGCAACGCCTGATCTCCTTTTCGTTCACGCCGCCGCCGTTCGTCCTCGCG
TTCATGTCCTCCAATGGAAGCTTGATGTTGTAGCAGCTGGCGCATGGAAAGCTTGTTGCTGCAGCAAGGGATGCTTGTTGATGCTACAACAAGTTTCTATCCAAGCAATA
CATGCTTTGGCTCAAAGCAACACATGTTTTGTGCTTTGGCTCAAAGGAACACATGCTTTGGCTCCAAAGCAAGAAAGTTCAACCTGTCGCATCCATATTTTATTACCCCA
GCACAACATCATTAGAATAGGATTTAATTTGAGATATTTTCTCTACTCGCCCCTTCCCTGTGGATTCGACCCTGGAATACTCCAGGTGTGCACCCAAGAAATGGATGCGC
TGAACCCACAAAATCCTCCCGTAAATCCAATACAATTGGCTGATGATAGAGAACGAGGCATTAGAGACTATGCAGCCCCAAGGATGTGCAATTTCAACCCTGGAATTGTA
CGTCCCAAGCTTGACACTGAAAGAAGAAGACGTCGTGAAACCTACAAACTTGATGTTCAATTTTTGAAAGGTGTTCGAGATAATTTGGTCGAGTTACGTGAAGCCATGAA
TGATTATGTTGAAATATGCAAAGAGATGCAACTAAGATACCAAGCCAAGAAGACTCAAGCTGCACAATCCAATCAAATTGTTAGAGATTGTTTAGCACCGAAAGATGATG
GTGACCAACATATTCAAAATGGGGGTGATCATCAAAATCTAGAAGAGAAAGAGACAAATGTTGAGGTTGTTGTGGAAAACAACGACATATGTGAAGTATGTGTTCAGAAT
GAGGTGGAGACAAAATCTCTAAACGATAGTGAGGAGAAGAGTGATGAGAAAACAAAAGAACAAGAGGAAAATGTGAAAAGTAAGAAGAGAGAGAAAAGAGAGGAAAATGT
GGCAAGTAAGAAAAGAGAGGAGAGTGTGAAAAGTGAAACAATAAAGAAAAAAGAGGAAACTAAAAAAGAAATGAGTGTCAGTGAAAAGAAATGTTTAAGTGAAAAGAAAC
ATGAGAGTGAGGATTGTTTTCCAAGTGTTTTTGATGGTTTAATCAATATACGTTGGATAATCAAGTTTGAAATTTCTTATCAACAACAATGGGAGAGGAACGTTCTACGT
AAAATGTTCACATTTGATCCAAGAGAAGACATTCCAATCCAAGTGTTTGAACACGAACGAGACGATAATTACAAAGCTGCCTTGCAAGGAGAGCTTTCAGATTCGAGGAA
GAATCCTTTCGAAGAGGGGGAAGTTGATAAGAATCATGGTGATCTTGAGATTCGGTTTGCTTCATATAGAAAATCGGTCATTTTGATAATAGATTCTGGAAAACCGAGAA
GGCCGAGATTGTTCAACTTCTTTGCTTCTTTGTTGTTCTTTGCTATAAAAGTGAGTTTTAAACTCTCATTTGTGTCAGGATCTGACCGTTGCATGTTTCAAACAAGGAGT
TCAAACAAATCCATGTTTCGTCAAGCACAAAAACGTTGTGAGAAACACGTCCGTGATGATCCATTTCGGAAGGCTTTTCGTGAAAATATGGATGAAATACATAAATCCAT
AAAGTACATGTCCGAATTAGCTCACGAATTGAAATTGCGGGATCAAGCAAGAGAGGCACAAATCACAAAATCCATTCAAGAAGAAAAAGATTGTTCCATGCCTCAAGATG
ATGTTTCAAATGATGACGTTGTTGTTGATGACGTGGTGACCAGCGAAGAAAGTGTGCAAACATATGTTGATCCTTCCTTTGAGACAAAAAGTGAGGAAAGTGAAAAGGAG
AACGTTTGTGAGAAAGAAACATTTGAGACTCATATAGAGACTAAAGAGAAAGAGAATTTAGTAATTAGAGTTTGTGAGAAGAATATTGTCTTGACAATGGTCAAGCGTGA
AGAACTTAAGGAGAGCAAGAGTGTGAGATGCAAAAGATTTCAAGTTTGCACTACAAGGGGATCCTTGGATTCGAGGACGAATCCTCTTGAAGAAGGGGAGGATTATACGA
ACCGAGTTAGTGGTCATTATAGCATTGATGAGGATGAAAGTTACAACCCTTCATATGACCATAGATTGCCTCACTGTATTCAATTGGAGACATTTTATGTGGGGCTGAAC
AAGAACTTTCAAGTGTTGGTTGACTCTTCAGCCAATGGCGCATTACTAAGGAAAACATATGTTGAAGCGCATGAAATCCTTGATCGCATTGCGAGGAAAAATTATGAATG
GGGATCAGCTGAAGACAAAAGGAAGCACCCAATGAAGACAAGTTATGGAAGTTTTGAAGTGGACCCCATGACAGTAGTTAATGCAAAAATCGATGCGCCAACCACACGAA
TGGACGAATTAGCTGCAAATCAAGCTCCCATTGTATCGCATCTAGCTGCGTTAGGATGTGGTGGTGAATCTAACGCATCGTCCTCAAACTCATTGGAGGCCATGCTGAAG
GATTTTATGACCAGTCAACAGGAGTATATGGCAAAGAATGATGCGTTAGTGCGAAGCCAGGCCGCATCACTCAGAAATCTTGAAGTTCAACTAGGGCAATTAGCATCAGA
ACTGAAGAACAGACCAGTAGGAACTCTTCCGAACAACACTGAAGCGTCAAGAGGAAAAGAGCACTGCCATGCGTTAACCTTGTGCGATGGTAAACAACAATGGAAGCTTG
ATGTTGTAGCAGCTGGCGCATGGAAAGCTTGTTGCTGCAGCAAGGGATGCTTGTTGATGCTACAGCAAGTTTCTATCCAAGCAATACATGCTTTGGCTCAAAGCAAGACA
TATGCTTTGGCTCCAAGCAAGAAAAGTTTATAA
Protein sequenceShow/hide protein sequence
MSQLPIKLLATSHQPSWVLDNLHSDLIYSENQVSLMDGSKKQLDSNESLEMSTAVVEKNNAVTGHIISTTIGGKNGEPKQWFVRNCIPDHCHLHRSCCRSPSSELPPTYN
ARQRSRRPPATSTNPPNLPLPAYNARESVESAGRSTTQSPDLQHPRESLAGSTTPRSATPDLLFVHAAAVRPRVHVLQWKLDVVAAGAWKACCCSKGCLLMLQQVSIQAI
HALAQSNTCFVLWLKGTHALAPKQESSTCRIHILLPQHNIIRIGFNLRYFLYSPLPCGFDPGILQVCTQEMDALNPQNPPVNPIQLADDRERGIRDYAAPRMCNFNPGIV
RPKLDTERRRRRETYKLDVQFLKGVRDNLVELREAMNDYVEICKEMQLRYQAKKTQAAQSNQIVRDCLAPKDDGDQHIQNGGDHQNLEEKETNVEVVVENNDICEVCVQN
EVETKSLNDSEEKSDEKTKEQEENVKSKKREKREENVASKKREESVKSETIKKKEETKKEMSVSEKKCLSEKKHESEDCFPSVFDGLINIRWIIKFEISYQQQWERNVLR
KMFTFDPREDIPIQVFEHERDDNYKAALQGELSDSRKNPFEEGEVDKNHGDLEIRFASYRKSVILIIDSGKPRRPRLFNFFASLLFFAIKVSFKLSFVSGSDRCMFQTRS
SNKSMFRQAQKRCEKHVRDDPFRKAFRENMDEIHKSIKYMSELAHELKLRDQAREAQITKSIQEEKDCSMPQDDVSNDDVVVDDVVTSEESVQTYVDPSFETKSEESEKE
NVCEKETFETHIETKEKENLVIRVCEKNIVLTMVKREELKESKSVRCKRFQVCTTRGSLDSRTNPLEEGEDYTNRVSGHYSIDEDESYNPSYDHRLPHCIQLETFYVGLN
KNFQVLVDSSANGALLRKTYVEAHEILDRIARKNYEWGSAEDKRKHPMKTSYGSFEVDPMTVVNAKIDAPTTRMDELAANQAPIVSHLAALGCGGESNASSSNSLEAMLK
DFMTSQQEYMAKNDALVRSQAASLRNLEVQLGQLASELKNRPVGTLPNNTEASRGKEHCHALTLCDGKQQWKLDVVAAGAWKACCCSKGCLLMLQQVSIQAIHALAQSKT
YALAPSKKSL