CuGenDBv2

Lag0013797 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0013797
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Pinin-like
Genome location: chr1:52825603..52826475
RNA-Seq Expression: Lag0013797
Synteny: Lag0013797
Gene Ontology terms: NA
InterPro domains: NA


Homology

GenBank top hits | e value | % identity
KAG6594621.1 hypothetical protein SDJN03_11174, partial [Cucurbita argyrosperma subsp. sororia] | 2.5e-94 | 69.67
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN
        MGCCVSSGKS +SAHKFD   AA A KIFGPLTDNGSREPP SMEEETVKEVLSET ALK LP  P  K+C PE DEAQKPVGD    EIEKK  +IPIN
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN

Query:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA
        GI ++ASEF EISNP++   TANFTD MD G EV+Q V K+    LP NQ +   + G+KR+ SPN+TLNRRSDQSPVRRN  VGSARLVQ +D SPAM 
Subjt:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA

Query:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
         RGLR EP  +DPDEN  RRSRSPATAR D GGSRSALGRTPSVRKSGK +      A   A + S+KVVEE NI DG + TQIESLENPLVSLECFIFL
Subjt:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

KAG7026590.1 hypothetical protein SDJN02_10592, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.5e-94 | 69.67
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN
        MGCCVSSGKS +SAHKFD   AA A KIFGPLTDNGSREPP SMEEETVKEVLSET ALK LP  P  K+C PE DEAQKPVGD    EIEKK  +IPIN
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN

Query:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA
        GI ++ASEF EISNP++   TANFTD MD G EV+Q V K+    LP NQ +   + G+KR+ SPN+TLNRRSDQSPVRRN  VGSARLVQ RD SPAM 
Subjt:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA

Query:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
         RGLR EP ++DPDEN  RRSRSPATAR D GGSRSALGRTPSVRKSGK +      A   A + S+KVVEE NI +G + TQIESLENPLVSLECFIFL
Subjt:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

XP_022926404.1 uncharacterized protein LOC111433567 [Cucurbita moschata] | 4.3e-94 | 69.67
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN
        MGCCVSSGKS +SAHKFD   AA A KIFGPLTDNGSREPP SMEEETVKEVLSET ALK L   P  KNC PE DEAQKPVGD    EIEKK  +IPIN
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN

Query:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA
        GI ++ SEF EISNP++   TANFTD MD G EV+Q V K+    LP NQ +   + G+KR+ SPN+TLNRRSDQSPVRRN  VGSARLVQ RD SPAM 
Subjt:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA

Query:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
         RGLR EP ++DPDEN  RRSRSPATAR D GGSRSALGRTPSVRKSGK +      A   A + S+KVVEE NI DG + TQIESLENPLVSLECFIFL
Subjt:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

XP_023518159.1 uncharacterized protein LOC111781703 [Cucurbita pepo subsp. pepo] | 2.8e-93 | 70.51
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN
        MGCCVSSGKS +SAHKFD   AA A KIFGPLTDNGSREPP SMEEETVKEVLSET ALK L   P  KNC PE DEAQKPVGD    EIEKK  +IPIN
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN

Query:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA
        GIA++ASEF EISNP++   TANFTD MD G EV+Q V K+    LP NQ +   D G+KR+ SPN+TLNRRSDQSPVRRN  VGSARLVQ RD SPAM 
Subjt:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA

Query:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLE
         RGLR EP ++DPDEN  RRSRSPATAR DGGGSRSALGRTPSVRKSGK +      A   A + S+KVVEE NI DG + TQIESLENPLVSLE
Subjt:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLE

XP_038882208.1 uncharacterized protein LOC120073430 [Benincasa hispida] | 6.0e-96 | 70.55
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE
        MGCC+SSGKS NS +KF RNS            DNGSR+PP SMEEETVKEVLSETP+LK  PSPP KKN  PE D+  KPVG+EIEKK  +I INGIAE
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE

Query:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR
          SEFYEIS+PNEC+   TA  T++MD GGE++Q V KS PV+LPK+Q +S  D  +KRE S NRTL RRSDQSPVRRNGA+GS R+V NRD++PAMARR
Subjt:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR

Query:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
         LR EPPRRDPDENS RRSRSPATARSDG GSRSAL RTPSVRKSGK +    A + SQKVVEE NIIDGK N+QIESLENPLVSLECFIFL
Subjt:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

TrEMBL top hits | e value | % identity
A0A0A0KLE9 Uncharacterized protein | 4.0e-90 | 67.12
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE
        MGCC+SS +S +S +KF  NS             N SR+PP SMEEETVKEVLSETPALK    PP+K N  PE DE +KP+GDEIEKK  +IPINGI E
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE

Query:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR
        + SEFYEIS+ N+C+    A FTD+ D GGEV+Q V KS PV+L KNQ VSS D  +KRE   +RTL RRSDQSPVRRNGAVGS R+V NRD+SPAMARR
Subjt:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR

Query:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
        GLR EPPRRDPDENSSRRS SP+TARSD  G RSAL RTPS RKSGK +      + SQKVVEE NI+DGK NTQIESLENPLVSLECFIFL
Subjt:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

A0A1S3B2L5 uncharacterized protein LOC103485312 | 7.6e-89 | 66.1
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE
        MGCC+SS +S NS +KF             P + N +R+PP SMEEETVKEVLSETPALK    PP  KNC PE DE  KP+GDE EKK  +IPINGI E
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE

Query:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR
        + SEFYEIS+ N+C+    A FTD+ D GGEV+Q   KS PV+L KNQ VSS D  +KRE   +RTL RRSDQSPVRRNGAVGS R+V NRD+SPAMARR
Subjt:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR

Query:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
        GLR EPPRRDPDENSSRRS+SP+TA SD  G RSAL RTPS RKSGK +      + SQKVVEE NI+DGK NTQIESLENPLVSLECFIFL
Subjt:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

A0A5D3CNI1 Putative BEST plant protein match is: (TAIR:plant.1) protein | 7.6e-89 | 66.1
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE
        MGCC+SS +S NS +KF             P + N +R+PP SMEEETVKEVLSETPALK    PP  KNC PE DE  KP+GDE EKK  +IPINGI E
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAE

Query:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR
        + SEFYEIS+ N+C+    A FTD+ D GGEV+Q   KS PV+L KNQ VSS D  +KRE   +RTL RRSDQSPVRRNGAVGS R+V NRD+SPAMARR
Subjt:  KASEFYEISNPNECL--HTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARR

Query:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
        GLR EPPRRDPDENSSRRS+SP+TA SD  G RSAL RTPS RKSGK +      + SQKVVEE NI+DGK NTQIESLENPLVSLECFIFL
Subjt:  GLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

A0A6J1CMA7 uncharacterized protein LOC111012433 | 2.6e-52 | 64.32
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD-------EIEKKHRDI
        MGCCVSSG   NSAHKFDRNSAA   KI+       SREPP SMEEETVKEVL+ETPALK    PP  KN  P+ DEA KPV D       EIEKK R I
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD-------EIEKKHRDI

Query:  PINGIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVS
        P N +AE A EF EIS+P+ECL  A FTDKMD G EV+QRV ++ PV+LPKNQ   S    +KRE  PNR LNRRSDQSPVRRNG VGSARL QNRD++
Subjt:  PINGIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVS

A0A6J1EF08 uncharacterized protein LOC111433567 | 2.1e-94 | 69.67
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN
        MGCCVSSGKS +SAHKFD   AA A KIFGPLTDNGSREPP SMEEETVKEVLSET ALK L   P  KNC PE DEAQKPVGD    EIEKK  +IPIN
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGD----EIEKKHRDIPIN

Query:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA
        GI ++ SEF EISNP++   TANFTD MD G EV+Q V K+    LP NQ +   + G+KR+ SPN+TLNRRSDQSPVRRN  VGSARLVQ RD SPAM 
Subjt:  GIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMA

Query:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
         RGLR EP ++DPDEN  RRSRSPATAR D GGSRSALGRTPSVRKSGK +      A   A + S+KVVEE NI DG + TQIESLENPLVSLECFIFL
Subjt:  RRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLA------ATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL

SwissProt top hits | e value | % identity
No hits found

Arabidopsis top hits | e value | % identity
AT1G11125.1 unknown protein | 3.2e-07 | 28.98
Query:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEET-VKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIA
        MGCC+SS               ATA K   P++   +  PP  ++EET VKEVLSET  L          N    V++       E E+K   I ++   
Subjt:  MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEET-VKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIA

Query:  EKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQL----VSSVDAGIKREFSP----NRTL----NRRSDQSPVRRNG--AVGSARL
        E       +  P             + G EV +  S S  +    N+     V  + + + R+ SP    NR +    NRR+D SP +RN     GS RL
Subjt:  EKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQL----VSSVDAGIKREFSP----NRTL----NRRSDQSPVRRNG--AVGSARL

Query:  VQNRDVSPAMARRGLRVEPPRRDPDENSSRRSRSPATARSDGGG---SRSALGRTPSVRKSGKLAATATAASGSQKVVEETN------IIDGKLNTQIES
        V +   +              RD  E S RRSRSPA  RS   G   S+ +     ++R+  +               +E N       I    +   +S
Subjt:  VQNRDVSPAMARRGLRVEPPRRDPDENSSRRSRSPATARSDGGG---SRSALGRTPSVRKSGKLAATATAASGSQKVVEETN------IIDGKLNTQIES

Query:  LENPLVSLECFIFL
         ENPLVSLECFIFL
Subjt:  LENPLVSLECFIFL

AT1G61170.1 unknown protein | 4.7e-06 | 29.28
Query:  CCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEET-VKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDI--------
        CCVSSG +       DR +            +N S +    +EEET VKEVLSET     L +P          D  +  + ++ EKK   +        
Subjt:  CCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEET-VKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDI--------

Query:  --PINGIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDV
          P +   E+ SE  EI + +    + + T  M+   E +  + +    + P         A  + + + N    RR+DQSP +RN    +         
Subjt:  --PINGIAEKASEFYEISNPNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDV

Query:  SPAMARRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLN-----TQIESLENPLVSLEC
            AR G  V    RDP E S RRSRSPAT       +RS +    S R  G        + G  ++    N +D + +     T  E LENPLVSLEC
Subjt:  SPAMARRGLRVEPPRRDPDENSSRRSRSPATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLN-----TQIESLENPLVSLEC

Query:  FIFL
        FIFL
Subjt:  FIFL


Sequences

CDS sequence
ATGGGTTGCTGCGTTAGTTCGGGGAAATCCTTAAATTCTGCGCACAAATTTGATCGGAATTCTGCGGCGACCGCCGTTAAGATTTTTGGGCCGTTAACGGACAATGGAAG
CAGAGAGCCGCCGCCTTCCATGGAGGAAGAGACCGTCAAGGAAGTGCTCTCTGAAACACCTGCTCTGAAACAACTGCCGTCGCCGCCGAAGAAGAAGAATTGTCAACCGG
AAGTAGATGAAGCCCAGAAACCAGTCGGCGACGAGATCGAGAAGAAGCATCGCGATATTCCCATTAATGGAATTGCAGAAAAAGCTTCTGAATTCTATGAAATTTCCAAT
CCGAACGAGTGTCTCCACACCGCCAATTTCACCGATAAAATGGACGCCGGCGGAGAGGTTTATCAGAGGGTTTCGAAATCACCGCCGGTGCAACTGCCGAAGAATCAATT
AGTTTCTTCCGTGGACGCTGGGATAAAAAGAGAATTTTCGCCAAACAGGACACTAAACCGGAGATCCGACCAGTCTCCGGTCCGACGAAACGGCGCCGTCGGTTCGGCGA
GATTGGTTCAGAACAGAGACGTGAGTCCAGCAATGGCGCGGCGGGGGTTGAGAGTGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGCGATCCAGGTCGCCA
GCCACCGCGCGTTCCGACGGCGGAGGGTCTAGATCCGCCTTGGGTCGGACCCCATCAGTGAGAAAGTCCGGTAAGTTGGCGGCGACGGCGACGGCAGCTTCGGGCAGTCA
AAAAGTAGTAGAAGAAACTAATATAATTGACGGAAAGCTGAACACTCAAATTGAGTCACTTGAGAATCCTCTGGTTTCATTGGAGTGCTTCATCTTCCTCTGA

mRNA sequence
ATGGGTTGCTGCGTTAGTTCGGGGAAATCCTTAAATTCTGCGCACAAATTTGATCGGAATTCTGCGGCGACCGCCGTTAAGATTTTTGGGCCGTTAACGGACAATGGAAG
CAGAGAGCCGCCGCCTTCCATGGAGGAAGAGACCGTCAAGGAAGTGCTCTCTGAAACACCTGCTCTGAAACAACTGCCGTCGCCGCCGAAGAAGAAGAATTGTCAACCGG
AAGTAGATGAAGCCCAGAAACCAGTCGGCGACGAGATCGAGAAGAAGCATCGCGATATTCCCATTAATGGAATTGCAGAAAAAGCTTCTGAATTCTATGAAATTTCCAAT
CCGAACGAGTGTCTCCACACCGCCAATTTCACCGATAAAATGGACGCCGGCGGAGAGGTTTATCAGAGGGTTTCGAAATCACCGCCGGTGCAACTGCCGAAGAATCAATT
AGTTTCTTCCGTGGACGCTGGGATAAAAAGAGAATTTTCGCCAAACAGGACACTAAACCGGAGATCCGACCAGTCTCCGGTCCGACGAAACGGCGCCGTCGGTTCGGCGA
GATTGGTTCAGAACAGAGACGTGAGTCCAGCAATGGCGCGGCGGGGGTTGAGAGTGGAGCCTCCCCGGAGAGACCCAGATGAGAATTCCAGCCGGCGATCCAGGTCGCCA
GCCACCGCGCGTTCCGACGGCGGAGGGTCTAGATCCGCCTTGGGTCGGACCCCATCAGTGAGAAAGTCCGGTAAGTTGGCGGCGACGGCGACGGCAGCTTCGGGCAGTCA
AAAAGTAGTAGAAGAAACTAATATAATTGACGGAAAGCTGAACACTCAAATTGAGTCACTTGAGAATCCTCTGGTTTCATTGGAGTGCTTCATCTTCCTCTGA

Protein sequence
MGCCVSSGKSLNSAHKFDRNSAATAVKIFGPLTDNGSREPPPSMEEETVKEVLSETPALKQLPSPPKKKNCQPEVDEAQKPVGDEIEKKHRDIPINGIAEKASEFYEISN
PNECLHTANFTDKMDAGGEVYQRVSKSPPVQLPKNQLVSSVDAGIKREFSPNRTLNRRSDQSPVRRNGAVGSARLVQNRDVSPAMARRGLRVEPPRRDPDENSSRRSRSP
ATARSDGGGSRSALGRTPSVRKSGKLAATATAASGSQKVVEETNIIDGKLNTQIESLENPLVSLECFIFL
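
As a consistency check, the CDS and protein sequences above can be compared by translating the CDS. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1). The `translate` helper and its table constants are illustrative names, not part of CuGenDB, and the sketch does not handle ambiguous bases such as N.

```python
# Standard genetic code (NCBI translation table 1), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are removed first, so the wrapped
    sequence lines can be pasted in as-is.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 21 nucleotides of the CDS listed above.
print(translate("ATGGGTTGCTGCGTTAGTTCG"))  # prints MGCCVSS
```

Passing the full CDS text to `translate` and comparing the result with the protein sequence (with line breaks removed) would flag any frame or annotation mismatch.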