; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS002895 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS002895
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
Descriptionleucine-rich repeat extensin-like protein 3
Genome locationscaffold359:514335..515134
RNA-Seq ExpressionMS002895
SyntenyMS002895
Gene Ontology termsGO:1900150 - regulation of defense response to fungus (biological process)
GO:0046872 - metal ion binding (molecular function)
InterPro domainsIPR036163 - Heavy metal-associated domain superfamily
IPR044169 - Protein PYRICULARIA ORYZAE RESISTANCE 21


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573522.1 hypothetical protein SDJN03_27409, partial [Cucurbita argyrosperma subsp. sororia]1.4e-6057.08Show/hide
Query:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPK----PPPQKQTDPPPARSKDSP
        K  IM+L VDLQCHRCY KV+KV+ KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSI+I++PE PK    PPP+K   PPP +  DSP
Subjt:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPK----PPPQKQTDPPPARSKDSP

Query:  PAAKPANPTKQK-------------HPQTPAGKPAAALTPAPPQILT----PVQSD--------PVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGP
        P  KP  P  QK              PQ     PA      PP ++     PVQ+         PV GYP  YP G  CR CYEG G GPCY G+GRP  
Subjt:  PAAKPANPTKQK-------------HPQTPAGKPAAALTPAPPQILT----PVQSD--------PVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGP

Query:  CCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
        CC+GCASGRPIY+S GGG  CY+S  E+LNE+NA GCSVM
Subjt:  CCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

KAG6583723.1 hypothetical protein SDJN03_19655, partial [Cucurbita argyrosperma subsp. sororia]3.6e-6152.79Show/hide
Query:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK-----EPEKPKPPPQKQTDPPPARSKDS
        K  +M+L VDLQC RCY KV+KVL KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSIEIK      P+KP PPP K+ DPPP +  D 
Subjt:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK-----EPEKPKPPPQKQTDPPPARSKDS

Query:  PPAAKP-------------------------------------ANPTKQKHPQTPAGK-----PAAALTPAPPQILTPVQSD-----------PVLGYPL
        PP AKP                                     A+P   K    P  K     PA    P PP+ + PV              PV GYP 
Subjt:  PPAAKP-------------------------------------ANPTKQKHPQTPAGK-----PAAALTPAPPQILTPVQSD-----------PVLGYPL

Query:  TYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
         YP G  C +CYEG G GPCY G+GRPGPCCDGCASGRPIY+S GGG PCYVS+ E+LNE+NA+GCSVM
Subjt:  TYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

XP_004139513.2 circumsporozoite protein [Cucumis sativus]1.4e-6050Show/hide
Query:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAAK
        K  +M+L VDLQC RCY KV+KVL KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSIEIKEPE PKPPP K  DPPP +  D PP+ K
Subjt:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAAK

Query:  PANPTKQKHPQTPAGK-------------PAAALTPAPPQ--------------------------------------------------------ILTP
        P  P  QK    P  K             P+ A  P PPQ                                                        I  P
Subjt:  PANPTKQKHPQTPAGK-------------PAAALTPAPPQ--------------------------------------------------------ILTP

Query:  VQSD--------PVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
        VQ +        PV GYP  YP G  CR+C+EG G GPCY G+G PGPCCDGCASGRPIY+S GGG PCYVS+ E+LNE+NA+GC VM
Subjt:  VQSD--------PVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

XP_022142410.1 protein PYRICULARIA ORYZAE RESISTANCE 21 [Momordica charantia]1.2e-11798.12Show/hide
Query:  QSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPA
        +SKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPP RSKDSPPA
Subjt:  QSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPA

Query:  AKPANPTKQKHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSE
        AKPANPTKQKHPQ PAGKPAAALTPAPPQILTPVQSDPVLGYPL YPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSE
Subjt:  AKPANPTKQKHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSE

Query:  HLNEKNAAGCSVM
        HLNEKNAAGCSVM
Subjt:  HLNEKNAAGCSVM

XP_023542911.1 leucine-rich repeat extensin-like protein 3 [Cucurbita pepo subsp. pepo]1.0e-6056.28Show/hide
Query:  YLQSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPK----PPPQKQTDPPPARS
        ++  K  +M+L VDLQCHRCY KV+KVL KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSI+I++PE PK    PPPQK TD PP + 
Subjt:  YLQSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPK----PPPQKQTDPPPARS

Query:  KDSPPAAKPANPTKQKHPQTP----------AGKPAAALTPAPP----------QILTPVQSD--------PVLGYPLTYPFGTGCRRCYEGIGCGPCYQ
         D PP  K  +P   K P  P            K A    PAPP          Q   PVQ+         PV GYP  YP G  CR CYEG G GPCY 
Subjt:  KDSPPAAKPANPTKQKHPQTP----------AGKPAAALTPAPP----------QILTPVQSD--------PVLGYPLTYPFGTGCRRCYEGIGCGPCYQ

Query:  GYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
        G+GRP  CC+GCASGRPIY+S GGG  CY+S  E+LNE+NA GCSVM
Subjt:  GYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

TrEMBL top hitse value%identityAlignment
A0A0A0LTA1 Uncharacterized protein7.7e-6251.43Show/hide
Query:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAAK
        K  +M+L VDLQC RCY KV+KVL KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSIEIKEPE PKPPP K  DPPP +  D PP+ K
Subjt:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAAK

Query:  PANPTKQKHPQTPAGK-------------PAAALTPAPPQ------------------------------------------------ILTPVQSD----
        P  P  QK    P  K             P+ A  P PPQ                                                I  PVQ +    
Subjt:  PANPTKQKHPQTPAGK-------------PAAALTPAPPQ------------------------------------------------ILTPVQSD----

Query:  ----PVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
            PV GYP  YP G  CR+C+EG G GPCY G+G PGPCCDGCASGRPIY+S GGG PCYVS+ E+LNE+NA+GC VM
Subjt:  ----PVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

A0A6J1CKV6 protein PYRICULARIA ORYZAE RESISTANCE 215.8e-11898.12Show/hide
Query:  QSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPA
        +SKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPP RSKDSPPA
Subjt:  QSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPA

Query:  AKPANPTKQKHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSE
        AKPANPTKQKHPQ PAGKPAAALTPAPPQILTPVQSDPVLGYPL YPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSE
Subjt:  AKPANPTKQKHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSE

Query:  HLNEKNAAGCSVM
        HLNEKNAAGCSVM
Subjt:  HLNEKNAAGCSVM

A0A6J1ELG0 leucine-rich repeat extensin-like protein 3 isoform X121.1e-6053.85Show/hide
Query:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK-----EPEKPKPPPQKQTDPPPARSKDS
        K  +M+L VDLQC RCY KV+KVL KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSIEIK      P+KP PPP K+ DPPP    D 
Subjt:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK-----EPEKPKPPPQKQTDPPPARSKDS

Query:  PPAAK---------------------PANPTKQKHPQTPAGK------------PAAALTPAPPQILTPVQSD-----------PVLGYPLTYPFGTGCR
        PP  K                      A+P   K    P  K            PA    P PP+ + PV              PV GYP  YP G  C 
Subjt:  PPAAK---------------------PANPTKQKHPQTPAGK------------PAAALTPAPPQILTPVQSD-----------PVLGYPLTYPFGTGCR

Query:  RCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
        +CYEG G GPCY G+GRPGPCCDGCASGRPIY+S GGG PCYVS+ E+LNE+NA+GCSVM
Subjt:  RCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

A0A6J1EPF0 protein PYRICULARIA ORYZAE RESISTANCE 21-like isoform X109.5e-6052.04Show/hide
Query:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK---------------------EPEKPKP
        K  +M+L VDLQC RCY KV+KVL KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSIEIK                      P KP P
Subjt:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK---------------------EPEKPKP

Query:  PPQKQTDPPPARS---------------------KDSPPAAKPANPTKQKHPQTPAGK-----PAAALTPAPPQILTPVQSD-----------PVLGYPL
        PP K+ DPPP ++                     K  PP  K A+P   K    P  K     PA    P PP+ + PV              PV GYP 
Subjt:  PPQKQTDPPPARS---------------------KDSPPAAKPANPTKQKHPQTPAGK-----PAAALTPAPPQILTPVQSD-----------PVLGYPL

Query:  TYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
         YP G  C +CYEG G GPCY G+GRPGPCCDGCASGRPIY+S GGG PCYVS+ E+LNE+NA+GCSVM
Subjt:  TYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

A0A6J1IAN1 leucine-rich repeat extensin-like protein 3 isoform X47.3e-6053.18Show/hide
Query:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK-----EPEKPKPPPQKQTDPPPAR----
        K  +M+L VDLQC RCY KV+KVL KF QIRD+IYDEK N VIIKVVCCNPEKLRD ICCKG GVIKSIEIK      P+KP PPP K+ DPPPA+    
Subjt:  KDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIK-----EPEKPKPPPQKQTDPPPAR----

Query:  --------------------------------SKDSPPAAKPANPTKQKHPQTPAGK----PAAALTPAPPQ---ILTPVQSD--------PVLGYPLTY
                                         K  PP  K A+P   K    P  K    P A   P PP+    L P Q +        PV GYP  Y
Subjt:  --------------------------------SKDSPPAAKPANPTKQKHPQTPAGK----PAAALTPAPPQ---ILTPVQSD--------PVLGYPLTY

Query:  PFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM
        P G  C +CYEG G GPCY G+GRPGPCCDGCASGRPIY+S GGG PCYVS+ E+LNE+NA+GCSVM
Subjt:  PFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49420.1 Heavy metal transport/detoxification superfamily protein4.1e-1534.15Show/hide
Query:  TVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAAKPANPTKQ
        T  L   + ++KV+K LS   Q+RD+ ++E+ N V IKVVCC+PEK+ D +C KGRG IK IE  +P K      K+ + P         A KP  P K 
Subjt:  TVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAAKPANPTKQ

Query:  KHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLN-EKNAA
        K    PA  PA    P+  ++       P++G+P+ +               GP Y G+ RP    +     RPIYNS GG  P    Y    + E+   
Subjt:  KHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLN-EKNAA

Query:  GCSVM
         CS+M
Subjt:  GCSVM

AT1G51090.1 Heavy metal transport/detoxification superfamily protein9.1e-2338.32Show/hide
Query:  LQSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPP
        +  K  +M L VDL C +CY KV+K + KF QI D ++DEK N +IIKVVC +PE+L + +C KG G IKSI I EP K   PPQ Q  PP    K + P
Subjt:  LQSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPP

Query:  AAKPANPTKQKHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYS
        A  PA        Q P     A + PAP  +L  V S      P+  P+            CGP Y+              GRP+Y S GGG  C     
Subjt:  AAKPANPTKQKHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYS

Query:  EHLNEKNAAGCSVM
            + N+ GCS+M
Subjt:  EHLNEKNAAGCSVM

AT4G16380.1 Heavy metal transport/detoxification superfamily protein5.2e-3439.92Show/hide
Query:  QSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPA
        + K  +M L VDL C +CY KV+KVL KF QIRD+++DEK N VIIKVVCC+PE++ D +C KG G IK+IEI EP K   PPQ Q   PP + KD+ P 
Subjt:  QSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPA

Query:  A--------------------KPANPTKQKHPQ-----------------TPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGP
        A                    KP  P K K P+                  PA  PA A  PAP Q   P Q+ P++  P   P    C   Y+G G GP
Subjt:  A--------------------KPANPTKQKHPQ-----------------TPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGP

Query:  CYQGYGRPGPCCDGCASGRPIYNSCGGGGP--------CYVSYSEHLNEKNAAGCSVM
         + GYG P    +    GRP+Y S GGG P        C+V+  ++ +E+N   CS+M
Subjt:  CYQGYGRPGPCCDGCASGRPIYNSCGGGGP--------CYVSYSEHLNEKNAAGCSVM

AT4G16380.2 Heavy metal transport/detoxification superfamily protein2.2e-2437.72Show/hide
Query:  QIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAA--------------------KPANPTKQK
        +IRD+++DEK N VIIKVVCC+PE++ D +C KG G IK+IEI EP K   PPQ Q   PP + KD+ P A                    KP  P K K
Subjt:  QIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAA--------------------KPANPTKQK

Query:  HPQ-----------------TPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGP
         P+                  PA  PA A  PAP Q   P Q+ P++  P   P    C   Y+G G GP + GYG P    +    GRP+Y S GGG P
Subjt:  HPQ-----------------TPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGP

Query:  --------CYVSYSEHLNEKNAAGCSVM
                C+V+  ++ +E+N   CS+M
Subjt:  --------CYVSYSEHLNEKNAAGCSVM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TTGGTGTACTTACAGTCGAAGGATGCTATCATGGTGCTGACAGTGGATTTGCAGTGCCATCGATGCTATAATAAAGTCAGGAAAGTTCTCTCCAAATTCCATCAAATTCG
AGACCGGATTTATGATGAAAAATTGAACGCTGTGATTATCAAAGTGGTTTGTTGCAATCCTGAGAAGTTGAGAGATAATATTTGCTGCAAAGGGCGTGGGGTTATTAAGA
GCATTGAGATCAAAGAGCCTGAAAAACCCAAGCCCCCTCCGCAAAAACAGACCGATCCTCCGCCGGCGAGATCAAAAGACTCTCCTCCGGCAGCGAAACCCGCCAACCCT
ACGAAGCAAAAACACCCACAAACTCCGGCGGGGAAGCCAGCGGCTGCACTAACACCGGCTCCGCCCCAGATTCTCACTCCGGTCCAATCCGACCCGGTTCTCGGGTACCC
GCTGACGTACCCGTTTGGGACGGGCTGTAGACGGTGCTACGAAGGGATTGGTTGCGGCCCATGTTATCAAGGGTACGGTAGGCCCGGCCCATGTTGTGATGGCTGCGCTT
CTGGAAGGCCCATTTACAACAGTTGCGGCGGAGGGGGGCCCTGTTACGTCAGCTACTCTGAGCATCTTAATGAAAAAAATGCAGCTGGATGCAGTGTCATG
mRNA sequenceShow/hide mRNA sequence
TTGGTGTACTTACAGTCGAAGGATGCTATCATGGTGCTGACAGTGGATTTGCAGTGCCATCGATGCTATAATAAAGTCAGGAAAGTTCTCTCCAAATTCCATCAAATTCG
AGACCGGATTTATGATGAAAAATTGAACGCTGTGATTATCAAAGTGGTTTGTTGCAATCCTGAGAAGTTGAGAGATAATATTTGCTGCAAAGGGCGTGGGGTTATTAAGA
GCATTGAGATCAAAGAGCCTGAAAAACCCAAGCCCCCTCCGCAAAAACAGACCGATCCTCCGCCGGCGAGATCAAAAGACTCTCCTCCGGCAGCGAAACCCGCCAACCCT
ACGAAGCAAAAACACCCACAAACTCCGGCGGGGAAGCCAGCGGCTGCACTAACACCGGCTCCGCCCCAGATTCTCACTCCGGTCCAATCCGACCCGGTTCTCGGGTACCC
GCTGACGTACCCGTTTGGGACGGGCTGTAGACGGTGCTACGAAGGGATTGGTTGCGGCCCATGTTATCAAGGGTACGGTAGGCCCGGCCCATGTTGTGATGGCTGCGCTT
CTGGAAGGCCCATTTACAACAGTTGCGGCGGAGGGGGGCCCTGTTACGTCAGCTACTCTGAGCATCTTAATGAAAAAAATGCAGCTGGATGCAGTGTCATG
Protein sequenceShow/hide protein sequence
LVYLQSKDAIMVLTVDLQCHRCYNKVRKVLSKFHQIRDRIYDEKLNAVIIKVVCCNPEKLRDNICCKGRGVIKSIEIKEPEKPKPPPQKQTDPPPARSKDSPPAAKPANP
TKQKHPQTPAGKPAAALTPAPPQILTPVQSDPVLGYPLTYPFGTGCRRCYEGIGCGPCYQGYGRPGPCCDGCASGRPIYNSCGGGGPCYVSYSEHLNEKNAAGCSVM