CuGenDBv2

Moc03g28660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g28660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Integrase catalytic domain-containing protein
Genome location: chr3:20540551..20544438
RNA-Seq Expression: Moc03g28660
Synteny: Moc03g28660
Gene Ontology terms: NA
InterPro domains: NA
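
The genome location field follows a `chromosome:start..end` pattern. A minimal sketch of parsing it is given here, assuming the coordinates are 1-based and inclusive (the page does not state the convention). `Location` and `parse_location` are hypothetical names introduced for illustration only.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: 1-based, inclusive coordinates (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'chrom:start..end' string such as 'chr3:20540551..20544438'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr3:20540551..20544438")
print(loc.chrom, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption, the reported span length is `end - start + 1`; a half-open convention would drop the `+ 1`.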


Homology
GenBank top hits (e value, % identity, alignment)
CAA7044889.1 unnamed protein product [Microthlaspi erraticum] | e value: 1.1e-23 | % identity: 29.87
Query:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA
        V ++ N D  SN + DQFQ L+  +QSH+   +P                  S SG       SSS H    C         SL N +P +  W++DSGA
Subjt:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA

Query:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL
        ++H+CF   +F++LLP S ++V+LPN  R  +   G + ++  + L  VL++PSFRFNL+SVS+L  + + +  F +N CLIQ  +    IG+  L   L
Subjt:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL

Query:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC
        Y+L+   S                RLGHPS   +    D        D F   +LP     +        P   + + S  S  +PD SP    S     
Subjt:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC

Query:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSP
          S +   PI   TV    TG+S +                    ++ R  R+ + P YL +YHC+ +  ++  IP S+T +PL  F S++ LSP
Subjt:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSP

CAA7044893.1 unnamed protein product [Microthlaspi erraticum] | e value: 3.5e-25 | % identity: 29.22
Query:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA
        V ++ N D  S  + DQFQ L+  +QSH+   +P                  S SG       SSS H    C         SL N +P +  W++DSGA
Subjt:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA

Query:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL
        ++H+CF   +F++LLP S ++V+LPN  R  +  +G + ++  + L  VL++PSFRFNL+SVS+L  + + +  F +N CLIQ  +    IG+  L   L
Subjt:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL

Query:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC
        Y+L+   S                RLGHPS   +    D        D F   +LP     +        P   + + S  S  +PD SP    S     
Subjt:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC

Query:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSPTY
          S +   PI   TV    TG+S +                    ++ R  R+ + P YL +YHC+ +  ++  IP S+T +PL   +SY +L P +
Subjt:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSPTY

KAA0065480.1 Cysteine-rich RLK (receptor-like protein kinase) 8 [Cucumis melo var. makuwa] | e value: 6.0e-25 | % identity: 35.19
Query:  SSVASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYC-WVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSR
        +S A++ + D  S+ N++Q+  L+T L +HL +      + A++ +H +G  +L +    S+  W++DSGAS HIC  + +F +   T+ + V LPN  R
Subjt:  SSVASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYC-WVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSR

Query:  IHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADAS--------------------R
        I V+ IGDIQIN  +TLK VL++  F +NL+SVS L    ++++ F +  C+IQ  S    IGKA   NGLY+L  +A+                    R
Subjt:  IHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADAS--------------------R

Query:  LGHPSDKHMLALKDIL
        LGH S K + +L   L
Subjt:  LGHPSDKHMLALKDIL

KAA8530341.1 hypothetical protein F0562_005050 [Nyssa sinensis] | e value: 2.3e-24 | % identity: 30.53
Query:  LTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIP
        ++ L +HL +        ASS++H   + +              +GA+ HIC + ++FT++      +VTLPN++RI + F GDI++  ++ LK VLY+P
Subjt:  LTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIP

Query:  SFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADASRLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFSGQGPITS
         F+FNL+SV AL       + F  +  +IQ  +S KTIGK +    LY+L           D   L  + I  +++      H  L      S +  +  
Subjt:  SFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADASRLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFSGQGPITS

Query:  VGSSSGASPVIPDVSPDIGVSDGGTCVLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLLNAEPIP
        V   SG      DV P + VS   +           E   +D V          D+   P  + P +S+++ +R+S+R IKPP+YL DYHCSL++ + +P
Subjt:  VGSSSGASPVIPDVSPDIGVSDGGTCVLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLLNAEPIP

Query:  QSSTSHPLHHFISYISLSPTY
         S++S+PL  F+SY SLS ++
Subjt:  QSSTSHPLHHFISYISLSPTY

XP_022143573.1 uncharacterized protein LOC111013441 [Momordica charantia] | e value: 4.4e-36 | % identity: 34.67
Query:  ASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHV-AGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHV
        AS +N+ ++S+Y     Q L   LQS L++ K  +D+  ++S  V   T SL          +LD GAS+HIC  R +F  +   S + V LPN  R  V
Subjt:  ASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHV-AGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHV

Query:  EFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANL-DVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADA-----------------------S
        E+ G ++++ H+++  VLYIP F FNL+SV+ L  ++  ++V+F  +TC+IQ KS  KTI K  L +GLYLL   +                       +
Subjt:  EFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANL-DVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADA-----------------------S

Query:  RLGHPSDKHMLALKDILPI-----------------------DASQDPFPHLVLPKSFDFSGQGPITSVGSSSGASPVIPDVSPDIGVSDGGTCVLSPNG
        RLGHPS   + ALK +LP+                       D S +PFP LVLP   DF    P   +   + A   IP V P   +S   T  + P  
Subjt:  RLGHPSDKHMLALKDILPI-----------------------DASQDPFPHLVLPKSFDFSGQGPITSVGSSSGASPVIPDVSPDIGVSDGGTCVLSPNG

Query:  CAPIEASTVDVVNTGESSIVATDLP-SIPGANDPVS---SSLV----VSRRSSRSIKPPTYLKDYHCSLL-NAEPIPQSSTSHPLHHFISYISLSPTY
        CAP    + D   +  + +V+  +P + P  + P+S   SS+V    V RRS+R  K P+YL+D+HCSLL N+ P P +ST HPL  ++SY  LS  +
Subjt:  CAPIEASTVDVVNTGESSIVATDLP-SIPGANDPVS---SSLV----VSRRSSRSIKPPTYLKDYHCSLL-NAEPIPQSSTSHPLHHFISYISLSPTY
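
In the alignment blocks above, the unlabeled middle line follows the usual BLAST midline convention: a letter marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. A minimal sketch of deriving a percent identity from such a midline is given here. `midline_identity` is a hypothetical helper, and the toy midline is illustrative rather than taken from this page. The denominator is assumed to be the number of alignment columns, gaps included.

```python
def midline_identity(midline: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style midline.

    Letters mark identical residues, '+' marks conservative substitutions,
    and spaces mark mismatches or gaps. aligned_length is the number of
    alignment columns (gaps included); it is passed explicitly because
    trailing spaces in a midline are easily lost when text is copied.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / aligned_length


# Toy example: 5 identical residues over 7 alignment columns.
print(round(midline_identity("MK AY+A", 7), 2))
```

Summing identical residues and column counts across all blocks of one hit, then dividing once, gives a per-hit figure comparable in kind to the % identity column.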

TrEMBL top hits (e value, % identity, alignment)
A0A5A7VE66 Cysteine-rich RLK (Receptor-like protein kinase) 8 | e value: 2.9e-25 | % identity: 35.19
Query:  SSVASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYC-WVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSR
        +S A++ + D  S+ N++Q+  L+T L +HL +      + A++ +H +G  +L +    S+  W++DSGAS HIC  + +F +   T+ + V LPN  R
Subjt:  SSVASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYC-WVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSR

Query:  IHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADAS--------------------R
        I V+ IGDIQIN  +TLK VL++  F +NL+SVS L    ++++ F +  C+IQ  S    IGKA   NGLY+L  +A+                    R
Subjt:  IHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADAS--------------------R

Query:  LGHPSDKHMLALKDIL
        LGH S K + +L   L
Subjt:  LGHPSDKHMLALKDIL

A0A5J5AMM8 Uncharacterized protein | e value: 1.1e-24 | % identity: 30.53
Query:  LTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIP
        ++ L +HL +        ASS++H   + +              +GA+ HIC + ++FT++      +VTLPN++RI + F GDI++  ++ LK VLY+P
Subjt:  LTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIP

Query:  SFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADASRLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFSGQGPITS
         F+FNL+SV AL       + F  +  +IQ  +S KTIGK +    LY+L           D   L  + I  +++      H  L      S +  +  
Subjt:  SFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADASRLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFSGQGPITS

Query:  VGSSSGASPVIPDVSPDIGVSDGGTCVLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLLNAEPIP
        V   SG      DV P + VS   +           E   +D V          D+   P  + P +S+++ +R+S+R IKPP+YL DYHCSL++ + +P
Subjt:  VGSSSGASPVIPDVSPDIGVSDGGTCVLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLLNAEPIP

Query:  QSSTSHPLHHFISYISLSPTY
         S++S+PL  F+SY SLS ++
Subjt:  QSSTSHPLHHFISYISLSPTY

A0A6D2JM99 Uncharacterized protein | e value: 1.7e-25 | % identity: 29.22
Query:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA
        V ++ N D  S  + DQFQ L+  +QSH+   +P                  S SG       SSS H    C         SL N +P +  W++DSGA
Subjt:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA

Query:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL
        ++H+CF   +F++LLP S ++V+LPN  R  +  +G + ++  + L  VL++PSFRFNL+SVS+L  + + +  F +N CLIQ  +    IG+  L   L
Subjt:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL

Query:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC
        Y+L+   S                RLGHPS   +    D        D F   +LP     +        P   + + S  S  +PD SP    S     
Subjt:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC

Query:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSPTY
          S +   PI   TV    TG+S +                    ++ R  R+ + P YL +YHC+ +  ++  IP S+T +PL   +SY +L P +
Subjt:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSPTY

A0A6D2JXA3 Uncharacterized protein | e value: 5.5e-24 | % identity: 29.87
Query:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA
        V ++ N D  SN + DQFQ L+  +QSH+   +P                  S SG       SSS H    C         SL N +P +  W++DSGA
Subjt:  VASSSNADSLSNYNADQFQGLLTFLQSHLASMKPT-----------------SDSGA------SSSSHVAGTC---------SLVNQVPWSYCWVLDSGA

Query:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL
        ++H+CF   +F++LLP S ++V+LPN  R  +   G + ++  + L  VL++PSFRFNL+SVS+L  + + +  F +N CLIQ  +    IG+  L   L
Subjt:  SSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGL

Query:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC
        Y+L+   S                RLGHPS   +    D        D F   +LP     +        P   + + S  S  +PD SP    S     
Subjt:  YLLRADAS----------------RLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFS-----GQGPITSVGSSSGASPVIPDVSPDIGVSDGGTC

Query:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSP
          S +   PI   TV    TG+S +                    ++ R  R+ + P YL +YHC+ +  ++  IP S+T +PL  F S++ LSP
Subjt:  VLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLL--NAEPIPQSSTSHPLHHFISYISLSP

A0A6J1CR17 uncharacterized protein LOC111013441 | e value: 2.2e-36 | % identity: 34.67
Query:  ASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHV-AGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHV
        AS +N+ ++S+Y     Q L   LQS L++ K  +D+  ++S  V   T SL          +LD GAS+HIC  R +F  +   S + V LPN  R  V
Subjt:  ASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHV-AGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHV

Query:  EFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANL-DVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADA-----------------------S
        E+ G ++++ H+++  VLYIP F FNL+SV+ L  ++  ++V+F  +TC+IQ KS  KTI K  L +GLYLL   +                       +
Subjt:  EFIGDIQINQHITLKRVLYIPSFRFNLLSVSALAANL-DVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADA-----------------------S

Query:  RLGHPSDKHMLALKDILPI-----------------------DASQDPFPHLVLPKSFDFSGQGPITSVGSSSGASPVIPDVSPDIGVSDGGTCVLSPNG
        RLGHPS   + ALK +LP+                       D S +PFP LVLP   DF    P   +   + A   IP V P   +S   T  + P  
Subjt:  RLGHPSDKHMLALKDILPI-----------------------DASQDPFPHLVLPKSFDFSGQGPITSVGSSSGASPVIPDVSPDIGVSDGGTCVLSPNG

Query:  CAPIEASTVDVVNTGESSIVATDLP-SIPGANDPVS---SSLV----VSRRSSRSIKPPTYLKDYHCSLL-NAEPIPQSSTSHPLHHFISYISLSPTY
        CAP    + D   +  + +V+  +P + P  + P+S   SS+V    V RRS+R  K P+YL+D+HCSLL N+ P P +ST HPL  ++SY  LS  +
Subjt:  CAPIEASTVDVVNTGESSIVATDLP-SIPGANDPVS---SSLV----VSRRSSRSIKPPTYLKDYHCSLL-NAEPIPQSSTSHPLHHFISYISLSPTY

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGCCATCTTCAGTTGCTTCTTCGTCCAATGCTGATTCGTTGAGTAATTACAATGCTGATCAGTTCCAAGGGCTTTTGACTTTTCTGCAGTCTCATTTGGCATCCATGAA
GCCAACATCCGATTCTGGTGCCTCTTCTTCCAGTCATGTGGCAGGTACTTGCTCCCTTGTTAATCAAGTTCCTTGGTCTTATTGTTGGGTTCTTGACTCAGGGGCATCAT
CGCATATCTGTTTTTCTCGTGATATGTTTACTTCTCTGCTGCCTACATCTGGTATTTCAGTCACTTTGCCAAATAATTCTCGGATTCATGTTGAGTTTATTGGAGATATT
CAGATTAATCAACACATTACTCTTAAGAGAGTGTTGTATATACCTTCTTTTCGGTTCAACCTGCTCTCTGTGAGTGCCTTGGCTGCTAACTTGGATGTTAATGTTCAGTT
TAATGCTAATACATGTCTCATCCAGGCCAAGTCCTCTTTGAAGACGATTGGCAAGGCTGAGCTCTGGAATGGTTTATATCTTCTGCGTGCAGATGCTTCTCGACTTGGCC
ACCCTTCTGATAAACACATGTTAGCATTAAAGGATATCTTACCTATTGATGCTTCTCAGGACCCTTTTCCTCACTTGGTTCTTCCCAAGTCGTTTGATTTTTCTGGGCAA
GGTCCTATAACCTCGGTTGGTTCTTCATCTGGTGCATCTCCTGTTATCCCTGATGTTTCTCCTGATATTGGTGTTTCTGATGGTGGTACTTGTGTTTTATCACCGAATGG
TTGTGCTCCAATCGAGGCCTCCACTGTTGATGTTGTTAATACAGGAGAATCGTCTATTGTAGCTACTGATTTGCCTTCCATCCCTGGTGCTAATGACCCGGTTTCTTCTA
GTTTAGTTGTGTCTCGTCGGTCTTCAAGATCTATCAAGCCGCCAACATACTTGAAAGACTATCATTGTAGTCTTCTTAATGCTGAGCCTATACCACAGTCTTCTACTAGC
CATCCTCTACACCATTTTATTTCATATATTAGCCTTTCTCCTACCTATAGCAAAATCCTGGAGCTGTTCGGTGGTCGGAATTGGTATACCAATTCCGATGGCCACCACGA
AATCAATGTCATCGATGAAGCTGGGTCGTCATCGATGCAAGTGTATGTGAACGAGGATCAGAATCAGAATCAGCTTACCGTTGTTAACGAAAAAACGTCAAAACACAAGA
TTGATCTCCGCTGGGCTAAATCTAAATCTCATGGCTAA
mRNA sequence
ATGCCATCTTCAGTTGCTTCTTCGTCCAATGCTGATTCGTTGAGTAATTACAATGCTGATCAGTTCCAAGGGCTTTTGACTTTTCTGCAGTCTCATTTGGCATCCATGAA
GCCAACATCCGATTCTGGTGCCTCTTCTTCCAGTCATGTGGCAGGTACTTGCTCCCTTGTTAATCAAGTTCCTTGGTCTTATTGTTGGGTTCTTGACTCAGGGGCATCAT
CGCATATCTGTTTTTCTCGTGATATGTTTACTTCTCTGCTGCCTACATCTGGTATTTCAGTCACTTTGCCAAATAATTCTCGGATTCATGTTGAGTTTATTGGAGATATT
CAGATTAATCAACACATTACTCTTAAGAGAGTGTTGTATATACCTTCTTTTCGGTTCAACCTGCTCTCTGTGAGTGCCTTGGCTGCTAACTTGGATGTTAATGTTCAGTT
TAATGCTAATACATGTCTCATCCAGGCCAAGTCCTCTTTGAAGACGATTGGCAAGGCTGAGCTCTGGAATGGTTTATATCTTCTGCGTGCAGATGCTTCTCGACTTGGCC
ACCCTTCTGATAAACACATGTTAGCATTAAAGGATATCTTACCTATTGATGCTTCTCAGGACCCTTTTCCTCACTTGGTTCTTCCCAAGTCGTTTGATTTTTCTGGGCAA
GGTCCTATAACCTCGGTTGGTTCTTCATCTGGTGCATCTCCTGTTATCCCTGATGTTTCTCCTGATATTGGTGTTTCTGATGGTGGTACTTGTGTTTTATCACCGAATGG
TTGTGCTCCAATCGAGGCCTCCACTGTTGATGTTGTTAATACAGGAGAATCGTCTATTGTAGCTACTGATTTGCCTTCCATCCCTGGTGCTAATGACCCGGTTTCTTCTA
GTTTAGTTGTGTCTCGTCGGTCTTCAAGATCTATCAAGCCGCCAACATACTTGAAAGACTATCATTGTAGTCTTCTTAATGCTGAGCCTATACCACAGTCTTCTACTAGC
CATCCTCTACACCATTTTATTTCATATATTAGCCTTTCTCCTACCTATAGCAAAATCCTGGAGCTGTTCGGTGGTCGGAATTGGTATACCAATTCCGATGGCCACCACGA
AATCAATGTCATCGATGAAGCTGGGTCGTCATCGATGCAAGTGTATGTGAACGAGGATCAGAATCAGAATCAGCTTACCGTTGTTAACGAAAAAACGTCAAAACACAAGA
TTGATCTCCGCTGGGCTAAATCTAAATCTCATGGCTAA
Protein sequence
MPSSVASSSNADSLSNYNADQFQGLLTFLQSHLASMKPTSDSGASSSSHVAGTCSLVNQVPWSYCWVLDSGASSHICFSRDMFTSLLPTSGISVTLPNNSRIHVEFIGDI
QINQHITLKRVLYIPSFRFNLLSVSALAANLDVNVQFNANTCLIQAKSSLKTIGKAELWNGLYLLRADASRLGHPSDKHMLALKDILPIDASQDPFPHLVLPKSFDFSGQ
GPITSVGSSSGASPVIPDVSPDIGVSDGGTCVLSPNGCAPIEASTVDVVNTGESSIVATDLPSIPGANDPVSSSLVVSRRSSRSIKPPTYLKDYHCSLLNAEPIPQSSTS
HPLHHFISYISLSPTYSKILELFGGRNWYTNSDGHHEINVIDEAGSSSMQVYVNEDQNQNQLTVVNEKTSKHKIDLRWAKSKSHG