; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g05320 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g05320
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr3:3951305..3952642
RNA-Seq ExpressionMoc03g05320
SyntenyMoc03g05320
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]5.5e-8770.3Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK
        MNRN QDPPPPQNPPVNGDMAGE AAN+AGEIPN ILL DNRDVA+RNYVT AFHNLNSGINNLLP+A Q ELKPVMF MLQTM QFGGLTNEDPYSHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEKSV
        SFIEIANAFQLPGVSE+ALRLK+                                                      GLD SSRMMLNTAAN SLLEKSV
Subjt:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEKSV

Query:  NEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTS
        NEIVDILNKM DINDQGE GRSL KKQVSAGIFELD VA MQAQM  MNQMLKQ TMEKETK VTS
Subjt:  NEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTS

XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia]8.7e-8544.94Show/hide
Query:  IPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDG
        +PNPI + D +D A+R+Y      +LNS + N LP   QFE KP+M QML  + QFGGL +EDP SHLKSFI++AN  +LPG+S+ ALRL +FPFSL   
Subjt:  IPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDG

Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------------IEQFYRGLDHSSRMMLNTAANDSLL
        A  WLNA    +I TW+++ +KFL KY   T+N D+RE+I                                   IE F+RG D  ++MMLN AAN    
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------------IEQFYRGLDHSSRMMLNTAANDSLL

Query:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENC
         KS NEIV+IL+++++ NDQ   E  R+  K+   AG+  LD + SMQ Q+  + QMLK +           A   PSP  QI++ +  YC D H  ENC
Subjt:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENC

Query:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPA--SQPQQYNQLRGQNTTQQSGRNTS-LEAMMK
        P+NP+S++YVGQ  Q+ FNPYSNTY+PGW+ HPNFSWS QG +S + Q   QQYKQ YT  GFP  PA    P QYNQ +  N  Q   +N S +E +MK
Subjt:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPA--SQPQQYNQLRGQNTTQQSGRNTS-LEAMMK

Query:  DFMTR
        +F+T+
Subjt:  DFMTR

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]3.2e-13568.23Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK
        MN NPQDPP P NPPV+GD AGE AAN+AGE+PNPILL DNRDVAVRNYVTHAFHNLNS + +  P                         NEDPYSHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEHALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEK
        SFIEIANAFQL GVSE ALRLKM      D    R   N         + EL  + L+  H L   +      IEQFYRGLD  SRMMLNTAANDSL EK
Subjt:  SFIEIANAFQLPGVSEHALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEK

Query:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENCPANP
        S++EI+DILNKMTD NDQGEIGRSLPKKQVSA +FELD VASMQAQM  +NQMLKQLTMEKETK  TSA+ EPS  LQISDIS VYC DN LYENCPANP
Subjt:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENCPANP

Query:  ASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPASQPQQYNQLRGQNTTQQSGRNTSLEAMMKDFMTRT-
         S+FYVGQ AQRNFNPYSNTY+P WR+HPNFSWSNQGVASSSAQ P QQYKQNYT   FPTQPASQPQQYNQ R QNTTQQ G N SLEAM K+FMTR+ 
Subjt:  ASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPASQPQQYNQLRGQNTTQQSGRNTSLEAMMKDFMTRT-

Query:  VTTQNF
         TT+ F
Subjt:  VTTQNF

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]1.7e-15777.86Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK
        MNRN QDPPPPQNPPVNGDMAGEEAAN+ GEIPN ILL DNRDVA+RNYVTHAFHNLNSGINN LP+A QFELKPVMFQ+LQTM QFGGLTNEDPYSHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------
        SFIEIANAFQLPG SE ALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLTKN DLREDI                             
Subjt:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------

Query:  ------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVT
              IEQFYRGLD SS+MMLNT AN SLLEKSVNEIVD+LNKMTDINDQGE+GRSLPKKQVS GIFELD VASMQAQM  MNQMLKQLTMEKETK VT
Subjt:  ------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVT

Query:  SAIPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYK
        SAIPE SP LQISDIS VYC                   GQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAP QQYK
Subjt:  SAIPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYK

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]1.0e-10170.63Show/hide
Query:  MSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI------------
        M+QFGG TNEDPYSHLKSFI+IANAFQLPGVSE ALRLKMFPFSLRDGA TW+N LE N I TWAELT+KFLAKYHTLT+N DL+EDI            
Subjt:  MSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI------------

Query:  -----------------------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVM
                               I+QFYRGLDH  RMM +TAAN SLLEKSVNEI+DILNKM DINDQ E+GRSLPKKQ SAGIFELD V S+QAQ+  M
Subjt:  -----------------------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVM

Query:  NQMLKQLTMEKETKIVTSA-IPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN
        +QMLKQLTM+K  K  TS  I EPS  LQISDIS VYC DNHLYENC ANPA IFYVGQG QRNFNPYSNTYNPGWR HPNFS SN
Subjt:  NQMLKQLTMEKETKIVTSA-IPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN

TrEMBL top hitse value%identityAlignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220072.6e-8770.3Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK
        MNRN QDPPPPQNPPVNGDMAGE AAN+AGEIPN ILL DNRDVA+RNYVT AFHNLNSGINNLLP+A Q ELKPVMF MLQTM QFGGLTNEDPYSHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEKSV
        SFIEIANAFQLPGVSE+ALRLK+                                                      GLD SSRMMLNTAAN SLLEKSV
Subjt:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEKSV

Query:  NEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTS
        NEIVDILNKM DINDQGE GRSL KKQVSAGIFELD VA MQAQM  MNQMLKQ TMEKETK VTS
Subjt:  NEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTS

A0A6J1DSZ5 uncharacterized protein LOC1110241074.2e-8544.94Show/hide
Query:  IPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDG
        +PNPI + D +D A+R+Y      +LNS + N LP   QFE KP+M QML  + QFGGL +EDP SHLKSFI++AN  +LPG+S+ ALRL +FPFSL   
Subjt:  IPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDG

Query:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------------IEQFYRGLDHSSRMMLNTAANDSLL
        A  WLNA    +I TW+++ +KFL KY   T+N D+RE+I                                   IE F+RG D  ++MMLN AAN    
Subjt:  ARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------------IEQFYRGLDHSSRMMLNTAANDSLL

Query:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENC
         KS NEIV+IL+++++ NDQ   E  R+  K+   AG+  LD + SMQ Q+  + QMLK +           A   PSP  QI++ +  YC D H  ENC
Subjt:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENC

Query:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPA--SQPQQYNQLRGQNTTQQSGRNTS-LEAMMK
        P+NP+S++YVGQ  Q+ FNPYSNTY+PGW+ HPNFSWS QG +S + Q   QQYKQ YT  GFP  PA    P QYNQ +  N  Q   +N S +E +MK
Subjt:  PANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPA--SQPQQYNQLRGQNTTQQSGRNTS-LEAMMK

Query:  DFMTR
        +F+T+
Subjt:  DFMTR

A0A6J1DYY9 uncharacterized protein LOC1110255573.8e-10270.98Show/hide
Query:  MSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI------------
        M+QFGG TNEDPYSHLKSFI+IANAFQLPGVSE ALRLKMFPFSLRDGA TWLN LE N I TWAELT+KFLAKYHTLT+N DL+EDI            
Subjt:  MSQFGGLTNEDPYSHLKSFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI------------

Query:  -----------------------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVM
                               I+QFYRGLDH  RMM +TAAN SLLEKSVNEI+DILNKM DINDQ E+GRSLPKKQ SAGIFELD V S+QAQ+  M
Subjt:  -----------------------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVM

Query:  NQMLKQLTMEKETKIVTSA-IPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN
        +QMLKQLTM+K  K  TS  I EPS  LQISDIS VYC DNHLYENC ANPA IFYVGQG QRNFNPYSNTYNPGWR HPNFS SN
Subjt:  NQMLKQLTMEKETKIVTSA-IPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN

A0A6J1DZ19 uncharacterized protein LOC1110248241.5e-13568.23Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK
        MN NPQDPP P NPPV+GD AGE AAN+AGE+PNPILL DNRDVAVRNYVTHAFHNLNS + +  P                         NEDPYSHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEHALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEK
        SFIEIANAFQL GVSE ALRLKM      D    R   N         + EL  + L+  H L   +      IEQFYRGLD  SRMMLNTAANDSL EK
Subjt:  SFIEIANAFQLPGVSEHALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEK

Query:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENCPANP
        S++EI+DILNKMTD NDQGEIGRSLPKKQVSA +FELD VASMQAQM  +NQMLKQLTMEKETK  TSA+ EPS  LQISDIS VYC DN LYENCPANP
Subjt:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENCPANP

Query:  ASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPASQPQQYNQLRGQNTTQQSGRNTSLEAMMKDFMTRT-
         S+FYVGQ AQRNFNPYSNTY+P WR+HPNFSWSNQGVASSSAQ P QQYKQNYT   FPTQPASQPQQYNQ R QNTTQQ G N SLEAM K+FMTR+ 
Subjt:  ASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYKQNYTSLGFPTQPASQPQQYNQLRGQNTTQQSGRNTSLEAMMKDFMTRT-

Query:  VTTQNF
         TT+ F
Subjt:  VTTQNF

A0A6J1E251 uncharacterized protein LOC1110253028.3e-15877.86Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK
        MNRN QDPPPPQNPPVNGDMAGEEAAN+ GEIPN ILL DNRDVA+RNYVTHAFHNLNSGINN LP+A QFELKPVMFQ+LQTM QFGGLTNEDPYSHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLK

Query:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------
        SFIEIANAFQLPG SE ALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLTKN DLREDI                             
Subjt:  SFIEIANAFQLPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDI-----------------------------

Query:  ------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVT
              IEQFYRGLD SS+MMLNT AN SLLEKSVNEIVD+LNKMTDINDQGE+GRSLPKKQVS GIFELD VASMQAQM  MNQMLKQLTMEKETK VT
Subjt:  ------IEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVT

Query:  SAIPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYK
        SAIPE SP LQISDIS VYC                   GQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAP QQYK
Subjt:  SAIPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVASSSAQAPVQQYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAGAAATCCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGTGAAGAAGCAGCAAACCAAGCAGGAGAAATTCCTAATCCGATCCT
TCTGGTAGATAATCGAGATGTAGCCGTGCGGAATTATGTCACCCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCTTTTACCCGAAGCCACACAGTTCGAGCTCA
AGCCAGTCATGTTCCAGATGTTACAGACGATGAGCCAGTTCGGAGGATTAACTAACGAAGATCCTTACTCCCATCTCAAATCCTTTATCGAAATAGCTAATGCATTTCAA
CTTCCTGGTGTTTCTGAGCATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCTATCAACACATG
GGCGGAGCTGACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAAGAACATAGACCTTCGAGAGGACATTATTGAACAATTCTATAGAGGATTGGATCATTCATCAA
GGATGATGTTGAACACTGCAGCCAATGACTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGA
AGGTCATTACCAAAGAAGCAAGTATCAGCTGGAATCTTTGAGTTAGACATAGTAGCTTCAATGCAAGCCCAAATGGTGGTTATGAACCAGATGTTAAAGCAGTTGACAAT
GGAGAAGGAAACCAAAATCGTCACTTCGGCGATACCTGAACCCTCTCCTAATTTACAAATTTCAGATATATCTCGTGTCTATTGTAGTGATAACCACTTGTATGAGAATT
GCCCAGCTAATCCAGCATCTATTTTCTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACTTACAACCCTGGATGGAGGCACCATCCAAACTTTTCC
TGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCACCCGTTCAACAATACAAGCAAAACTACACTTCTCTTGGTTTTCCAACTCAACCGGCGTCGCAGCCTCAACA
ATACAATCAGCTAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGAAACACAAGTTTGGAGGCCATGATGAAAGATTTCATGACAAGAACTGTAACGACCCAGAATTTCC
CCCACACTTCTAGGTCACATTAA
mRNA sequenceShow/hide mRNA sequence
ATGAATAGAAATCCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGTGAAGAAGCAGCAAACCAAGCAGGAGAAATTCCTAATCCGATCCT
TCTGGTAGATAATCGAGATGTAGCCGTGCGGAATTATGTCACCCATGCGTTCCACAACCTAAATTCAGGGATAAATAATCTTTTACCCGAAGCCACACAGTTCGAGCTCA
AGCCAGTCATGTTCCAGATGTTACAGACGATGAGCCAGTTCGGAGGATTAACTAACGAAGATCCTTACTCCCATCTCAAATCCTTTATCGAAATAGCTAATGCATTTCAA
CTTCCTGGTGTTTCTGAGCATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTCAGGGATGGTGCAAGGACTTGGCTAAACGCGTTAGAACCAAATTCTATCAACACATG
GGCGGAGCTGACGGAGAAATTTTTGGCAAAGTACCACACTTTGACCAAGAACATAGACCTTCGAGAGGACATTATTGAACAATTCTATAGAGGATTGGATCATTCATCAA
GGATGATGTTGAACACTGCAGCCAATGACTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATAAGATGACAGACATTAATGACCAAGGCGAAATAGGA
AGGTCATTACCAAAGAAGCAAGTATCAGCTGGAATCTTTGAGTTAGACATAGTAGCTTCAATGCAAGCCCAAATGGTGGTTATGAACCAGATGTTAAAGCAGTTGACAAT
GGAGAAGGAAACCAAAATCGTCACTTCGGCGATACCTGAACCCTCTCCTAATTTACAAATTTCAGATATATCTCGTGTCTATTGTAGTGATAACCACTTGTATGAGAATT
GCCCAGCTAATCCAGCATCTATTTTCTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACTTACAACCCTGGATGGAGGCACCATCCAAACTTTTCC
TGGAGTAACCAAGGAGTAGCTAGTAGCAGTGCACAAGCACCCGTTCAACAATACAAGCAAAACTACACTTCTCTTGGTTTTCCAACTCAACCGGCGTCGCAGCCTCAACA
ATACAATCAGCTAAGAGGTCAAAATACTACTCAGCAAAGTGGTAGAAACACAAGTTTGGAGGCCATGATGAAAGATTTCATGACAAGAACTGTAACGACCCAGAATTTCC
CCCACACTTCTAGGTCACATTAA
Protein sequenceShow/hide protein sequence
MNRNPQDPPPPQNPPVNGDMAGEEAANQAGEIPNPILLVDNRDVAVRNYVTHAFHNLNSGINNLLPEATQFELKPVMFQMLQTMSQFGGLTNEDPYSHLKSFIEIANAFQ
LPGVSEHALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTKNIDLREDIIEQFYRGLDHSSRMMLNTAANDSLLEKSVNEIVDILNKMTDINDQGEIG
RSLPKKQVSAGIFELDIVASMQAQMVVMNQMLKQLTMEKETKIVTSAIPEPSPNLQISDISRVYCSDNHLYENCPANPASIFYVGQGAQRNFNPYSNTYNPGWRHHPNFS
WSNQGVASSSAQAPVQQYKQNYTSLGFPTQPASQPQQYNQLRGQNTTQQSGRNTSLEAMMKDFMTRTVTTQNFPHTSRSH