; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g14780 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g14780
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionLOW QUALITY PROTEIN: uncharacterized protein LOC111022007
Genome locationchr4:11288299..11292801
RNA-Seq ExpressionMoc04g14780
SyntenyMoc04g14780
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]1.6e-8568.8Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK
        MNRN QDPPPPQNPPVNGDMAGE AANRAGEIPN ILL DN+DVAMR YVT AFHNLNSGINN LPQAAQ ELKPVMF MLQTMGQFGGLTNEDP SHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK

Query:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEKSV
         FIEIANAFQLPGVSE+ALRLK+                                                      GLD SSRMMLNT A+GSLLEKSV
Subjt:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEKSV

Query:  NEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTIS
        NEIVDILNKM DINDQGE GRSL KKQVS G+FELDTVA MQAQ+A MNQMLKQ TME ETKT  S
Subjt:  NEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTIS

XP_022157400.1 uncharacterized protein LOC111024107 [Momordica charantia]6.4e-8246.48Show/hide
Query:  IPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDG
        +PN I + D +D AMR Y      +LNS + NPLP  AQFE KP+M QML  + QFGGL +EDP SHLK FI++AN  +LPG+S+DALRL +FPFSL   
Subjt:  IPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDG

Query:  ARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------------IEQFYRGLDHSSRMMLNTVAHGSLL
        A  WL +    +I TW+++ +KFLVKY   TRNAD+RE+I                                   IE F+RG D  ++MMLN  A+G   
Subjt:  ARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------------IEQFYRGLDHSSRMMLNTVAHGSLL

Query:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENC
         KS NEIV+IL+++++ NDQ   E  R+  K+    GV  LD + SMQ QI T+ QMLK +   N    +  A   PSP+ QI++ +C YCGD H  ENC
Subjt:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENC

Query:  PTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK
        P+NP  ++YVGQ  Q+ FNPYSNTY+PGW+ HPNFSWS QG +  + Q   QQYK
Subjt:  PTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]3.0e-11657.11Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSH
        MN NPQDPP P NPPV+GD AGE AANRAGE+PN ILL DN+DVA+R YVTHAFHNLNS +  + P+ +A                       NEDP SH
Subjt:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSH

Query:  LKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEK
        LK FIEIANAFQL GVSEDALRLKM         R  + S +        E  E+F         +       IEQFYRGLD  SRMMLNT A+ SL EK
Subjt:  LKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEK

Query:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENCPTNP
        S++EI+DILNKMTD NDQGEIGRSLPKKQVS  VFELDTVASMQAQ+AT+NQMLKQLTME ETKT  SA+ EPS  LQISDISCVYCGDN LYENCP NP
Subjt:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENCPTNP

Query:  VFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK-------------------------------------------------
          +FYVGQ AQRNFNPYSNTY+P WR+HPNFSWSNQGVA SSAQ PAQQYK                                                 
Subjt:  VFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK-------------------------------------------------

Query:  ---------TDAAIRSLEMQMGHIANDQKSRPQGNL
                 TD  IR LEMQ+G IAND+KSRPQG L
Subjt:  ---------TDAAIRSLEMQMGHIANDQKSRPQGNL

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]3.4e-16077.34Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK
        MNRN QDPPPPQNPPVNGDMAGE AANR GEIPNLILL DN+DVAMR YVTHAFHNLNSGINNPLPQAAQFELKPVMFQ+LQTMGQFGGLTNEDP SHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK

Query:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------
         FIEIANAFQLPG SEDALRLKMFPFSL+DGARTW+ +L+PNSINTWAELT+KFL KYHTLT+NADLREDI                             
Subjt:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------

Query:  ------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTI
              IEQFYRGLD SS+MMLNT+A+GSLLEKSVNEIVD+LNKMTDINDQGE+GRSLPKKQVS G+FELDTVASMQAQ+A MNQMLKQLTME ETKT  
Subjt:  ------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTI

Query:  SAIPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK
        SAIPE SPILQISDISCVYC                   GQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVA SSAQAPAQQYK
Subjt:  SAIPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]8.0e-10168.53Show/hide
Query:  MGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI------------
        M QFGG TNEDP SHLK FI+IANAFQLPGVSEDALRLKMFPFSL+DGA TW+  L+ N I TWAELT+KFL KYHTLTRNADL+EDI            
Subjt:  MGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI------------

Query:  -----------------------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATM
                               I+QFYRGLDH  RMM +T A+ SLLEKSVNEI+DILNKM DINDQ E+GRSLPKKQ S G+FELDTV S+QAQI+ M
Subjt:  -----------------------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATM

Query:  NQMLKQLTMENETKTTISA-IPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN
        +QMLKQLTM+   K   S  I EPS ILQISDISCVYC DNHLYENC  NP FIFYVGQG QRNFNPYSNTYNPGWR HPNFS SN
Subjt:  NQMLKQLTMENETKTTISA-IPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN

TrEMBL top hitse value%identityAlignment
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC1110220077.9e-8668.8Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK
        MNRN QDPPPPQNPPVNGDMAGE AANRAGEIPN ILL DN+DVAMR YVT AFHNLNSGINN LPQAAQ ELKPVMF MLQTMGQFGGLTNEDP SHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK

Query:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEKSV
         FIEIANAFQLPGVSE+ALRLK+                                                      GLD SSRMMLNT A+GSLLEKSV
Subjt:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEKSV

Query:  NEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTIS
        NEIVDILNKM DINDQGE GRSL KKQVS G+FELDTVA MQAQ+A MNQMLKQ TME ETKT  S
Subjt:  NEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTIS

A0A6J1DSZ5 uncharacterized protein LOC1110241073.1e-8246.48Show/hide
Query:  IPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDG
        +PN I + D +D AMR Y      +LNS + NPLP  AQFE KP+M QML  + QFGGL +EDP SHLK FI++AN  +LPG+S+DALRL +FPFSL   
Subjt:  IPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDG

Query:  ARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------------IEQFYRGLDHSSRMMLNTVAHGSLL
        A  WL +    +I TW+++ +KFLVKY   TRNAD+RE+I                                   IE F+RG D  ++MMLN  A+G   
Subjt:  ARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------------IEQFYRGLDHSSRMMLNTVAHGSLL

Query:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENC
         KS NEIV+IL+++++ NDQ   E  R+  K+    GV  LD + SMQ QI T+ QMLK +   N    +  A   PSP+ QI++ +C YCGD H  ENC
Subjt:  EKSVNEIVDILNKMTDINDQ--GEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENC

Query:  PTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK
        P+NP  ++YVGQ  Q+ FNPYSNTY+PGW+ HPNFSWS QG +  + Q   QQYK
Subjt:  PTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK

A0A6J1DYY9 uncharacterized protein LOC1110255573.0e-10168.88Show/hide
Query:  MGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI------------
        M QFGG TNEDP SHLK FI+IANAFQLPGVSEDALRLKMFPFSL+DGA TWL  L+ N I TWAELT+KFL KYHTLTRNADL+EDI            
Subjt:  MGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI------------

Query:  -----------------------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATM
                               I+QFYRGLDH  RMM +T A+ SLLEKSVNEI+DILNKM DINDQ E+GRSLPKKQ S G+FELDTV S+QAQI+ M
Subjt:  -----------------------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATM

Query:  NQMLKQLTMENETKTTISA-IPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN
        +QMLKQLTM+   K   S  I EPS ILQISDISCVYC DNHLYENC  NP FIFYVGQG QRNFNPYSNTYNPGWR HPNFS SN
Subjt:  NQMLKQLTMENETKTTISA-IPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSN

A0A6J1DZ19 uncharacterized protein LOC1110248241.5e-11657.11Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSH
        MN NPQDPP P NPPV+GD AGE AANRAGE+PN ILL DN+DVA+R YVTHAFHNLNS +  + P+ +A                       NEDP SH
Subjt:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGI--NNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSH

Query:  LKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEK
        LK FIEIANAFQL GVSEDALRLKM         R  + S +        E  E+F         +       IEQFYRGLD  SRMMLNT A+ SL EK
Subjt:  LKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEK

Query:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENCPTNP
        S++EI+DILNKMTD NDQGEIGRSLPKKQVS  VFELDTVASMQAQ+AT+NQMLKQLTME ETKT  SA+ EPS  LQISDISCVYCGDN LYENCP NP
Subjt:  SVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENCPTNP

Query:  VFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK-------------------------------------------------
          +FYVGQ AQRNFNPYSNTY+P WR+HPNFSWSNQGVA SSAQ PAQQYK                                                 
Subjt:  VFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK-------------------------------------------------

Query:  ---------TDAAIRSLEMQMGHIANDQKSRPQGNL
                 TD  IR LEMQ+G IAND+KSRPQG L
Subjt:  ---------TDAAIRSLEMQMGHIANDQKSRPQGNL

A0A6J1E251 uncharacterized protein LOC1110253021.7e-16077.34Show/hide
Query:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK
        MNRN QDPPPPQNPPVNGDMAGE AANR GEIPNLILL DN+DVAMR YVTHAFHNLNSGINNPLPQAAQFELKPVMFQ+LQTMGQFGGLTNEDP SHLK
Subjt:  MNRNPQDPPPPQNPPVNGDMAGERAANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLK

Query:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------
         FIEIANAFQLPG SEDALRLKMFPFSL+DGARTW+ +L+PNSINTWAELT+KFL KYHTLT+NADLREDI                             
Subjt:  FFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTWLTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDI-----------------------------

Query:  ------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTI
              IEQFYRGLD SS+MMLNT+A+GSLLEKSVNEIVD+LNKMTDINDQGE+GRSLPKKQVS G+FELDTVASMQAQ+A MNQMLKQLTME ETKT  
Subjt:  ------IEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQIATMNQMLKQLTMENETKTTI

Query:  SAIPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK
        SAIPE SPILQISDISCVYC                   GQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVA SSAQAPAQQYK
Subjt:  SAIPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGCACGCTGGAGCCTCCGTGAAATCGCCGCCGCTGCTACAGAGCTGCTGGAGTCACCGTGAAATTAACGCCATCACTGTGTCGCCAGAGAAACACGAGTCCCTACT
GTCTGCTGCCACAACTCGAAGGCCGCCGTTCGTGGTCCTTGGACGCCGAGGGGACGTTGGTGATGGAATCGCGAAGGCTACGCCGCTGCTGTTTAACCAACCGAATGGCA
AGGATTTAAGAAGCTATCAAAGAGATACAAAGGGGCTTAAGAGGATTGTTGAGGCAAATACAAAGCTTAGTGCAAAATTAGATAATGTTACCCTAGCCTCTGCAACAACC
AAAAGCGGTAGTGAAAGAGTAGAAGTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAACATAGTGTGTTTTCCAGTTTTGCATCAAACCTGAGGGA
GTTGTGTATGAGCACAGGGCCTGGTTGGTTGACCTTTGATCTAAAGATTGAAAGAACCCTTAAGAGGAGAAGGCGTGTGCAGAGGTTGAGAAAAGAGAAAGAAAGTAGAA
AAGATAAAGAAGTTGAAGAAGAAGAGACCATCGAGATGAATAGAAATCCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGTGAAAGAGCA
GCAAACCGAGCAGGAGAAATTCCTAATCTGATCCTTCTAGAAGATAATCAAGATGTAGCCATGCGTTATTATGTCACTCATGCATTCCACAACCTAAATTCAGGGATAAA
TAATCCTTTACCCCAAGCCGCACAGTTCGAGCTCAAGCCAGTCATGTTCCAAATGTTACAGACGATGGGCCAGTTCGGAGGATTGACTAACGAAGATCCTTGCTCCCATC
TCAAATTCTTTATTGAAATAGCTAATGCATTTCAACTTCCTGGTGTCTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTGAAGGATGGTGCAAGGACTTGG
CTAACCTCGTTAAAACCAAATTCTATCAACACATGGGCGGAATTAACAGAGAAATTTTTGGTAAAGTACCACACTTTGACCAGGAATGCAGACCTTCGAGAGGACATTAT
TGAACAATTCTATAGAGGATTGGATCATTCGTCAAGGATGATGTTGAACACTGTAGCCCATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATA
AGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGTCGGAGTCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATA
GCGACTATGAACCAAATGTTAAAGCAATTGACAATGGAGAACGAAACCAAAACCACAATTTCGGCGATACCTGAACCTTCTCCTATTTTACAAATTTCAGATATATCTTG
TGTCTATTGTGGTGATAACCACTTGTATGAGAACTGTCCAACTAATCCAGTATTTATTTTCTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACTT
ACAACCCTGGATGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTGGTAGCAGTGCACAAGCACCCGCTCAACAATATAAAACCGATGCTGCGATAAGA
AGCTTGGAGATGCAAATGGGGCACATAGCTAATGATCAAAAATCTAGACCCCAAGGTAACCTTCAATGTCCTCGATGCGAGCGTCTCCCGGATGAAGTCGAGGAGTGCTC
TACAATAGGGGCAATCATGCAGGAACTCCAGCAAATACTGGTGGACGACTTAGAAGTAGATTTGGAGGCCACAGAAAAAGAATCCAAAATTGCGCCTGGCATAATTTTGC
CCCAATTTGAGCGTTTTGAGTTTTTGCAACCGACAAATAGCGGATTTGACGGCCTTTCAACCTTCCATCATTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAGCACGCTGGAGCCTCCGTGAAATCGCCGCCGCTGCTACAGAGCTGCTGGAGTCACCGTGAAATTAACGCCATCACTGTGTCGCCAGAGAAACACGAGTCCCTACT
GTCTGCTGCCACAACTCGAAGGCCGCCGTTCGTGGTCCTTGGACGCCGAGGGGACGTTGGTGATGGAATCGCGAAGGCTACGCCGCTGCTGTTTAACCAACCGAATGGCA
AGGATTTAAGAAGCTATCAAAGAGATACAAAGGGGCTTAAGAGGATTGTTGAGGCAAATACAAAGCTTAGTGCAAAATTAGATAATGTTACCCTAGCCTCTGCAACAACC
AAAAGCGGTAGTGAAAGAGTAGAAGTCAAATCCCAAGAAAAGTCAGGAATTGCGCCTGGTGCATTTTCCCAACATAGTGTGTTTTCCAGTTTTGCATCAAACCTGAGGGA
GTTGTGTATGAGCACAGGGCCTGGTTGGTTGACCTTTGATCTAAAGATTGAAAGAACCCTTAAGAGGAGAAGGCGTGTGCAGAGGTTGAGAAAAGAGAAAGAAAGTAGAA
AAGATAAAGAAGTTGAAGAAGAAGAGACCATCGAGATGAATAGAAATCCACAAGATCCTCCACCGCCACAAAATCCACCTGTGAATGGAGATATGGCAGGTGAAAGAGCA
GCAAACCGAGCAGGAGAAATTCCTAATCTGATCCTTCTAGAAGATAATCAAGATGTAGCCATGCGTTATTATGTCACTCATGCATTCCACAACCTAAATTCAGGGATAAA
TAATCCTTTACCCCAAGCCGCACAGTTCGAGCTCAAGCCAGTCATGTTCCAAATGTTACAGACGATGGGCCAGTTCGGAGGATTGACTAACGAAGATCCTTGCTCCCATC
TCAAATTCTTTATTGAAATAGCTAATGCATTTCAACTTCCTGGTGTCTCTGAGGATGCACTAAGATTAAAAATGTTTCCTTTTTCTCTGAAGGATGGTGCAAGGACTTGG
CTAACCTCGTTAAAACCAAATTCTATCAACACATGGGCGGAATTAACAGAGAAATTTTTGGTAAAGTACCACACTTTGACCAGGAATGCAGACCTTCGAGAGGACATTAT
TGAACAATTCTATAGAGGATTGGATCATTCGTCAAGGATGATGTTGAACACTGTAGCCCATGGCTCGTTGTTAGAGAAGTCGGTAAATGAGATCGTTGATATCTTGAATA
AGATGACAGACATTAATGACCAAGGCGAAATAGGAAGGTCATTACCAAAGAAGCAAGTATCAGTCGGAGTCTTTGAGTTAGACACAGTAGCTTCAATGCAAGCCCAAATA
GCGACTATGAACCAAATGTTAAAGCAATTGACAATGGAGAACGAAACCAAAACCACAATTTCGGCGATACCTGAACCTTCTCCTATTTTACAAATTTCAGATATATCTTG
TGTCTATTGTGGTGATAACCACTTGTATGAGAACTGTCCAACTAATCCAGTATTTATTTTCTATGTAGGTCAAGGTGCCCAGCGGAATTTCAACCCGTATTCAAACACTT
ACAACCCTGGATGGAGGCACCATCCAAACTTTTCCTGGAGTAACCAAGGAGTAGCTGGTAGCAGTGCACAAGCACCCGCTCAACAATATAAAACCGATGCTGCGATAAGA
AGCTTGGAGATGCAAATGGGGCACATAGCTAATGATCAAAAATCTAGACCCCAAGGTAACCTTCAATGTCCTCGATGCGAGCGTCTCCCGGATGAAGTCGAGGAGTGCTC
TACAATAGGGGCAATCATGCAGGAACTCCAGCAAATACTGGTGGACGACTTAGAAGTAGATTTGGAGGCCACAGAAAAAGAATCCAAAATTGCGCCTGGCATAATTTTGC
CCCAATTTGAGCGTTTTGAGTTTTTGCAACCGACAAATAGCGGATTTGACGGCCTTTCAACCTTCCATCATTGA
Protein sequenceShow/hide protein sequence
MQHAGASVKSPPLLQSCWSHREINAITVSPEKHESLLSAATTRRPPFVVLGRRGDVGDGIAKATPLLFNQPNGKDLRSYQRDTKGLKRIVEANTKLSAKLDNVTLASATT
KSGSERVEVKSQEKSGIAPGAFSQHSVFSSFASNLRELCMSTGPGWLTFDLKIERTLKRRRRVQRLRKEKESRKDKEVEEEETIEMNRNPQDPPPPQNPPVNGDMAGERA
ANRAGEIPNLILLEDNQDVAMRYYVTHAFHNLNSGINNPLPQAAQFELKPVMFQMLQTMGQFGGLTNEDPCSHLKFFIEIANAFQLPGVSEDALRLKMFPFSLKDGARTW
LTSLKPNSINTWAELTEKFLVKYHTLTRNADLREDIIEQFYRGLDHSSRMMLNTVAHGSLLEKSVNEIVDILNKMTDINDQGEIGRSLPKKQVSVGVFELDTVASMQAQI
ATMNQMLKQLTMENETKTTISAIPEPSPILQISDISCVYCGDNHLYENCPTNPVFIFYVGQGAQRNFNPYSNTYNPGWRHHPNFSWSNQGVAGSSAQAPAQQYKTDAAIR
SLEMQMGHIANDQKSRPQGNLQCPRCERLPDEVEECSTIGAIMQELQQILVDDLEVDLEATEKESKIAPGIILPQFERFEFLQPTNSGFDGLSTFHH