CuGenDBv2

Moc09g25140 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc09g25140
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Retrotrans_gag domain-containing protein
Genome location: chr9:18836597..18839080
RNA-Seq Expression: Moc09g25140
Synteny: Moc09g25140
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
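
The genome location above reads as a sequence name followed by a start..end pair. The following is a minimal Python sketch for parsing it and reporting the span length. It assumes the coordinates are 1-based and inclusive, which the page itself does not state; the names parse_location and span_length are illustrative and not part of any CuGenDB tool.

```python
import re


def parse_location(loc):
    """Split a string such as 'chr9:18836597..18839080' into (seqid, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if m is None:
        raise ValueError(f"unrecognised location string: {loc!r}")
    seqid, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return seqid, start, end


def span_length(start, end):
    # Assumption: 1-based, inclusive coordinates (not stated on the page).
    return end - start + 1


seqid, start, end = parse_location("chr9:18836597..18839080")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the locus spans 2484 bp, which is longer than the 1413-nt CDS listed in the Sequences section.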


Homology
GenBank top hits
XP_022154847.1 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007 [Momordica charantia]; e-value: 2.1e-74; % identity: 64.54
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        +NGDM  EGAANRA E+PN ILL DNRDVAMRNYVT AFHNL SGI N LPQA Q ELKPVMF MLQ MGQF GLTNEDPYSHLKSFIEIANAFQLPGVS
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAIND
        ++ALRLK+                                                      GLDRS RMMLNTAANGSLLE S+NEIVDILNKM  IND
Subjt:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAIND

Query:  QGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATS
        QGE GRSL KKQVSAG+FE DT+A +QAQMA MNQMLKQ TM+KETKT TS
Subjt:  QGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATS

XP_022158314.1 uncharacterized protein LOC111024824 [Momordica charantia]; e-value: 2.0e-72; % identity: 46.85
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        ++GD   EGAANRA E+PN ILL DNRDVA+RNYVTHAFHNL S + +                     G  R   NEDPYSHLKSFIEIANAFQL GVS
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAI
        +DALRLKM      D    R   N         + EL  + L+  H L          IEQFYRGLDR  RMMLNTAAN SL E SI+EI+DILNKMT  
Subjt:  DDALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAI

Query:  NDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDIYLV---------SIVVITTSMRTV-QLIQRLFS
        NDQGEIGRSLPKKQVSA VFE DT+AS+QAQMAT+NQMLKQLTM+KETKTATS M EP   LQISDI  V         +     TS+  V Q  QR F+
Subjt:  NDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDIYLV---------SIVVITTSMRTV-QLIQRLFS

Query:  MY--------------------------------------------------------------------------------------VKFMTRPDAAIR
         Y                                                                                       +FMTR D  IR
Subjt:  MY--------------------------------------------------------------------------------------VKFMTRPDAAIR

Query:  NLEMQVGHIANDQKSRPQGTLLGHMENPK
         LEMQVG IAND+KSRPQGTL G+ ENPK
Subjt:  NLEMQVGHIANDQKSRPQGTLLGHMENPK

XP_022158408.1 uncharacterized protein LOC111024897, partial [Momordica charantia]; e-value: 6.7e-73; % identity: 39.79
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        +NG+M D     R +E  N I + DNRDVAMR Y   AF N  SGI NP+P    FELKP+MFQMLQ +G F G  +EDP+ HLKSFI+IANAF+LPG++
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD
        DDA  L +FPFSL+D AR  LNA    SI TW  L EKFL K+   TR+ D+RE+I                                   IE F+RGLD
Subjt:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD

Query:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQ--GEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMP------EPF
           +MMLN AANG+  + + NEIVDILN + + N+    +  R+ PKKQ  AGV   D   S+Q +M TMNQ LK++ +  +   AT   P         
Subjt:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQ--GEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMP------EPF

Query:  PVLQISDIYL--------------------------VSIVVITTSMR----TVQLIQR------------LFSMYVKFMTRPDAAI-------RNLEMQV
        PV Q++D+                                V  T         Q  QR            L +M  ++M R DA I       RN   Q+
Subjt:  PVLQISDIYL--------------------------VSIVVITTSMR----TVQLIQR------------LFSMYVKFMTRPDAAI-------RNLEMQV

Query:  GHIANDQKSRPQGTLLGHMENPKQEGKEQCKAIITKSGLSYEGPSLPTDEVTTPFPASTPPPEEKVEPVSSEEKGKNVDK
        GH+AN+ K+RPQG+  GH E P++EGKEQCKA+  +SGL+Y+GP++PT +V  P   ST P  +  E  ++ EK +N+ K
Subjt:  GHIANDQKSRPQGTLLGHMENPKQEGKEQCKAIITKSGLSYEGPSLPTDEVTTPFPASTPPPEEKVEPVSSEEKGKNVDK

XP_022158836.1 uncharacterized protein LOC111025302 [Momordica charantia]; e-value: 4.0e-118; % identity: 74.26
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        +NGDM  E AANR  E+PNLILL DNRDVAMRNYVTHAFHNL SGI NPLPQA QFELKPVMFQ+LQ MGQF GLTNEDPYSHLKSFIEIANAFQLPG S
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD
        +DALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLT+N DLREDI                                   IEQFYRGLD
Subjt:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD

Query:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDI
        RS +MMLNT ANGSLLE S+NEIVD+LNKMT INDQGE+GRSLPKKQVS G+FE DT+AS+QAQMA MNQMLKQLTM+KETKT TS +PE  P+LQISDI
Subjt:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDI

Query:  YLV
          V
Subjt:  YLV

XP_022159127.1 uncharacterized protein LOC111025557 [Momordica charantia]; e-value: 3.1e-70; % identity: 63.18
Query:  MGQFRGLTNEDPYSHLKSFIEIANAFQLPGVSDDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI------------
        M QF G TNEDPYSHLKSFI+IANAFQLPGVS+DALRLKMFPFSLRDGA TW+N LE N I TWAELT+KFLAKYHTLTRN DL+EDI            
Subjt:  MGQFRGLTNEDPYSHLKSFIEIANAFQLPGVSDDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI------------

Query:  -----------------------IEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATM
                               I+QFYRGLD  +RMM +TAAN SLLE S+NEI+DILNKM  INDQ E+GRSLPKKQ SAG+FE DT+ S+QAQ++ M
Subjt:  -----------------------IEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATM

Query:  NQMLKQLTMDKETKTATST-MPEPFPVLQISDIYLVSIV
        +QMLKQLTM K  K ATS  + EP  +LQISDI  V  V
Subjt:  NQMLKQLTMDKETKTATST-MPEPFPVLQISDIYLVSIV

TrEMBL top hits
A0A6J1DMT3 LOW QUALITY PROTEIN: uncharacterized protein LOC111022007; e-value: 1.0e-74; % identity: 64.54
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        +NGDM  EGAANRA E+PN ILL DNRDVAMRNYVT AFHNL SGI N LPQA Q ELKPVMF MLQ MGQF GLTNEDPYSHLKSFIEIANAFQLPGVS
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAIND
        ++ALRLK+                                                      GLDRS RMMLNTAANGSLLE S+NEIVDILNKM  IND
Subjt:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAIND

Query:  QGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATS
        QGE GRSL KKQVSAG+FE DT+A +QAQMA MNQMLKQ TM+KETKT TS
Subjt:  QGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATS

A0A6J1DW02 uncharacterized protein LOC111024897; e-value: 3.3e-73; % identity: 39.79
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        +NG+M D     R +E  N I + DNRDVAMR Y   AF N  SGI NP+P    FELKP+MFQMLQ +G F G  +EDP+ HLKSFI+IANAF+LPG++
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD
        DDA  L +FPFSL+D AR  LNA    SI TW  L EKFL K+   TR+ D+RE+I                                   IE F+RGLD
Subjt:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD

Query:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQ--GEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMP------EPF
           +MMLN AANG+  + + NEIVDILN + + N+    +  R+ PKKQ  AGV   D   S+Q +M TMNQ LK++ +  +   AT   P         
Subjt:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQ--GEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMP------EPF

Query:  PVLQISDIYL--------------------------VSIVVITTSMR----TVQLIQR------------LFSMYVKFMTRPDAAI-------RNLEMQV
        PV Q++D+                                V  T         Q  QR            L +M  ++M R DA I       RN   Q+
Subjt:  PVLQISDIYL--------------------------VSIVVITTSMR----TVQLIQR------------LFSMYVKFMTRPDAAI-------RNLEMQV

Query:  GHIANDQKSRPQGTLLGHMENPKQEGKEQCKAIITKSGLSYEGPSLPTDEVTTPFPASTPPPEEKVEPVSSEEKGKNVDK
        GH+AN+ K+RPQG+  GH E P++EGKEQCKA+  +SGL+Y+GP++PT +V  P   ST P  +  E  ++ EK +N+ K
Subjt:  GHIANDQKSRPQGTLLGHMENPKQEGKEQCKAIITKSGLSYEGPSLPTDEVTTPFPASTPPPEEKVEPVSSEEKGKNVDK

A0A6J1DYY9 uncharacterized protein LOC111025557; e-value: 1.2e-70; % identity: 63.6
Query:  MGQFRGLTNEDPYSHLKSFIEIANAFQLPGVSDDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI------------
        M QF G TNEDPYSHLKSFI+IANAFQLPGVS+DALRLKMFPFSLRDGA TWLN LE N I TWAELT+KFLAKYHTLTRN DL+EDI            
Subjt:  MGQFRGLTNEDPYSHLKSFIEIANAFQLPGVSDDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI------------

Query:  -----------------------IEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATM
                               I+QFYRGLD  +RMM +TAAN SLLE S+NEI+DILNKM  INDQ E+GRSLPKKQ SAG+FE DT+ S+QAQ++ M
Subjt:  -----------------------IEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATM

Query:  NQMLKQLTMDKETKTATST-MPEPFPVLQISDIYLVSIV
        +QMLKQLTM K  K ATS  + EP  +LQISDI  V  V
Subjt:  NQMLKQLTMDKETKTATST-MPEPFPVLQISDIYLVSIV

A0A6J1DZ19 uncharacterized protein LOC111024824; e-value: 9.5e-73; % identity: 46.85
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        ++GD   EGAANRA E+PN ILL DNRDVA+RNYVTHAFHNL S + +                     G  R   NEDPYSHLKSFIEIANAFQL GVS
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAI
        +DALRLKM      D    R   N         + EL  + L+  H L          IEQFYRGLDR  RMMLNTAAN SL E SI+EI+DILNKMT  
Subjt:  DDALRLKMFPFSLRD--GARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAI

Query:  NDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDIYLV---------SIVVITTSMRTV-QLIQRLFS
        NDQGEIGRSLPKKQVSA VFE DT+AS+QAQMAT+NQMLKQLTM+KETKTATS M EP   LQISDI  V         +     TS+  V Q  QR F+
Subjt:  NDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDIYLV---------SIVVITTSMRTV-QLIQRLFS

Query:  MY--------------------------------------------------------------------------------------VKFMTRPDAAIR
         Y                                                                                       +FMTR D  IR
Subjt:  MY--------------------------------------------------------------------------------------VKFMTRPDAAIR

Query:  NLEMQVGHIANDQKSRPQGTLLGHMENPK
         LEMQVG IAND+KSRPQGTL G+ ENPK
Subjt:  NLEMQVGHIANDQKSRPQGTLLGHMENPK

A0A6J1E251 uncharacterized protein LOC111025302; e-value: 2.0e-118; % identity: 74.26
Query:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS
        +NGDM  E AANR  E+PNLILL DNRDVAMRNYVTHAFHNL SGI NPLPQA QFELKPVMFQ+LQ MGQF GLTNEDPYSHLKSFIEIANAFQLPG S
Subjt:  MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVS

Query:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD
        +DALRLKMFPFSLRDGARTW+NALEPNSINTWAELT+KFLAKYHTLT+N DLREDI                                   IEQFYRGLD
Subjt:  DDALRLKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDI-----------------------------------IEQFYRGLD

Query:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDI
        RS +MMLNT ANGSLLE S+NEIVD+LNKMT INDQGE+GRSLPKKQVS G+FE DT+AS+QAQMA MNQMLKQLTM+KETKT TS +PE  P+LQISDI
Subjt:  RSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPKKQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDI

Query:  YLV
          V
Subjt:  YLV

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found
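
In the alignments above, the unlabelled middle line follows the usual BLAST convention: a letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The Python sketch below counts these for a single alignment segment. It is an illustration only: the function name midline_stats is hypothetical, and because it looks at one segment rather than the whole alignment, it is not expected to reproduce the % identity values reported on the page.

```python
def midline_stats(midline):
    """Return (identities, conservative substitutions, columns) for a BLAST midline."""
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = sum(1 for ch in midline if ch == "+")
    return identities, positives, len(midline)


# Final segment of the first GenBank alignment (XP_022154847.1), copied from above.
midline = "QGE GRSL KKQVSAG+FE DT+A +QAQMA MNQMLKQ TM+KETKT TS"
ident, pos, cols = midline_stats(midline)
print(f"{ident}/{cols} identical ({100 * ident / cols:.1f}%), {pos} conservative")
```

Trailing spaces in a midline are easily lost when text is copied from a web page, so the column count is safer to take from the corresponding Query line when the two differ.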

Sequences
CDS sequence
ATGAACGGAGATATGACAGATGAAGGAGCAGCAAACCGAGCAGAAGAATTGCCTAATCTGATCCTTCTAGTAGATAACCGAGATGTAGCCATGCGGAACTATGTC
ACTCATGCGTTCCATAACCTAAAATCAGGGATAACTAATCCTTTACCCCAAGCCACACAGTTCGAGCTTAAGCCAGTCATGTTCCAGATGTTGCAGATGATGGGC
CAGTTCAGAGGATTAACTAACGAAGATCCTTACTCCCATCTCAAATCTTTTATTGAAATAGCTAATGCATTTCAACTTCCTGGTGTCTCTGATGATGCACTAAGA
TTAAAAATGTTTCCTTTTTCTCTCAGGGACGGTGCAAGGACTTGGCTAAATGCGTTAGAACCAAATTCTATCAACACGTGGGCGGAACTGACGGAGAAATTTTTG
GCGAAGTACCACACTTTGACCAGAAACACAGACCTTCGAGAGGACATTATTGAACAGTTCTATAGAGGATTGGATCGTTCATTGAGAATGATGTTGAACACCGCA
GCCAATGGCTCGTTGTTAGAAAATTCGATTAATGAGATCGTTGATATCTTAAACAAGATGACAGCCATTAATGACCAAGGCGAAATAGGAAGGTCATTGCCAAAG
AAGCAAGTATCAGCCGGAGTCTTTGAGTTTGACACAATAGCTTCAATACAAGCCCAAATGGCGACTATGAACCAGATGTTAAAGCAGCTAACAATGGATAAGGAA
ACCAAAACCGCCACTTCAACGATGCCTGAACCCTTTCCTGTTTTACAAATTTCAGATATATATCTTGTGTCTATTGTGGTGATAACCACTTCTATGAGAACTGTC
CAGCTAATCCAGCGTCTATTTTCTATGTACGTCAAGTTCATGACAAGACCTGATGCTGCGATAAGAAACTTGGAGATGCAAGTGGGGCATATAGCGAATGACCAA
AAATCTAGACCCCAAGGTACATTGCTTGGACACATGGAAAATCCTAAACAAGAAGGAAAAGAGCAATGTAAGGCAATCATCACAAAAAGCGGATTGAGTTATGAA
GGACCATCACTTCCAACTGACGAAGTAACTACACCTTTTCCCGCATCCACTCCACCACCAGAAGAGAAAGTAGAACCCGTAAGTTCGGAAGAAAAAGGTAAGAAT
GTGGATAAAGTCGAAGACTACTCTGCAATAAGGGGCAATCATGGAGGAACTCCAGGAAATGATGGTGGAAGACTTAGAAGCAAATTTGGAGGTCGCAGAAAAAGA
AGCCAATGCGCAATTTTTCCCCAATATGAGAATTTCGAGCTTTTGCAGCCGACAATAGCTGATTTGAAGGACTTGCAACCTTCCATCATTGAACCTCCAGAATTG
GAGAAGGAACCCCTACCTTCTCATTTAATATATACTTATTGGGTTTAA
mRNA sequence
ATGAACGGAGATATGACAGATGAAGGAGCAGCAAACCGAGCAGAAGAATTGCCTAATCTGATCCTTCTAGTAGATAACCGAGATGTAGCCATGCGGAACTATGTC
ACTCATGCGTTCCATAACCTAAAATCAGGGATAACTAATCCTTTACCCCAAGCCACACAGTTCGAGCTTAAGCCAGTCATGTTCCAGATGTTGCAGATGATGGGC
CAGTTCAGAGGATTAACTAACGAAGATCCTTACTCCCATCTCAAATCTTTTATTGAAATAGCTAATGCATTTCAACTTCCTGGTGTCTCTGATGATGCACTAAGA
TTAAAAATGTTTCCTTTTTCTCTCAGGGACGGTGCAAGGACTTGGCTAAATGCGTTAGAACCAAATTCTATCAACACGTGGGCGGAACTGACGGAGAAATTTTTG
GCGAAGTACCACACTTTGACCAGAAACACAGACCTTCGAGAGGACATTATTGAACAGTTCTATAGAGGATTGGATCGTTCATTGAGAATGATGTTGAACACCGCA
GCCAATGGCTCGTTGTTAGAAAATTCGATTAATGAGATCGTTGATATCTTAAACAAGATGACAGCCATTAATGACCAAGGCGAAATAGGAAGGTCATTGCCAAAG
AAGCAAGTATCAGCCGGAGTCTTTGAGTTTGACACAATAGCTTCAATACAAGCCCAAATGGCGACTATGAACCAGATGTTAAAGCAGCTAACAATGGATAAGGAA
ACCAAAACCGCCACTTCAACGATGCCTGAACCCTTTCCTGTTTTACAAATTTCAGATATATATCTTGTGTCTATTGTGGTGATAACCACTTCTATGAGAACTGTC
CAGCTAATCCAGCGTCTATTTTCTATGTACGTCAAGTTCATGACAAGACCTGATGCTGCGATAAGAAACTTGGAGATGCAAGTGGGGCATATAGCGAATGACCAA
AAATCTAGACCCCAAGGTACATTGCTTGGACACATGGAAAATCCTAAACAAGAAGGAAAAGAGCAATGTAAGGCAATCATCACAAAAAGCGGATTGAGTTATGAA
GGACCATCACTTCCAACTGACGAAGTAACTACACCTTTTCCCGCATCCACTCCACCACCAGAAGAGAAAGTAGAACCCGTAAGTTCGGAAGAAAAAGGTAAGAAT
GTGGATAAAGTCGAAGACTACTCTGCAATAAGGGGCAATCATGGAGGAACTCCAGGAAATGATGGTGGAAGACTTAGAAGCAAATTTGGAGGTCGCAGAAAAAGA
AGCCAATGCGCAATTTTTCCCCAATATGAGAATTTCGAGCTTTTGCAGCCGACAATAGCTGATTTGAAGGACTTGCAACCTTCCATCATTGAACCTCCAGAATTG
GAGAAGGAACCCCTACCTTCTCATTTAATATATACTTATTGGGTTTAA
Protein sequence
MNGDMTDEGAANRAEELPNLILLVDNRDVAMRNYVTHAFHNLKSGITNPLPQATQFELKPVMFQMLQMMGQFRGLTNEDPYSHLKSFIEIANAFQLPGVSDDALR
LKMFPFSLRDGARTWLNALEPNSINTWAELTEKFLAKYHTLTRNTDLREDIIEQFYRGLDRSLRMMLNTAANGSLLENSINEIVDILNKMTAINDQGEIGRSLPK
KQVSAGVFEFDTIASIQAQMATMNQMLKQLTMDKETKTATSTMPEPFPVLQISDIYLVSIVVITTSMRTVQLIQRLFSMYVKFMTRPDAAIRNLEMQVGHIANDQ
KSRPQGTLLGHMENPKQEGKEQCKAIITKSGLSYEGPSLPTDEVTTPFPASTPPPEEKVEPVSSEEKGKNVDKVEDYSAIRGNHGGTPGNDGGRLRSKFGGRRKR
SQCAIFPQYENFELLQPTIADLKDLQPSIIEPPELEKEPLPSHLIYTYWV
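
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below is a minimal, dependency-free Python version. It assumes the standard genetic code (NCBI translation table 1), which is the usual choice for plant nuclear genes but is not stated on the page; the function name translate is illustrative.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAACGGAGATATGACAGATGAAGGAGCA"))  # MNGDMTDEGA
print(translate("TATTGGGTTTAA"))                    # YWV
```

Both fragments match the start and end of the protein sequence above. Passing the full CDS (line breaks are ignored) should give a string that can be compared directly with the 470-residue protein.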