CuGenDBv2

Moc01g10690 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc01g10690
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Plant protein of unknown function (DUF639)
Genome location: chr1:6595973..6611515
RNA-Seq Expression: Moc01g10690
Synteny: Moc01g10690
Gene Ontology terms: GO:0016020 - membrane (cellular component); GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR032867 - DYW domain
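
The genome location above uses a `chromosome:start..end` form. As a minimal sketch, such a string can be split into its parts in Python. The names `Location` and `parse_location` are hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval parsed from a 'chrom:start..end' string."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a location such as 'chr1:6595973..6611515' (hypothetical helper)."""
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {text!r}")
    chrom, start, end = match.groups()
    return Location(chrom, int(start), int(end))


loc = parse_location("chr1:6595973..6611515")
print(loc.chrom, loc.start, loc.end, loc.length)  # chr1 6595973 6611515 15543
```

Under that coordinate assumption the gene spans 15,543 bp on chr1.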


Homology
GenBank top hits (e value, % identity, alignment)
TYK08396.1 uncharacterized protein E5676_scaffold654G00030 [Cucumis melo var. makuwa] | e value: 2.3e-57 | % identity: 59.52
Query:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ
        +EC+ KGKEEKKKVVAA+VPPEQDEIPLFYSD+MP+LVN+D DVGEDA++     +  +  ++N            G+R  +  + K LK + KCMKH+Q
Subjt:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ

Query:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE
        KQA P GVELR+DEF+ H+EGTASSQR               ++NYSLYFEAS VITYENA+EIELS+DT+H V   STGPWG PLFDKA+VY+SPA+ E
Subjt:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE

Query:  EVVLEFPEST
        EVVLEFPE T
Subjt:  EVVLEFPEST

XP_004149121.2 uncharacterized protein LOC101222504 [Cucumis sativus] | e value: 1.7e-57 | % identity: 60.48
Query:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ
        +ECV KGKEEKKKVVAA+VPPEQDEIPLFYSD+MPLLVN+D DVGEDA++     +  +  ++N            G+R  +  + K LK + KCMK++Q
Subjt:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ

Query:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE
        KQA P GVELR+DEFI H+EGTASSQR               ++NYSLYFEAS VITYENA+EIELS+DT+H V   STGPWG PLFDKA+VY+SPA+ E
Subjt:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE

Query:  EVVLEFPEST
        EVVLEFPE T
Subjt:  EVVLEFPEST

XP_008442001.1 PREDICTED: uncharacterized protein LOC103485996 [Cucumis melo] | e value: 2.3e-57 | % identity: 59.52
Query:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ
        +EC+ KGKEEKKKVVAA+VPPEQDEIPLFYSD+MP+LVN+D DVGEDA++     +  +  ++N            G+R  +  + K LK + KCMKH+Q
Subjt:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ

Query:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE
        KQA P GVELR+DEF+ H+EGTASSQR               ++NYSLYFEAS VITYENA+EIELS+DT+H V   STGPWG PLFDKA+VY+SPA+ E
Subjt:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE

Query:  EVVLEFPEST
        EVVLEFPE T
Subjt:  EVVLEFPEST

XP_022154463.1 uncharacterized protein LOC111021735 [Momordica charantia] | e value: 7.8e-58 | % identity: 63.16
Query:  ECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQK
        E V KGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVN+D DVGEDAF+     +  +  ++N            G+R  +  + K LK + KCMKH+QK
Subjt:  ECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQK

Query:  QAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSEE
        QA P GVELR+DEFI H+EGTASSQR               ++NYSLYFEAS VI YENAVEIELSRDT+H V   STGPWG PLFDKAVVY+SP +SEE
Subjt:  QAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSEE

Query:  VVLEFPEST
        VVLEFPE T
Subjt:  VVLEFPEST

XP_038881464.1 uncharacterized protein LOC120072983 [Benincasa hispida] | e value: 7.8e-58 | % identity: 61.24
Query:  ECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQK
        ECV KGKEEKKKVVAA+VPPEQDEIPLFYSD+MPLLVN+D DVGEDA++     +  +  ++N            G+R  +  + K LK + KCMKH+QK
Subjt:  ECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQK

Query:  QAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGVT--STGPWGGPLFDKAVVYKSPAMSEE
        QA P GVELR+DEFI H+EGTASSQR               ++NYSLYFEAS  ITYENA+EIELSRDT+H VT  STGPWG PLFDKA+VY+SPA+ EE
Subjt:  QAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGVT--STGPWGGPLFDKAVVYKSPAMSEE

Query:  VVLEFPEST
        V+LEFPE T
Subjt:  VVLEFPEST

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KWC3 Uncharacterized protein | e value: 8.4e-58 | % identity: 60.48
Query:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ
        +ECV KGKEEKKKVVAA+VPPEQDEIPLFYSD+MPLLVN+D DVGEDA++     +  +  ++N            G+R  +  + K LK + KCMK++Q
Subjt:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ

Query:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE
        KQA P GVELR+DEFI H+EGTASSQR               ++NYSLYFEAS VITYENA+EIELS+DT+H V   STGPWG PLFDKA+VY+SPA+ E
Subjt:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE

Query:  EVVLEFPEST
        EVVLEFPE T
Subjt:  EVVLEFPEST

A0A1S3B5F8 uncharacterized protein LOC103485996 | e value: 1.1e-57 | % identity: 59.52
Query:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ
        +EC+ KGKEEKKKVVAA+VPPEQDEIPLFYSD+MP+LVN+D DVGEDA++     +  +  ++N            G+R  +  + K LK + KCMKH+Q
Subjt:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ

Query:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE
        KQA P GVELR+DEF+ H+EGTASSQR               ++NYSLYFEAS VITYENA+EIELS+DT+H V   STGPWG PLFDKA+VY+SPA+ E
Subjt:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE

Query:  EVVLEFPEST
        EVVLEFPE T
Subjt:  EVVLEFPEST

A0A5D3CDG9 Uncharacterized protein | e value: 1.1e-57 | % identity: 59.52
Query:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ
        +EC+ KGKEEKKKVVAA+VPPEQDEIPLFYSD+MP+LVN+D DVGEDA++     +  +  ++N            G+R  +  + K LK + KCMKH+Q
Subjt:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ

Query:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE
        KQA P GVELR+DEF+ H+EGTASSQR               ++NYSLYFEAS VITYENA+EIELS+DT+H V   STGPWG PLFDKA+VY+SPA+ E
Subjt:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE

Query:  EVVLEFPEST
        EVVLEFPE T
Subjt:  EVVLEFPEST

A0A6J1DKD5 uncharacterized protein LOC111021735 | e value: 3.8e-58 | % identity: 63.16
Query:  ECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQK
        E V KGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVN+D DVGEDAF+     +  +  ++N            G+R  +  + K LK + KCMKH+QK
Subjt:  ECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQK

Query:  QAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSEE
        QA P GVELR+DEFI H+EGTASSQR               ++NYSLYFEAS VI YENAVEIELSRDT+H V   STGPWG PLFDKAVVY+SP +SEE
Subjt:  QAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSEE

Query:  VVLEFPEST
        VVLEFPE T
Subjt:  VVLEFPEST

A0A6J1IK87 uncharacterized protein LOC111478184 | e value: 5.5e-57 | % identity: 60
Query:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ
        +ECV K KEEK K++AA+VPPEQDEIPLFYSDL+PLLVN+D DVGEDAF+     +  +  ++N            G+R  +  + K LK + KCMKH+Q
Subjt:  RECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAFI-----ISFLKFVIN------------GNRARYSLFLKALKRL-KCMKHMQ

Query:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE
        KQA P GVELR+DEFI H+EGTASSQR               ++NYSLYFEAS VI YENAVEIELSRDTLH V   STGPWG P+FDKA+VY+SPA+ E
Subjt:  KQAPPYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSE

Query:  EVVLEFPEST
        EVVLEFPE T
Subjt:  EVVLEFPEST

SwissProt top hits (e value, % identity, alignment)
B8YEK4 Pentatricopeptide repeat-containing protein OGR1, mitochondrial | e value: 3.3e-11 | % identity: 50
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HP   EI R L+ +   + E GY P+ S VLHD+ EEEK  +L YHSEKLA+A+GL+  P G  +R
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR

Q56XI1 Pentatricopeptide repeat-containing protein At1g09410, mitochondrial | e value: 3.0e-20 | % identity: 71.21
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HPE   I++ILD L GLLREAGY PD S+ LHDVDEEEKV SL YHSE+LAVAY LLK+ +G+PIR
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR

Q9FXB9 Pentatricopeptide repeat-containing protein At1g56690, mitochondrial | e value: 3.3e-19 | % identity: 69.7
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HPE + I+ +L+   GLLREAGY PD S VLHDVDEEEKV SL  HSE+LAVAYGLLK+P+G+PIR
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR

Q9S7F4 Putative pentatricopeptide repeat-containing protein At2g01510 | e value: 1.5e-11 | % identity: 49.23
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPI
        HP   EI+R ++ L+  +   GY PD S V+ DVDE+ K++SL YHSE+LAVA+ L+  P+G PI
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPI

Q9SY02 Pentatricopeptide repeat-containing protein At4g02750 | e value: 1.4e-12 | % identity: 51.52
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HPE  EI   L+ L   +++AGY    S VLHDV+EEEK + + YHSE+LAVAYG++++  G PIR
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR

Arabidopsis top hits (e value, % identity, alignment)
AT1G09410.1 pentatricopeptide (PPR) repeat-containing protein | e value: 2.1e-21 | % identity: 71.21
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HPE   I++ILD L GLLREAGY PD S+ LHDVDEEEKV SL YHSE+LAVAY LLK+ +G+PIR
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR

AT1G56690.1 Pentatricopeptide repeat (PPR) superfamily protein | e value: 2.4e-20 | % identity: 69.7
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HPE + I+ +L+   GLLREAGY PD S VLHDVDEEEKV SL  HSE+LAVAYGLLK+P+G+PIR
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR

AT2G21720.1 Plant protein of unknown function (DUF639) | e value: 7.8e-40 | % identity: 45.15
Query:  GKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAF-----IISFLKFVINGNRA---------------RYSLFLKALKRLKCMKHMQKQAP
        GKE + K + A++ PEQD+I LFYSD+MPLLV+ +  VGEDAF     II     +ING                   Y +F+K +   KCMKH+QKQ+ 
Subjt:  GKEEKKKVVAASVPPEQDEIPLFYSDLMPLLVNEDSDVGEDAF-----IISFLKFVINGNRA---------------RYSLFLKALKRLKCMKHMQKQAP

Query:  PYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSEEVVL
        P G+EL +DE I H+EGT +SQR               ++NY+LYFEA+ +I YE+A++I+LS+D        STGP G PLFDKA+VY+SP   E +V+
Subjt:  PYGVELREDEFISHMEGTASSQRAY-------------ISNYSLYFEASDVITYENAVEIELSRDTLHGV--TSTGPWGGPLFDKAVVYKSPAMSEEVVL

Query:  EFPEST
        EFPE T
Subjt:  EFPEST

AT3G61170.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 6.7e-15 | % identity: 56.06
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HP   EI   +D +  L++EAGY+ D SF LHD+D+E K   L YHSEKLAVA+GLL +P G PIR
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR

AT4G02750.1 Tetratricopeptide repeat (TPR)-like superfamily protein | e value: 9.6e-14 | % identity: 51.52
Query:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
        HPE  EI   L+ L   +++AGY    S VLHDV+EEEK + + YHSE+LAVAYG++++  G PIR
Subjt:  HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIR
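
The % identity listed for a hit can be cross-checked against the middle line of its alignment, where a letter marks an identical residue and `+` marks a conservative substitution. The Python sketch below does this for the Q9S7F4 alignment from the SwissProt hits. It assumes identity is the number of identical positions divided by the aligned length, which is the usual BLAST convention but is not stated on this page; `percent_identity` is a hypothetical helper.

```python
# Query row and middle (match) row of the Q9S7F4 alignment, copied from the hit list.
QUERY = "HPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPI"
MIDLINE = "HP   EI+R ++ L+  +   GY PD S V+ DVDE+ K++SL YHSE+LAVA+ L+  P+G PI"


def percent_identity(query_row: str, midline: str) -> tuple:
    """Return (identities, aligned_length, percent identity rounded to 2 places).

    Letters in the midline count as identities; '+' and spaces do not.
    The aligned length is taken from the query row, gaps included, so that
    trailing spaces lost from the midline do not change the result.
    """
    identities = sum(ch.isalpha() for ch in midline)
    aligned_length = len(query_row)
    return identities, aligned_length, round(100 * identities / aligned_length, 2)


print(percent_identity(QUERY, MIDLINE))  # (32, 65, 49.23)
```

The result agrees with the 49.23 reported for Q9S7F4, which supports the assumed definition for this record.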


Sequences
CDS sequence
ATGGGACACCCCGAACATTCAGAGATCATGAGAATATTGGATTGGTTATCTGGATTGCTCAGAGAGGCTGGATATTACCCGGACCGGAGTTTTGTGCTACACGAC
GTAGATGAAGAAGAAAAAGTGCAAAGCTTGGGATATCATAGTGAGAAATTGGCTGTGGCATATGGACTTCTCAAAATACCAAAAGGGATGCCAATTCGGGAGTGT
GTGACAAAGGGGAAAGAAGAGAAGAAGAAAGTCGTAGCTGCCAGTGTACCTCCAGAGCAAGATGAGATACCTCTCTTCTATTCGGACCTTATGCCCTTGCTTGTA
AATGAGGATTCAGATGTTGGAGAAGATGCATTTATTATCTCATTCCTAAAGTTTGTGATTAATGGAAATCGTGCACGATACTCCCTCTTCCTCAAAGCTTTAAAA
CGACTGAAATGCATGAAGCATATGCAGAAACAGGCACCTCCATATGGTGTAGAACTACGAGAGGATGAGTTCATATCACACATGGAGGGAACTGCTAGCTCCCAG
AGGGCTTACATAAGTAATTACAGCCTCTACTTTGAGGCTTCGGATGTGATAACATATGAAAATGCGGTTGAGATAGAACTCTCAAGGGACACTCTTCACGGTGTG
ACTTCAACTGGACCTTGGGGTGGACCACTTTTTGACAAGGCAGTAGTCTACAAGTCCCCCGCGATGTCCGAGGAAGTTGTGTTGGAGTTTCCAGAGTCTACTTAC
AGATTGCCATTAATCAGGCTAGAGAAAGAAGATAAGGAAGTTTCCATTGCCGAGGCTACTGCTGTAGGGCTGAAAGAAAGGAATCGGCAGAAGTGCACTCATCTT
CCTGGCAAGTATCTTACTCTAAGAAAATGTTTAGAGCCATGGCACTGTACCAGAGCGCCGCGGAGCTGTGCAGCGTGGTGGCACTGCCTTGGCGGCTCTGGTGCT
GCAGCAACCACAGTTGCCCTTTGGCGCCGCGGCGCTATCCCGTGTGTTTCTGTTGTTTCGCTTCGTGTCGCCCTAGGCGCGGCCTCCCTATGGAAGGTGTTTGCA
TGGTTCAATATCGAGGTGAATGAAGATAGTATTCATAGTAAGTGGGAGAAAGATGTCACCGGCACGGCTCCCTCCGACGACCCAAGAACCGCGACACAAGCTTCC
GTCACGGCTGTTTTGTTCCTCCGCGAGCAGTGGCGTGTTCCCACTTCGTGTTTCAACAACCCATTGCAACCACGATTAGATCTGCGTCCGGTGACTCTCCGTGCG
ACGTGCGGCGTGCAACAGTGGCGTGATTCCGGCGGCGGTTTCATTTCTGCATCGTTTTCACGGTCGCTCTACCGTCCTGCACAGATCTCGCAGCAGCTCAAGCGA
TTTTGCGTTCATTCGTGA
mRNA sequence
ATGGGACACCCCGAACATTCAGAGATCATGAGAATATTGGATTGGTTATCTGGATTGCTCAGAGAGGCTGGATATTACCCGGACCGGAGTTTTGTGCTACACGAC
GTAGATGAAGAAGAAAAAGTGCAAAGCTTGGGATATCATAGTGAGAAATTGGCTGTGGCATATGGACTTCTCAAAATACCAAAAGGGATGCCAATTCGGGAGTGT
GTGACAAAGGGGAAAGAAGAGAAGAAGAAAGTCGTAGCTGCCAGTGTACCTCCAGAGCAAGATGAGATACCTCTCTTCTATTCGGACCTTATGCCCTTGCTTGTA
AATGAGGATTCAGATGTTGGAGAAGATGCATTTATTATCTCATTCCTAAAGTTTGTGATTAATGGAAATCGTGCACGATACTCCCTCTTCCTCAAAGCTTTAAAA
CGACTGAAATGCATGAAGCATATGCAGAAACAGGCACCTCCATATGGTGTAGAACTACGAGAGGATGAGTTCATATCACACATGGAGGGAACTGCTAGCTCCCAG
AGGGCTTACATAAGTAATTACAGCCTCTACTTTGAGGCTTCGGATGTGATAACATATGAAAATGCGGTTGAGATAGAACTCTCAAGGGACACTCTTCACGGTGTG
ACTTCAACTGGACCTTGGGGTGGACCACTTTTTGACAAGGCAGTAGTCTACAAGTCCCCCGCGATGTCCGAGGAAGTTGTGTTGGAGTTTCCAGAGTCTACTTAC
AGATTGCCATTAATCAGGCTAGAGAAAGAAGATAAGGAAGTTTCCATTGCCGAGGCTACTGCTGTAGGGCTGAAAGAAAGGAATCGGCAGAAGTGCACTCATCTT
CCTGGCAAGTATCTTACTCTAAGAAAATGTTTAGAGCCATGGCACTGTACCAGAGCGCCGCGGAGCTGTGCAGCGTGGTGGCACTGCCTTGGCGGCTCTGGTGCT
GCAGCAACCACAGTTGCCCTTTGGCGCCGCGGCGCTATCCCGTGTGTTTCTGTTGTTTCGCTTCGTGTCGCCCTAGGCGCGGCCTCCCTATGGAAGGTGTTTGCA
TGGTTCAATATCGAGGTGAATGAAGATAGTATTCATAGTAAGTGGGAGAAAGATGTCACCGGCACGGCTCCCTCCGACGACCCAAGAACCGCGACACAAGCTTCC
GTCACGGCTGTTTTGTTCCTCCGCGAGCAGTGGCGTGTTCCCACTTCGTGTTTCAACAACCCATTGCAACCACGATTAGATCTGCGTCCGGTGACTCTCCGTGCG
ACGTGCGGCGTGCAACAGTGGCGTGATTCCGGCGGCGGTTTCATTTCTGCATCGTTTTCACGGTCGCTCTACCGTCCTGCACAGATCTCGCAGCAGCTCAAGCGA
TTTTGCGTTCATTCGTGA
Protein sequence
MGHPEHSEIMRILDWLSGLLREAGYYPDRSFVLHDVDEEEKVQSLGYHSEKLAVAYGLLKIPKGMPIRECVTKGKEEKKKVVAASVPPEQDEIPLFYSDLMPLLV
NEDSDVGEDAFIISFLKFVINGNRARYSLFLKALKRLKCMKHMQKQAPPYGVELREDEFISHMEGTASSQRAYISNYSLYFEASDVITYENAVEIELSRDTLHGV
TSTGPWGGPLFDKAVVYKSPAMSEEVVLEFPESTYRLPLIRLEKEDKEVSIAEATAVGLKERNRQKCTHLPGKYLTLRKCLEPWHCTRAPRSCAAWWHCLGGSGA
AATTVALWRRGAIPCVSVVSLRVALGAASLWKVFAWFNIEVNEDSIHSKWEKDVTGTAPSDDPRTATQASVTAVLFLREQWRVPTSCFNNPLQPRLDLRPVTLRA
TCGVQQWRDSGGGFISASFSRSLYRPAQISQQLKRFCVHS
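
As a quick consistency check between the CDS and protein sequences above, the Python sketch below translates a CDS with the standard genetic code (NCBI translation table 1). Using the standard table for this gene is an assumption, and `translate` is a hypothetical helper rather than anything provided by CuGenDB.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS (whitespace and line breaks allowed) into protein.

    Unknown codons (for example those containing N) become 'X'.
    A single terminal stop codon is dropped from the result.
    """
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq), 3)
    )
    return protein[:-1] if protein.endswith("*") else protein


# First line of the CDS above (35 codons) and the final 18 nt including the stop.
first_line = (
    "ATGGGACACCCCGAACATTCAGAGATCATGAGAATATTGGATTGGTTATCTGGATTGCTCAGAGAGGCT"
    "GGATATTACCCGGACCGGAGTTTTGTGCTACACGAC"
)
print(translate(first_line))            # MGHPEHSEIMRILDWLSGLLREAGYYPDRSFVLHD
print(translate("TTTTGCGTTCATTCGTGA"))  # FCVHS
```

Both fragments match the start and end of the protein sequence listed above; passing the full CDS to `translate` would extend the same check to the whole record.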