; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g18480 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g18480
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionProtein of unknown function (DUF506)
Genome locationchr2:13780588..13793411
RNA-Seq ExpressionMoc02g18480
SyntenyMoc02g18480
Gene Ontology termsNA
InterPro domainsIPR006502 - Protein of unknown function PDDEXK-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7035272.1 hypothetical protein SDJN02_02067, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-6261.64Show/hide
Query:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI
        L   +  H     +EILG +GR AE EV+EAV KH+R+K+++PKTT +KKWLVMKL+MDGY S +LCHTSWVTS+GCP G+YEYIE K E      KRMI
Subjt:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI

Query:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R
        IDI+FKAQFEVAR T +YKQLT+ALP+VFVG+EE V +II+ILCSAAKQSL+ESGLHIPPWRTSTYMQ K+     +  K             P+VK   
Subjt:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R

Query:  RRLWSGPSALSTQFSNMRI
        +R+W G SALSTQFSNM I
Subjt:  RRLWSGPSALSTQFSNMRI

XP_022143594.1 uncharacterized protein LOC111013453 [Momordica charantia]1.3e-10996.67Show/hide
Query:  HEIVAHEEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQ
        H + + +EILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQ
Subjt:  HEIVAHEEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQ

Query:  FEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQRRRLWSGPSA
        FEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQRRRLWSGPSA
Subjt:  FEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQRRRLWSGPSA

Query:  LSTQFSNMRI
        LSTQFSNM I
Subjt:  LSTQFSNMRI

XP_022947663.1 uncharacterized protein LOC111451460 isoform X1 [Cucurbita moschata]2.0e-6262.1Show/hide
Query:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI
        L   +  H     +EILG +GR AE EV+EAV KH+R+K+++PKTT LKKWLVMKL+MDGY S DLCH+SWVTS+GCP G+YEYIE K E      KRMI
Subjt:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI

Query:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R
        IDI+FKAQFEVAR T +YKQLT+ALP+VFVG+EE V +II+ILCSAAKQSL+ESGLHIPPWRTSTYMQ K+     +  K             P+VK   
Subjt:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R

Query:  RRLWSGPSALSTQFSNMRI
        +R+W G SALSTQFSNM I
Subjt:  RRLWSGPSALSTQFSNMRI

XP_023007236.1 uncharacterized protein LOC111499781 [Cucurbita maxima]2.4e-6363.47Show/hide
Query:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI
        L   +  H     +EILG SGR AE EV+EAV KH+R+K ++PKTT LKKWLVMKL+MDGY S DLCHTSWVTS+GCP G+YEYIE KVE      KRMI
Subjt:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI

Query:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R
        IDI+FKAQFEVAR T +YKQLT+ALP+VFVG+EE V +II+ILCSAAKQSL+ESGLHIPPWRTSTYMQ K+     +  K             P+VK   
Subjt:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R

Query:  RRLWSGPSALSTQFSNMRI
        +R+W G SALSTQFSNM I
Subjt:  RRLWSGPSALSTQFSNMRI

XP_038900827.1 uncharacterized protein LOC120087891 [Benincasa hispida]1.1e-7169.09Show/hide
Query:  HEIVAHEEILGSGREAEGEVAEAVTKHLRK-KLESPKTT-SLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFK
        H  ++ +EILGSG +AEGEV E V KHLR  K +SPKTT SLKKWLVMKL+MDGYDS+DLCHTSWVTS+GCPAG+YEYIE KV+DE+G  KR+IIDIEFK
Subjt:  HEIVAHEEILGSGREAEGEVAEAVTKHLRK-KLESPKTT-SLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFK

Query:  AQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKR------EETEEEKEEKGSAFNYR-WKPPMVKQ-
        AQFEVAR T  YKQLTEALPTVFVG+EE V RII++LCSAAKQSL+ESGLHIPPWRTSTYM  K+              KE   +  N + WKPPMVK  
Subjt:  AQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKR------EETEEEKEEKGSAFNYR-WKPPMVKQ-

Query:  RRRLWSGPSALSTQFSNMRI
         RR+W G SALSTQFSNM I
Subjt:  RRRLWSGPSALSTQFSNMRI

TrEMBL top hitse value%identityAlignment
A0A2N9GFS6 Uncharacterized protein7.6e-6058.6Show/hide
Query:  EEILGSGREAEGEVAEAVTKHLR-KKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVAR
        +EI+GSG + E E+ E+V +H+R  K E+ KT+SLKKWLVMK +MDGY +A LCHTSW+TS+GCPAG+YEYI+  +++E G P R+I+DI+FK+QFE+AR
Subjt:  EEILGSGREAEGEVAEAVTKHLR-KKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVAR

Query:  PTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNK-----YKREETEEEKEEKGSAFNY---RW--KPPMVKQRRRLW
        PT  YK+LT+ LP +FVGTE+ + +II+ILCSAAKQSLRE GLHIPPWRT TYMQ+K     +K     E KE KG A  +   +W   PPMVK +R   
Subjt:  PTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNK-----YKREETEEEKEEKGSAFNY---RW--KPPMVKQRRRLW

Query:  SGPSALSTQFSNMRI
        +G SALS+QFSNM I
Subjt:  SGPSALSTQFSNMRI

A0A5N6QJH5 Uncharacterized protein3.2e-5859.72Show/hide
Query:  EEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKV--EDEFG---RPKRMIIDIEFKAQF
        +EILGSG +AE +V E+V KH+R K  + KTTSLKKWLVMKL+ DGYD A LC TSWVTS+GCPAG+YEYI+  +  +DE G      R+I+D++FK+QF
Subjt:  EEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKV--EDEFG---RPKRMIIDIEFKAQF

Query:  EVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYK---REETEEEKEEKGSAFNY-RW--KPPMVKQRR-RL
        E+ARPTA YK+LT+ LP +FVGTEE + +II++LCSAAKQSLR+ GLHIPPWRTSTYMQ+K+    ++  EE +E KG A  Y +W   PPMVK +R  L
Subjt:  EVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYK---REETEEEKEEKGSAFNY-RW--KPPMVKQRR-RL

Query:  WSGPSALSTQFSNMRI
          G S LS+QFS M I
Subjt:  WSGPSALSTQFSNMRI

A0A6J1CPS4 uncharacterized protein LOC1110134536.2e-11096.67Show/hide
Query:  HEIVAHEEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQ
        H + + +EILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQ
Subjt:  HEIVAHEEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQ

Query:  FEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQRRRLWSGPSA
        FEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQRRRLWSGPSA
Subjt:  FEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQRRRLWSGPSA

Query:  LSTQFSNMRI
        LSTQFSNM I
Subjt:  LSTQFSNMRI

A0A6J1G7H8 uncharacterized protein LOC111451460 isoform X19.7e-6362.1Show/hide
Query:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI
        L   +  H     +EILG +GR AE EV+EAV KH+R+K+++PKTT LKKWLVMKL+MDGY S DLCH+SWVTS+GCP G+YEYIE K E      KRMI
Subjt:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI

Query:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R
        IDI+FKAQFEVAR T +YKQLT+ALP+VFVG+EE V +II+ILCSAAKQSL+ESGLHIPPWRTSTYMQ K+     +  K             P+VK   
Subjt:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R

Query:  RRLWSGPSALSTQFSNMRI
        +R+W G SALSTQFSNM I
Subjt:  RRLWSGPSALSTQFSNMRI

A0A6J1KZZ7 uncharacterized protein LOC1114997811.1e-6363.47Show/hide
Query:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI
        L   +  H     +EILG SGR AE EV+EAV KH+R+K ++PKTT LKKWLVMKL+MDGY S DLCHTSWVTS+GCP G+YEYIE KVE      KRMI
Subjt:  LKAIAGMHEIVAHEEILG-SGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMI

Query:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R
        IDI+FKAQFEVAR T +YKQLT+ALP+VFVG+EE V +II+ILCSAAKQSL+ESGLHIPPWRTSTYMQ K+     +  K             P+VK   
Subjt:  IDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQ-R

Query:  RRLWSGPSALSTQFSNMRI
        +R+W G SALSTQFSNM I
Subjt:  RRLWSGPSALSTQFSNMRI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G77145.1 Protein of unknown function (DUF506)1.0e-2439.01Show/hide
Query:  IVAHEEILGSGREAEGEVAEAVTK-----HLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSI----GCP----AGEYEYIETKVEDEFGRP-
        ++  +EIL +  E E E+ E +        L  + +  K   + K +V KLR +GYD A L  TSW +S     GC     + +YEYI+  V+ +  R  
Subjt:  IVAHEEILGSGREAEGEVAEAVTK-----HLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSI----GCP----AGEYEYIETKVEDEFGRP-

Query:  ----KRMIIDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKRE
            KR+IID++FK QFE+AR T AYK +TE LP VFV TE  + R+++++C   K+S+++ G+  PPWRT+ YMQ+K+  E
Subjt:  ----KRMIIDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKRE

AT1G77160.1 Protein of unknown function (DUF506)6.0e-2538.31Show/hide
Query:  KAIAGMHEIVAHEEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKK----WLVMKLRMDGYDSADLCHTSWVTSI----GCP----AGEYEYIETKV-
        KA+  + EI   E +   G E E E+ E +  ++ +   S +    K+     +V KLR +GY +A L  TSW +S     GC     + +YEYI+  V 
Subjt:  KAIAGMHEIVAHEEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKK----WLVMKLRMDGYDSADLCHTSWVTSI----GCP----AGEYEYIETKV-

Query:  ----EDEFGRPKRMIIDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKE-EKG
             D   + KR+IID++FK QFE+AR T AYK +TE LPTVFV TE  + R+++++C   K+S+++ G+  PPWRTS YMQ+K+  E    +   +KG
Subjt:  ----EDEFGRPKRMIIDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKE-EKG

Query:  S
        S
Subjt:  S

AT2G38820.1 Protein of unknown function (DUF506)1.9e-2638.01Show/hide
Query:  GSGREA---EGEVAEAVTKHLRKKLESPKTTSLKKWL--VMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVAR
        GSG E+   E E +      + K L   K+  ++  L  V K+    YD+A LC + W  S  CPAGEYEY++  ++ E     R++IDI+FK++FE+AR
Subjt:  GSGREA---EGEVAEAVTKHLRKKLESPKTTSLKKWL--VMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVAR

Query:  PTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKG
         T  YK + + LP +FVG  + + +II ++C AAKQSL++ GLH+PPWR + Y+++K+       ++   G
Subjt:  PTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKG

AT2G38820.2 Protein of unknown function (DUF506)2.5e-2643.2Show/hide
Query:  GYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIP
        GYD+A LC + W  S  CPAGEYEY++  ++ E     R++IDI+FK++FE+AR T  YK + + LP +FVG  + + +II ++C AAKQSL++ GLH+P
Subjt:  GYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIP

Query:  PWRTSTYMQNKYKREETEEEKEEKG
        PWR + Y+++K+       ++   G
Subjt:  PWRTSTYMQNKYKREETEEEKEEKG

AT4G14620.1 Protein of unknown function (DUF506)1.9e-2637.28Show/hide
Query:  EEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVARP
        + ++  G   E  +    TK + K     +   L+K +V +L   GYDS+ +C + W  +   PAGEYEYI+  V  E     R+IIDI+F+++FE+AR 
Subjt:  EEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFEVARP

Query:  TAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEK
        T+ YK+L ++LP +FVG  + + +I++I+  A+KQSL++ G+H PPWR + YM+ K+    T    E+K
Subjt:  TAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCAATGCTAGTGGGTCGTGTGGTTGGGGTCCCGGAAGTCTAGGCGTTCGGTTGCTTTTAGGGTGGACTACTTACGGAGTATTTATGTACTCACTCCCCCCCATCTA
CCTTATTGTTTCAGGTGTGAACCGGGCCAACCGTGGTGATGACGTGGATGAGGACCGTTACTTCGTGAAACGTGCTAGAAATCTGGGCCTTTACAACACTGGTGTGAGCA
TACGGCAAGGTAGTAGGGTCGTGCAGGTAGTGAGGCTGGCAGCAGGGGGCATGAACTGGGATGAAATGGCGCAAGCGCCTGCAGGCGGGGTGTTGGCGTGCATGCAGGGG
CTACTGAAGGCAATAGCAGGCATGCATGAGATTGTTGCTCACGAGGAGATTCTTGGGAGTGGGAGAGAAGCAGAAGGAGAGGTGGCTGAGGCTGTGACGAAGCATCTCAG
AAAGAAACTGGAATCTCCCAAAACCACCAGTTTGAAGAAATGGCTTGTGATGAAGCTCAGGATGGACGGCTATGATTCCGCTGATCTTTGCCACACCTCTTGGGTCACTT
CCATAGGGTGCCCAGCAGGGGAATATGAGTACATAGAGACGAAAGTGGAGGATGAATTTGGGAGGCCAAAGAGGATGATAATAGACATAGAGTTCAAGGCACAGTTTGAA
GTTGCAAGGCCAACAGCAGCCTACAAGCAGCTCACAGAAGCACTTCCAACAGTGTTTGTAGGGACTGAGGAAAGTGTTTTTAGAATAATCACAATCCTATGCTCAGCAGC
CAAACAGTCCCTTAGGGAGAGTGGGCTCCACATACCCCCTTGGAGGACCTCCACTTACATGCAGAATAAATATAAGAGAGAAGAAACAGAAGAAGAAAAAGAAGAAAAAG
GAAGTGCTTTTAATTACAGGTGGAAGCCTCCCATGGTGAAGCAAAGGAGGAGGCTTTGGAGTGGACCCTCTGCCTTGTCTACTCAATTTTCTAACATGAGAATTTGCACA
ACGGTTCTTCACGAATCGAGCTCGAATCCGGTCTCCGGTTCCGACCTGAACACTAGAGTGGACCTGCACAAGAGGGTAGTGGGCCCGATAGCACACACGACCGGCGATTA
CATGTCTTTTCTCATATCGGACCTGTCGGGTTCCGAGCAGGTCGGACTACAGTCAGTGCTAAGTCTACTACCGCCACTTCAGTTGCTGCTGCTTCGCACTGTCTTCGATA
CTGACAAATTTGAGAAGGACAACAAGACAGTCCACGAGCATCTCTTCAACCAAATGAACAACCCTCTATTTTATATGTTCTTAGTCCAAAAATCTGTACATATTATTTGG
GATACTTTAGAGTCCATATATGGTGGGGACGATGCATGCAGGAAGAAGTATGTCCTCAGTAAATGGCCGCAATTTCAGATGTCAGACGACAAACCATTACTGGACAATGT
TCATCAATACGAAAAGCTGGTGATTGATGTGTTGTTCGAAGGCATGAAAATGTGTGAGATTCTTCAAGTGAACGTGTTGCTAAAGAAATTACCACTTTCATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGCAATGCTAGTGGGTCGTGTGGTTGGGGTCCCGGAAGTCTAGGCGTTCGGTTGCTTTTAGGGTGGACTACTTACGGAGTATTTATGTACTCACTCCCCCCCATCTA
CCTTATTGTTTCAGGTGTGAACCGGGCCAACCGTGGTGATGACGTGGATGAGGACCGTTACTTCGTGAAACGTGCTAGAAATCTGGGCCTTTACAACACTGGTGTGAGCA
TACGGCAAGGTAGTAGGGTCGTGCAGGTAGTGAGGCTGGCAGCAGGGGGCATGAACTGGGATGAAATGGCGCAAGCGCCTGCAGGCGGGGTGTTGGCGTGCATGCAGGGG
CTACTGAAGGCAATAGCAGGCATGCATGAGATTGTTGCTCACGAGGAGATTCTTGGGAGTGGGAGAGAAGCAGAAGGAGAGGTGGCTGAGGCTGTGACGAAGCATCTCAG
AAAGAAACTGGAATCTCCCAAAACCACCAGTTTGAAGAAATGGCTTGTGATGAAGCTCAGGATGGACGGCTATGATTCCGCTGATCTTTGCCACACCTCTTGGGTCACTT
CCATAGGGTGCCCAGCAGGGGAATATGAGTACATAGAGACGAAAGTGGAGGATGAATTTGGGAGGCCAAAGAGGATGATAATAGACATAGAGTTCAAGGCACAGTTTGAA
GTTGCAAGGCCAACAGCAGCCTACAAGCAGCTCACAGAAGCACTTCCAACAGTGTTTGTAGGGACTGAGGAAAGTGTTTTTAGAATAATCACAATCCTATGCTCAGCAGC
CAAACAGTCCCTTAGGGAGAGTGGGCTCCACATACCCCCTTGGAGGACCTCCACTTACATGCAGAATAAATATAAGAGAGAAGAAACAGAAGAAGAAAAAGAAGAAAAAG
GAAGTGCTTTTAATTACAGGTGGAAGCCTCCCATGGTGAAGCAAAGGAGGAGGCTTTGGAGTGGACCCTCTGCCTTGTCTACTCAATTTTCTAACATGAGAATTTGCACA
ACGGTTCTTCACGAATCGAGCTCGAATCCGGTCTCCGGTTCCGACCTGAACACTAGAGTGGACCTGCACAAGAGGGTAGTGGGCCCGATAGCACACACGACCGGCGATTA
CATGTCTTTTCTCATATCGGACCTGTCGGGTTCCGAGCAGGTCGGACTACAGTCAGTGCTAAGTCTACTACCGCCACTTCAGTTGCTGCTGCTTCGCACTGTCTTCGATA
CTGACAAATTTGAGAAGGACAACAAGACAGTCCACGAGCATCTCTTCAACCAAATGAACAACCCTCTATTTTATATGTTCTTAGTCCAAAAATCTGTACATATTATTTGG
GATACTTTAGAGTCCATATATGGTGGGGACGATGCATGCAGGAAGAAGTATGTCCTCAGTAAATGGCCGCAATTTCAGATGTCAGACGACAAACCATTACTGGACAATGT
TCATCAATACGAAAAGCTGGTGATTGATGTGTTGTTCGAAGGCATGAAAATGTGTGAGATTCTTCAAGTGAACGTGTTGCTAAAGAAATTACCACTTTCATAG
Protein sequenceShow/hide protein sequence
MGNASGSCGWGPGSLGVRLLLGWTTYGVFMYSLPPIYLIVSGVNRANRGDDVDEDRYFVKRARNLGLYNTGVSIRQGSRVVQVVRLAAGGMNWDEMAQAPAGGVLACMQG
LLKAIAGMHEIVAHEEILGSGREAEGEVAEAVTKHLRKKLESPKTTSLKKWLVMKLRMDGYDSADLCHTSWVTSIGCPAGEYEYIETKVEDEFGRPKRMIIDIEFKAQFE
VARPTAAYKQLTEALPTVFVGTEESVFRIITILCSAAKQSLRESGLHIPPWRTSTYMQNKYKREETEEEKEEKGSAFNYRWKPPMVKQRRRLWSGPSALSTQFSNMRICT
TVLHESSSNPVSGSDLNTRVDLHKRVVGPIAHTTGDYMSFLISDLSGSEQVGLQSVLSLLPPLQLLLLRTVFDTDKFEKDNKTVHEHLFNQMNNPLFYMFLVQKSVHIIW
DTLESIYGGDDACRKKYVLSKWPQFQMSDDKPLLDNVHQYEKLVIDVLFEGMKMCEILQVNVLLKKLPLS