; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc06g11380 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc06g11380
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionF-box domain-containing protein
Genome locationchr6:8628591..8648126
RNA-Seq ExpressionMoc06g11380
SyntenyMoc06g11380
Gene Ontology termsGO:0003677 - DNA binding (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR012337 - Ribonuclease H-like superfamily
IPR017451 - F-box associated interaction domain
IPR025525 - hAT-like transposase, RNase-H fold


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004135796.1 putative F-box protein At1g46984 [Cucumis sativus]2.1e-4450.67Show/hide
Query:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNA--DRVIRLYHYGFGFSPKT
        LHCV+ DP+  +GMS+VASF+FHPDLSS    I+IIN C+GL+    ++ R     L     V +LNP+TNEYFK P S +  DRV   Y YG GFSP T
Subjt:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNA--DRVIRLYHYGFGFSPKT

Query:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVG-----NGLRQPDTKVIYRLDIEDEEFKFQEIFIPHYGEGSCII
          YKLAR  F +D+F  +V+IFAFGT+ +WT VG VP  +  E HGVY +GGLYWVG     NG     T+VIYRLD++DE  KF++I  P  G     I
Subjt:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVG-----NGLRQPDTKVIYRLDIEDEEFKFQEIFIPHYGEGSCII

Query:  GVFNDTLYLTLSIPDVGYHMCGRWK
         V+N TLYLT    D  YH    WK
Subjt:  GVFNDTLYLTLSIPDVGYHMCGRWK

XP_008450717.1 PREDICTED: putative F-box protein At3g21120 [Cucumis melo]4.7e-4446.53Show/hide
Query:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPP--ISNADRVIRLYHYGFGFSPKT
        LHCV+ DP+  +GMS VASF+FHPD  SA   I+IIN C+GL+     + R           V +LNP+TNEYFK P      D+V   Y YG GFSPKT
Subjt:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPP--ISNADRVIRLYHYGFGFSPKT

Query:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD------TKVIYRLDIEDEEFKFQEIFIPHYGEGSCI
          YKLAR  FA+++F  +V+IFAFGT+ +WT +G VP  +  E HGVYF+GGLYWVG+  + P+      T V+YRLD+ DE  KF++I IP +G     
Subjt:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD------TKVIYRLDIEDEEFKFQEIFIPHYGEGSCI

Query:  IGVFNDTLYLTLSIPDVGYHMCGRWKKIFHGLKHLMCSYQSIIVH
        +GV+N TLYLTL   D  YH+  + K+ F   K  + +    I+H
Subjt:  IGVFNDTLYLTLSIPDVGYHMCGRWKKIFHGLKHLMCSYQSIIVH

XP_022143297.1 F-box/kelch-repeat protein At3g06240-like [Momordica charantia]5.4e-5651.28Show/hide
Query:  LAHLHAFKQHTCHQLHCVDLDPKQ--GMSSVASFSFHPD--LSSADISIINFCNGLLCCYFDRGRPTKTLLPCEG-EVFILNPMTNEYFKPPISNADRVI
        LAH + F  +    L C+ LDPK   GM +VA F FHPD  LSS  ISI+NFCNGLLCC F+  R TK + P E  EV ILNPMTNEYFKPPISNA +  
Subjt:  LAHLHAFKQHTCHQLHCVDLDPKQ--GMSSVASFSFHPD--LSSADISIINFCNGLLCCYFDRGRPTKTLLPCEG-EVFILNPMTNEYFKPPISNADRVI

Query:  RLYHYGFGFSPKTKHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPDTKVIYRLDIEDEEFKFQEIFIPH
          Y Y FGFSPKTK YKL R  ++ DK AIL+E F FGTN K T+V  VP  +  E+H VYF+G LYW+G  L+ P T V+YRLDIE EE   QEI  P 
Subjt:  RLYHYGFGFSPKTKHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPDTKVIYRLDIEDEEFKFQEIFIPH

Query:  YGEGSCIIGVFNDTLYLTLSIPDVGYHMCGRWK--KIFHGLKHLMCSYQSIIVHVTFNLSELVKMERSCHNGR
         G G C IGVFN +LYLTL+     YH    WK  + F   K        I VH        +++ R+C +G+
Subjt:  YGEGSCIIGVFNDTLYLTLSIPDVGYHMCGRWK--KIFHGLKHLMCSYQSIIVHVTFNLSELVKMERSCHNGR

XP_038880514.1 putative F-box protein At1g46984 [Benincasa hispida]1.5e-4549.18Show/hide
Query:  IGLKNLAHLHAFKQHTCHQLHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFK-PPISN
        I + NL   H F       LHCVDLDP+  +GMS+VASF+FHP++SS    ISIIN C+GL+     + R         G V +LNPMTNEYFK P   +
Subjt:  IGLKNLAHLHAFKQHTCHQLHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFK-PPISN

Query:  ADRVIRLYHYGFGFSPKTKHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD---TKVIYRLDIEDEEFK
            ++ Y YG GFSP+TK YKLAR  +   +F  +VEIFAFGT+ +WT VG VP  +  E HGVYF+GGLYWVG G + P+   + VIYRLD+ED+  K
Subjt:  ADRVIRLYHYGFGFSPKTKHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD---TKVIYRLDIEDEEFK

Query:  FQEIFIPHYGEGS---CIIGVFNDTLYLTLSIPDVGYHMCGRWK
        F++I  P  G+       IGV+NDTLYLTL   D  YH+   WK
Subjt:  FQEIFIPHYGEGS---CIIGVFNDTLYLTLSIPDVGYHMCGRWK

XP_038883771.1 putative F-box protein At1g60370 [Benincasa hispida]3.6e-4451.71Show/hide
Query:  LHCVDLDPK--QGMSSVASFSFHPDLSSADISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNADRVI---RLYHYGFGFSPKTK
        L+C+D + K  +GMSSV SF+FHP   ++ ISIIN CNGLL     +    K+  P    + ILNPMTNEYFK P   +       R Y YG GFSP+TK
Subjt:  LHCVDLDPK--QGMSSVASFSFHPDLSSADISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNADRVI---RLYHYGFGFSPKTK

Query:  HYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPDT-KVIYRLDIEDEEFKFQEIFIPHYGEGSCIIGVFND
         YK+AR SF  D+    VEIFAFGT  +WT VG +P L+  EDHGVYF+GGLYW  + L+  D+   IYRLDIE+E  K ++I  PHYG G    GVF+ 
Subjt:  HYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPDT-KVIYRLDIEDEEFKFQEIFIPHYGEGSCIIGVFND

Query:  TLYLT
        TLYLT
Subjt:  TLYLT

TrEMBL top hitse value%identityAlignment
A0A0A0M1M5 F-box domain-containing protein1.0e-4450.67Show/hide
Query:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNA--DRVIRLYHYGFGFSPKT
        LHCV+ DP+  +GMS+VASF+FHPDLSS    I+IIN C+GL+    ++ R     L     V +LNP+TNEYFK P S +  DRV   Y YG GFSP T
Subjt:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNA--DRVIRLYHYGFGFSPKT

Query:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVG-----NGLRQPDTKVIYRLDIEDEEFKFQEIFIPHYGEGSCII
          YKLAR  F +D+F  +V+IFAFGT+ +WT VG VP  +  E HGVY +GGLYWVG     NG     T+VIYRLD++DE  KF++I  P  G     I
Subjt:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVG-----NGLRQPDTKVIYRLDIEDEEFKFQEIFIPHYGEGSCII

Query:  GVFNDTLYLTLSIPDVGYHMCGRWK
         V+N TLYLT    D  YH    WK
Subjt:  GVFNDTLYLTLSIPDVGYHMCGRWK

A0A1S3BQV8 putative F-box protein At3g211202.3e-4446.53Show/hide
Query:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPP--ISNADRVIRLYHYGFGFSPKT
        LHCV+ DP+  +GMS VASF+FHPD  SA   I+IIN C+GL+     + R           V +LNP+TNEYFK P      D+V   Y YG GFSPKT
Subjt:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPP--ISNADRVIRLYHYGFGFSPKT

Query:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD------TKVIYRLDIEDEEFKFQEIFIPHYGEGSCI
          YKLAR  FA+++F  +V+IFAFGT+ +WT +G VP  +  E HGVYF+GGLYWVG+  + P+      T V+YRLD+ DE  KF++I IP +G     
Subjt:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD------TKVIYRLDIEDEEFKFQEIFIPHYGEGSCI

Query:  IGVFNDTLYLTLSIPDVGYHMCGRWKKIFHGLKHLMCSYQSIIVH
        +GV+N TLYLTL   D  YH+  + K+ F   K  + +    I+H
Subjt:  IGVFNDTLYLTLSIPDVGYHMCGRWKKIFHGLKHLMCSYQSIIVH

A0A5D3CIX9 Putative F-box protein2.3e-4446.53Show/hide
Query:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPP--ISNADRVIRLYHYGFGFSPKT
        LHCV+ DP+  +GMS VASF+FHPD  SA   I+IIN C+GL+     + R           V +LNP+TNEYFK P      D+V   Y YG GFSPKT
Subjt:  LHCVDLDPK--QGMSSVASFSFHPDLSSAD--ISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPP--ISNADRVIRLYHYGFGFSPKT

Query:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD------TKVIYRLDIEDEEFKFQEIFIPHYGEGSCI
          YKLAR  FA+++F  +V+IFAFGT+ +WT +G VP  +  E HGVYF+GGLYWVG+  + P+      T V+YRLD+ DE  KF++I IP +G     
Subjt:  KHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPD------TKVIYRLDIEDEEFKFQEIFIPHYGEGSCI

Query:  IGVFNDTLYLTLSIPDVGYHMCGRWKKIFHGLKHLMCSYQSIIVH
        +GV+N TLYLTL   D  YH+  + K+ F   K  + +    I+H
Subjt:  IGVFNDTLYLTLSIPDVGYHMCGRWKKIFHGLKHLMCSYQSIIVH

A0A6J1CQE2 F-box/kelch-repeat protein At3g06240-like2.6e-5651.28Show/hide
Query:  LAHLHAFKQHTCHQLHCVDLDPKQ--GMSSVASFSFHPD--LSSADISIINFCNGLLCCYFDRGRPTKTLLPCEG-EVFILNPMTNEYFKPPISNADRVI
        LAH + F  +    L C+ LDPK   GM +VA F FHPD  LSS  ISI+NFCNGLLCC F+  R TK + P E  EV ILNPMTNEYFKPPISNA +  
Subjt:  LAHLHAFKQHTCHQLHCVDLDPKQ--GMSSVASFSFHPD--LSSADISIINFCNGLLCCYFDRGRPTKTLLPCEG-EVFILNPMTNEYFKPPISNADRVI

Query:  RLYHYGFGFSPKTKHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPDTKVIYRLDIEDEEFKFQEIFIPH
          Y Y FGFSPKTK YKL R  ++ DK AIL+E F FGTN K T+V  VP  +  E+H VYF+G LYW+G  L+ P T V+YRLDIE EE   QEI  P 
Subjt:  RLYHYGFGFSPKTKHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQPDTKVIYRLDIEDEEFKFQEIFIPH

Query:  YGEGSCIIGVFNDTLYLTLSIPDVGYHMCGRWK--KIFHGLKHLMCSYQSIIVHVTFNLSELVKMERSCHNGR
         G G C IGVFN +LYLTL+     YH    WK  + F   K        I VH        +++ R+C +G+
Subjt:  YGEGSCIIGVFNDTLYLTLSIPDVGYHMCGRWK--KIFHGLKHLMCSYQSIIVHVTFNLSELVKMERSCHNGR

A0A6J1DJA3 F-box/LRR-repeat protein At2g43260-like8.7e-4459.66Show/hide
Query:  GMSSVASFSFHPDLSSADISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNADRVIRLYHYGFGFSPKTKHYKLARLSFAYDKFA
        G SSVASFSFH +   A ISII+ CNGLLCCY    R  +       E FILNPMTNEYFKPP SNAD  +  Y +GFGFS KTK YKL RLSFA++K A
Subjt:  GMSSVASFSFHPDLSSADISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNADRVIRLYHYGFGFSPKTKHYKLARLSFAYDKFA

Query:  ILVEIFAFGTN-GKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQP--DTKVIYRLDIEDEEFKFQEIFIPHYG
         LVEIF  GT+  KW  VGF+PP +   D GVYF+GGLYWV N   QP   T V+YRLDIEDEE    EI  P  G
Subjt:  ILVEIFAFGTN-GKWTHVGFVPPLIRWEDHGVYFDGGLYWVGNGLRQP--DTKVIYRLDIEDEEFKFQEIFIPHYG

SwissProt top hitse value%identityAlignment
B9FJG3 Zinc finger BED domain-containing protein RICESLEEPER 14.3e-0826.04Show/hide
Query:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR
        + +A N TSN+ FHE   +Q  + N +    +P+ +S+   M  ++DK W   +  N +L +A V+DPR+KM  + + +++++    AK + K V+D + 
Subjt:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR

Query:  QLFTEYNLS-MPKNVSSTSKG---------RMSQSIVASTSDSVVQSFRFQSYKCNQRKGKPEQKHVPQ
        +L+ EY    +P   +   +G           +Q+   ST D +V    F  Y       +P +  + Q
Subjt:  QLFTEYNLS-MPKNVSSTSKG---------RMSQSIVASTSDSVVQSFRFQSYKCNQRKGKPEQKHVPQ

Q0JMB2 Zinc finger BED domain-containing protein RICESLEEPER 42.0e-0526.73Show/hide
Query:  NSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLRQLFTE
        + T+NM FH+   +Q  ++N      + ++  +V  +  K+DK W   +  N +L +A  +DPR+KM  + + +++++    A +  K V+D +  L+ E
Subjt:  NSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLRQLFTE

Query:  Y
        Y
Subjt:  Y

Q6AVI0 Zinc finger BED domain-containing protein RICESLEEPER 27.4e-0826.16Show/hide
Query:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR
        + +A N TSN+ FHE   +Q  + N +    +P+ +S+   M  ++DK W   +  N +L +A V+DPR+KM  + + +++++    AK + K V+D + 
Subjt:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR

Query:  QLFTEYNLSMP-------------KNVSSTSKGRMSQSIVASTSDSVVQSFRFQSYKCNQRKGKPEQKHVPQ
        +L+ EY ++ P              N  ++  G  +Q+   ST D +V    F  Y       +P +  + Q
Subjt:  QLFTEYNLSMP-------------KNVSSTSKGRMSQSIVASTSDSVVQSFRFQSYKCNQRKGKPEQKHVPQ

Q75HY5 Zinc finger BED domain-containing protein RICESLEEPER 32.5e-0832.08Show/hide
Query:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR
        + +A N TSN+ FHE   +Q+ + N +    +PI  S    M  ++DK W   +  N +L +A V+DPR+KM  + + ++++     AK + K V+D + 
Subjt:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR

Query:  QLFTEY
        +L++EY
Subjt:  QLFTEY

Q9M2N5 Zinc finger BED domain-containing protein DAYSLEEPER4.8e-0728.47Show/hide
Query:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR
        + S  N ++   FHE+   Q+ + + +    +P +  +   MQ K DK W      + +L +A V+DPR+KM  + + F+++F  +  K I K V+D + 
Subjt:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR

Query:  QLFTEYNLSMPKNVSSTSKGRMSQSIVASTSDSVVQSFRFQSYK
        +LFTEY +++P   ++TS+G  +  +  S  D+ +     Q+ K
Subjt:  QLFTEYNLSMPKNVSSTSKGRMSQSIVASTSDSVVQSFRFQSYK

Arabidopsis top hitse value%identityAlignment
AT3G42170.1 BED zinc finger ;hAT family dimerisation domain3.4e-0828.47Show/hide
Query:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR
        + S  N ++   FHE+   Q+ + + +    +P +  +   MQ K DK W      + +L +A V+DPR+KM  + + F+++F  +  K I K V+D + 
Subjt:  VSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEANVAKEIGKKVEDVLR

Query:  QLFTEYNLSMPKNVSSTSKGRMSQSIVASTSDSVVQSFRFQSYK
        +LFTEY +++P   ++TS+G  +  +  S  D+ +     Q+ K
Subjt:  QLFTEYNLSMPKNVSSTSKGRMSQSIVASTSDSVVQSFRFQSYK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACCACCAAAAAAGGGCTATGGATTCACAGCGTTTCACCATATCCATCGAGTCCATTCATGGACCACCAAAAGAAGGGCTATGGATTCACATATGTTTCACTACATC
CATCAAGTCCATACGTGGACCACCCAAAAGAAAGGGCTATGGATTCATGCCACTGAGTGGTTGTGATAATCACTCCAGGGAATTTCTCGAGCAACACATTGGCCTGAAGA
ATCTCGCACATCTTCATGCCTTCAAACAACACACTTGCCACCAGCTCCATTGTGTGGACTTGGATCCTAAACAAGGGATGAGCAGTGTTGCCTCATTTAGTTTCCATCCT
GATTTGTCCTCTGCTGATATCTCAATCATCAATTTTTGCAATGGACTCCTCTGTTGTTATTTTGACCGTGGACGACCCACCAAAACACTACTACCTTGTGAAGGTGAAGT
TTTCATATTAAATCCCATGACAAATGAGTACTTCAAACCTCCCATCTCGAACGCGGATAGGGTTATACGTCTATATCATTATGGATTTGGGTTTAGTCCTAAAACAAAAC
ACTACAAATTAGCAAGACTCTCCTTTGCCTATGATAAATTTGCTATCTTAGTGGAAATTTTTGCATTTGGCACAAATGGAAAATGGACCCATGTTGGTTTTGTGCCTCCT
TTAATAAGATGGGAAGACCATGGTGTTTACTTCGATGGAGGTCTCTATTGGGTTGGAAATGGCCTACGACAACCTGATACGAAGGTCATATACCGTTTGGATATAGAAGA
TGAGGAATTCAAATTCCAAGAAATTTTTATTCCCCATTATGGCGAGGGTTCTTGTATAATTGGAGTCTTTAATGACACTCTCTATCTAACTCTGTCCATCCCAGATGTCG
GTTACCATATGTGTGGAAGATGGAAGAAGATTTTTCATGGATTAAAGCATTTGATGTGCTCCTACCAGAGCATCATCGTCCATGTCACCTTCAACTTATCAGAGCTTGTG
AAGATGGAAAGATCTTGTCACAATGGTCGTGCCGAATTTTGCTTTCAAGGCAGTGGGTTCCACGACACAACAACGTCTTCTTCCTCATATAGCTCTGACAATTTGAAAGC
AAAATTTATGGCTTTAATAAGATTCGGCAAGTCACCAACGTCACAGTCGTCGTCGCTGCCGCCGCTGGAACTGTCGCTGGAGCCACGGGAAATGTCGTCGGAGCTGTCAT
CGGACACTGCCGCTGTTGATGCCGCCAATAAATTCGAGAAGGATAACAAGACTGTTTGGCAGTACAAAACTGTCCACACCCGAACCGACTCGTCTCTTTTATGTTTTTTT
TTTAATTCTCCTGCTCTGTTTCCGTTCCATTCCACGAGTCTGCAATTCCGCTTGGTGGGCCGCCCTGTGCCTGTGTCGTCAGCGGTGAATAGTACTTCCAATATGGTCTT
CCATGAGATTTCTACTATCCAAAATTGTATAAGAAACTATTCTTGTGATAGCAGTAATCCCATTCTTGCTAGCGTAGTGACCAAAATGCAGCTGAAATATGATAAGGATT
GGGGGAATACTGAAAAGGCGAATTATTTATTATATGTTGCATTTGTTCTCGACCCTCGTTATAAGATGAACTTTCTTATGTATTGTTTCACTCAACTATTCGAAGCTAAT
GTTGCAAAGGAAATAGGAAAGAAAGTGGAAGATGTTTTGAGACAACTATTTACTGAATATAATTTGTCTATGCCTAAGAATGTTTCTTCTACTTCGAAGGGTCGAATGAG
TCAATCTATTGTTGCTTCTACTAGTGATAGTGTTGTACAGTCTTTCCGCTTTCAATCATACAAGTGCAACCAGAGAAAAGGAAAGCCAGAACAAAAGCATGTTCCACAAG
CCAACATTGCTGAGACGGACGACATCATTGCTGTTGTAGTTGTGGAAGCAAACTTAGTTGAAAACAAGTCTGACTGGGTTCTAGACACTGGAGCTTCCCTGCACTTCTGC
TCAAACCGAAACCTCTTTCACGATGAACCCACGACACAACAACGTCTTCTTCCTCATATTGCTCTGACAGTGACCCCTCGACTTTTGACTTTGTGGATTTTGTTGGACAT
TTCGCTACCGATCGAGGGCAATTCTAGCATTGTCCGAGGAGATGAAGGTGAGTCTGTGGATGAGGAGATGTTCCATCGGCATTCCAAATCGAATCCCGTATTGGTGTCAT
CTGCTCTTGATGGCAATGCAGTCATCGTAGAAGATGATGAAAAAGGAATTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACCACCAAAAAAGGGCTATGGATTCACAGCGTTTCACCATATCCATCGAGTCCATTCATGGACCACCAAAAGAAGGGCTATGGATTCACATATGTTTCACTACATC
CATCAAGTCCATACGTGGACCACCCAAAAGAAAGGGCTATGGATTCATGCCACTGAGTGGTTGTGATAATCACTCCAGGGAATTTCTCGAGCAACACATTGGCCTGAAGA
ATCTCGCACATCTTCATGCCTTCAAACAACACACTTGCCACCAGCTCCATTGTGTGGACTTGGATCCTAAACAAGGGATGAGCAGTGTTGCCTCATTTAGTTTCCATCCT
GATTTGTCCTCTGCTGATATCTCAATCATCAATTTTTGCAATGGACTCCTCTGTTGTTATTTTGACCGTGGACGACCCACCAAAACACTACTACCTTGTGAAGGTGAAGT
TTTCATATTAAATCCCATGACAAATGAGTACTTCAAACCTCCCATCTCGAACGCGGATAGGGTTATACGTCTATATCATTATGGATTTGGGTTTAGTCCTAAAACAAAAC
ACTACAAATTAGCAAGACTCTCCTTTGCCTATGATAAATTTGCTATCTTAGTGGAAATTTTTGCATTTGGCACAAATGGAAAATGGACCCATGTTGGTTTTGTGCCTCCT
TTAATAAGATGGGAAGACCATGGTGTTTACTTCGATGGAGGTCTCTATTGGGTTGGAAATGGCCTACGACAACCTGATACGAAGGTCATATACCGTTTGGATATAGAAGA
TGAGGAATTCAAATTCCAAGAAATTTTTATTCCCCATTATGGCGAGGGTTCTTGTATAATTGGAGTCTTTAATGACACTCTCTATCTAACTCTGTCCATCCCAGATGTCG
GTTACCATATGTGTGGAAGATGGAAGAAGATTTTTCATGGATTAAAGCATTTGATGTGCTCCTACCAGAGCATCATCGTCCATGTCACCTTCAACTTATCAGAGCTTGTG
AAGATGGAAAGATCTTGTCACAATGGTCGTGCCGAATTTTGCTTTCAAGGCAGTGGGTTCCACGACACAACAACGTCTTCTTCCTCATATAGCTCTGACAATTTGAAAGC
AAAATTTATGGCTTTAATAAGATTCGGCAAGTCACCAACGTCACAGTCGTCGTCGCTGCCGCCGCTGGAACTGTCGCTGGAGCCACGGGAAATGTCGTCGGAGCTGTCAT
CGGACACTGCCGCTGTTGATGCCGCCAATAAATTCGAGAAGGATAACAAGACTGTTTGGCAGTACAAAACTGTCCACACCCGAACCGACTCGTCTCTTTTATGTTTTTTT
TTTAATTCTCCTGCTCTGTTTCCGTTCCATTCCACGAGTCTGCAATTCCGCTTGGTGGGCCGCCCTGTGCCTGTGTCGTCAGCGGTGAATAGTACTTCCAATATGGTCTT
CCATGAGATTTCTACTATCCAAAATTGTATAAGAAACTATTCTTGTGATAGCAGTAATCCCATTCTTGCTAGCGTAGTGACCAAAATGCAGCTGAAATATGATAAGGATT
GGGGGAATACTGAAAAGGCGAATTATTTATTATATGTTGCATTTGTTCTCGACCCTCGTTATAAGATGAACTTTCTTATGTATTGTTTCACTCAACTATTCGAAGCTAAT
GTTGCAAAGGAAATAGGAAAGAAAGTGGAAGATGTTTTGAGACAACTATTTACTGAATATAATTTGTCTATGCCTAAGAATGTTTCTTCTACTTCGAAGGGTCGAATGAG
TCAATCTATTGTTGCTTCTACTAGTGATAGTGTTGTACAGTCTTTCCGCTTTCAATCATACAAGTGCAACCAGAGAAAAGGAAAGCCAGAACAAAAGCATGTTCCACAAG
CCAACATTGCTGAGACGGACGACATCATTGCTGTTGTAGTTGTGGAAGCAAACTTAGTTGAAAACAAGTCTGACTGGGTTCTAGACACTGGAGCTTCCCTGCACTTCTGC
TCAAACCGAAACCTCTTTCACGATGAACCCACGACACAACAACGTCTTCTTCCTCATATTGCTCTGACAGTGACCCCTCGACTTTTGACTTTGTGGATTTTGTTGGACAT
TTCGCTACCGATCGAGGGCAATTCTAGCATTGTCCGAGGAGATGAAGGTGAGTCTGTGGATGAGGAGATGTTCCATCGGCATTCCAAATCGAATCCCGTATTGGTGTCAT
CTGCTCTTGATGGCAATGCAGTCATCGTAGAAGATGATGAAAAAGGAATTTGA
Protein sequenceShow/hide protein sequence
MDHQKRAMDSQRFTISIESIHGPPKEGLWIHICFTTSIKSIRGPPKRKGYGFMPLSGCDNHSREFLEQHIGLKNLAHLHAFKQHTCHQLHCVDLDPKQGMSSVASFSFHP
DLSSADISIINFCNGLLCCYFDRGRPTKTLLPCEGEVFILNPMTNEYFKPPISNADRVIRLYHYGFGFSPKTKHYKLARLSFAYDKFAILVEIFAFGTNGKWTHVGFVPP
LIRWEDHGVYFDGGLYWVGNGLRQPDTKVIYRLDIEDEEFKFQEIFIPHYGEGSCIIGVFNDTLYLTLSIPDVGYHMCGRWKKIFHGLKHLMCSYQSIIVHVTFNLSELV
KMERSCHNGRAEFCFQGSGFHDTTTSSSSYSSDNLKAKFMALIRFGKSPTSQSSSLPPLELSLEPREMSSELSSDTAAVDAANKFEKDNKTVWQYKTVHTRTDSSLLCFF
FNSPALFPFHSTSLQFRLVGRPVPVSSAVNSTSNMVFHEISTIQNCIRNYSCDSSNPILASVVTKMQLKYDKDWGNTEKANYLLYVAFVLDPRYKMNFLMYCFTQLFEAN
VAKEIGKKVEDVLRQLFTEYNLSMPKNVSSTSKGRMSQSIVASTSDSVVQSFRFQSYKCNQRKGKPEQKHVPQANIAETDDIIAVVVVEANLVENKSDWVLDTGASLHFC
SNRNLFHDEPTTQQRLLPHIALTVTPRLLTLWILLDISLPIEGNSSIVRGDEGESVDEEMFHRHSKSNPVLVSSALDGNAVIVEDDEKGI