; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc02g02000 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc02g02000
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr2:1506809..1508089
RNA-Seq ExpressionMoc02g02000
SyntenyMoc02g02000
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]4.9e-13980.49Show/hide
Query:  MGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP
        M TQM +ME+MY+EM+ AAGA SRSENR+ R ++ EQRG HLGPV++ H E  E E +T +RGDLREHLN+KR SSLRKGQSPS SHR+SNQQAESS+NP
Subjt:  MGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP

Query:  ATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
         TP GVITR EFDQL+ K DAQVEALKAKCE+KE S +DGDLGE PFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  ATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEA
        IALTGSA+LWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT L DE LTVKL EEA
Subjt:  IALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEA

Query:  LATFAEVLQKAKKVIDGQELLRTKTGRP
         ATF EVLQKAKK+IDGQELLRTKT RP
Subjt:  LATFAEVLQKAKKVIDGQELLRTKTGRP

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]3.0e-12894.55Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITR EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGE PFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT SA+LWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP
         HCSDDSAMCYFLT L DEA TVKLGEEA ATFAEVLQKAKKVIDGQELLRTKTGRP
Subjt:  AHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP

XP_022155128.1 uncharacterized protein LOC111022267 [Momordica charantia]4.9e-12383.39Show/hide
Query:  DNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDV
        + E E +T +RGDLREHLN+KR SSLRKGQSPS SHR+SNQQAESS+NP TP  VITR EFDQL+ K DAQVEALKA CE+KE S +DGDLGELPFT D+
Subjt:  DNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ
        LEAPI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA TGSA+LWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP
        K+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLT L D+ LTVKLGEEA ATFAEVLQKAKKVIDGQELLRTKTGRP
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP

XP_022156088.1 uncharacterized protein LOC111023060 [Momordica charantia]1.5e-12472.21Show/hide
Query:  MEAMGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAM TQMR+MEEMYN+M+  AGA SRS +++   ++ EQ   H  PV+EEHL            GDLR+HLN+KR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
        +NP  P GVITR EF+QL+ K DAQVEALK +CE+KE + +DGDLGE PFTSD+LEA IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLG
        AFQIALTGSA+LWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLT L DE  TVKLG
Subjt:  AFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLG

Query:  EEALATFAEVLQKAKKVIDGQELLRTKTGRP
        EEALATFAEVLQ  KK IDGQELLRTKT RP
Subjt:  EEALATFAEVLQKAKKVIDGQELLRTKTGRP

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]1.1e-15772.47Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMG
        MVQP +STNT DRR L A+D HQREV A VVEGQ H+GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G APAP+ ENFDALQ+EMEAM 
Subjt:  MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMG

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
        TQM +MEEMYNEM+ A GAGSRSE+R  R E                            RGDLR+HL++KR SSLRKG+SPS SH++SNQQAESS+NP  
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITR EFDQL+ K DAQVE LKA+CE K  + +DGDLGE PFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALA
        LT SA+LWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLTDL DE LTVKLGEEA A
Subjt:  LTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALA

Query:  TFAEVLQKAKKVIDGQELLRTKTGR
        TFAEVLQKAKKVIDGQEL RTKTGR
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGR

TrEMBL top hitse value%identityAlignment
A0A6J1DDS5 uncharacterized protein LOC1110198421.4e-12894.55Show/hide
Query:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITR EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGE PFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQIALT SA+LWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP
         HCSDDSAMCYFLT L DEA TVKLGEEA ATFAEVLQKAKKVIDGQELLRTKTGRP
Subjt:  AHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP

A0A6J1DDW5 uncharacterized protein LOC1110196342.4e-13980.49Show/hide
Query:  MGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP
        M TQM +ME+MY+EM+ AAGA SRSENR+ R ++ EQRG HLGPV++ H E  E E +T +RGDLREHLN+KR SSLRKGQSPS SHR+SNQQAESS+NP
Subjt:  MGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNP

Query:  ATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ
         TP GVITR EFDQL+ K DAQVEALKAKCE+KE S +DGDLGE PFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  ATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEA
        IALTGSA+LWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SAMCYFLT L DE LTVKL EEA
Subjt:  IALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEA

Query:  LATFAEVLQKAKKVIDGQELLRTKTGRP
         ATF EVLQKAKK+IDGQELLRTKT RP
Subjt:  LATFAEVLQKAKKVIDGQELLRTKTGRP

A0A6J1DM55 uncharacterized protein LOC1110222672.4e-12383.39Show/hide
Query:  DNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDV
        + E E +T +RGDLREHLN+KR SSLRKGQSPS SHR+SNQQAESS+NP TP  VITR EFDQL+ K DAQVEALKA CE+KE S +DGDLGELPFT D+
Subjt:  DNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDV

Query:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ
        LEAPI PKFK PT+KPYDGSK+PKDYV+VFEGLM+FQAA+DAIKCRAFQIA TGSA+LWYRRLPARSISTYSQLR+EF++QFSSR+YD+KTATHLATIRQ
Subjt:  LEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQ

Query:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP
        K+GETLREYVTRFQEEQLKVAHCSDDSAMCYFLT L D+ LTVKLGEEA ATFAEVLQKAKKVIDGQELLRTKTGRP
Subjt:  KEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP

A0A6J1DPN4 uncharacterized protein LOC1110230607.4e-12572.21Show/hide
Query:  MEAMGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESS
        MEAM TQMR+MEEMYN+M+  AGA SRS +++   ++ EQ   H  PV+EEHL            GDLR+HLN+KR SS R  ++ +  H++SNQQAESS
Subjt:  MEAMGTQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESS

Query:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR
        +NP  P GVITR EF+QL+ K DAQVEALK +CE+KE + +DGDLGE PFTSD+LEA IPPKFK PT+K YDGSKDPKDYVEVFEGLMDFQAA+DAIKCR
Subjt:  HNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCR

Query:  AFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLG
        AFQIALTGSA+LWYRRLPARSISTYSQLR+EF++QF SRHYD+KT THLATIRQKEG+TL+EY+TRFQEEQLKV HCSDDS+MCYFLT L DE  TVKLG
Subjt:  AFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLG

Query:  EEALATFAEVLQKAKKVIDGQELLRTKTGRP
        EEALATFAEVLQ  KK IDGQELLRTKT RP
Subjt:  EEALATFAEVLQKAKKVIDGQELLRTKTGRP

A0A6J1DZJ1 uncharacterized protein LOC1110257385.1e-15872.47Show/hide
Query:  MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMG
        MVQP +STNT DRR L A+D HQREV A VVEGQ H+GL TEP  RSARIT P L PAHP+  KA RGRGG S++   G APAP+ ENFDALQ+EMEAM 
Subjt:  MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMG

Query:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT
        TQM +MEEMYNEM+ A GAGSRSE+R  R E                            RGDLR+HL++KR SSLRKG+SPS SH++SNQQAESS+NP  
Subjt:  TQMRSMEEMYNEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPAT

Query:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA
        P GVITR EFDQL+ K DAQVE LKA+CE K  + +DGDLGE PFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQIA
Subjt:  PAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIA

Query:  LTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALA
        LT SA+LWYRRLPARSISTYSQLR+EF +QFSSRHY++KTATHLATIRQKE ETLREYVT FQEEQLKVAH SDDSA+CYFLTDL DE LTVKLGEEA A
Subjt:  LTGSAQLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALA

Query:  TFAEVLQKAKKVIDGQELLRTKTGR
        TFAEVLQKAKKVIDGQEL RTKTGR
Subjt:  TFAEVLQKAKKVIDGQELLRTKTGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCAGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCTTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGGGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGATGACGCGCATTGAGATACTCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCTCGAAGACAACGAGAGCGAGGGACACACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAAAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAATCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATTGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGATCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCAGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCAATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATT
GAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGATCTAACCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCTGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGTTCAACCAGCAAACTCGACCAATACGGCAGATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCAGAGCAGCAGTGGTAGAGGGGCAAGGTCACGA
CGGCTTAGCAACAGAACCCCTCCGCAGGTCGGCACGAATCACCGCGCCTGTCCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTA
AGAAGGGCGCCCGGGGTCCAGCCCCGGCTCCAACAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGGGCACACAAATGCGGTCCATGGAGGAAATGTAT
AACGAGATGATACTAGCTGCAGGCGCAGGGTCCCGATCTGAAAATCGGATGACGCGCATTGAGATACTCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACA
TCTCGAAGACAACGAGAGCGAGGGACACACTCGCCGGAGGGGAGACCTCCGTGAGCATCTCAACAAAAAGAGAGGCTCATCTCTCCGAAAAGGACAGTCACCATCCCGCT
CACACCGGAGCTCCAATCAGCAGGCTGAATCCTCTCACAACCCAGCAACTCCCGCAGGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAG
GTGGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATTGCCCTTCACCTCGGACGTTTTGGAAGCACCGATCCCTCCGAA
GTTCAAAGCTCCTACCGTGAAGCCTTATGATGGGTCGAAGGATCCCAAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCAGCATCAGACGCAATCAAAT
GCCGCGCCTTTCAGATCGCGCTTACTGGCAGCGCGCAATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAG
TTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAATT
GAAGGTTGCACACTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGATCTAACCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGGCCCTGGCCACCTTCGCCG
AGGTGCTTCAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAAAACCGGCCGACCCTAA
Protein sequenceShow/hide protein sequence
MVQPANSTNTADRRTLAASDAHQREVRAAVVEGQGHDGLATEPLRRSARITAPVLPPAHPRTSKATRGRGGTSKKGARGPAPAPTSENFDALQREMEAMGTQMRSMEEMY
NEMILAAGAGSRSENRMTRIEILEQRGSHLGPVEEEHLEDNESEGHTRRRGDLREHLNKKRGSSLRKGQSPSRSHRSSNQQAESSHNPATPAGVITRAEFDQLRGKLDAQ
VEALKAKCEQKEGSLNDGDLGELPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIALTGSAQLWYRRLPARSISTYSQLRREFLAQ
FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAMCYFLTDLTDEALTVKLGEEALATFAEVLQKAKKVIDGQELLRTKTGRP