; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g00770 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g00770
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:488228..489574
RNA-Seq ExpressionMoc07g00770
SyntenyMoc07g00770
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]4.2e-14393.95Show/hide
Query:  QVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        + ESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+I LTGSARLWYRRLPA SISTYSQLRREFL+ FSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP
        TVKLGEE+PATFAEVLQKAKKVIDGQELLRTK GRPER+IGRG+SGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRP
Subjt:  TVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]4.6e-14277.12Show/hide
Query:  AGSRSENRVTHVDLREQRGSHLGPAEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLD
        A SRSENRV   D+ EQRG HLGP ++ HPE  E E +T QRGDLREHLNRKR SSL+KGQSPS SHR+SNQQ ESS+NP TP GVITR EFDQL+ K D
Subjt:  AGSRSENRVTHVDLREQRGSHLGPAEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLD

Query:  AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSIS
        AQVEALKAKCE+KE   +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQI LTGSARLWYRRLPARSIS
Subjt:  AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSIS

Query:  TYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQEL
        TYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQE+QLKVAHCSD SAMCYFLT LADE LTVKL EE+PATF EVLQKAKK+IDGQEL
Subjt:  TYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQEL

Query:  LRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRP
        LRTK  RPE++I +G++ KD  K D K++DKG  S S R  YRR++N   RSRP
Subjt:  LRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRP

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]1.0e-14992.38Show/hide
Query:  KRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSL+KGQSPSRSHRSSNQQ ESSHNPATPAGVITR EFDQLRGKLDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQI LT SARLWYRRLPARSISTYSQLRREFL+QFSSRHYDK+TATHLATIRQKEGETLREYVTRFQE+QLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS
         HCSDDSAMCYFLTGLADEA TVKLGEE+PATFAEVLQKAKKVIDGQELLRTK GRPER+IGRG+SGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS

Query:  RP
        RP
Subjt:  RP

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]1.5e-13793.68Show/hide
Query:  GVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELT
        G+ITR EFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQI LT
Subjt:  GVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELT

Query:  GSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATF
        GSARLWYRRLP RSISTYSQLRREFL+QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLADEALTVKLGEE+PATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATF

Query:  AEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP
        AEVLQKAKKVIDGQELLRTK GRPER+IGRG+SGKD+E+ADPKSKDKGSFSSGRAEYRRAENGPTRSRP
Subjt:  AEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP

XP_022159327.1 uncharacterized protein LOC111025738 [Momordica charantia]3.4e-13766.75Show/hide
Query:  EVGAAVVEGQGHDGLATEPLRRSARISAPVLPPAHPRTSKATRGRGGTSKKGARGPALAPTSAGSRSENRVTHVDLREQRGSHLGPAEEEHPE-------
        EVGA VVEGQ H+GL TEP  RSARI+ P L PAHP+  KA RGRGG S++   G A AP    SR        ++   R   L   EE + E       
Subjt:  EVGAAVVEGQGHDGLATEPLRRSARISAPVLPPAHPRTSKATRGRGGTSKKGARGPALAPTSAGSRSENRVTHVDLREQRGSHLGPAEEEHPE-------

Query:  --DNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTS
           +E      +RGDLR+HL+RKR SSL+KG+SPS SH++SNQQ ESS+NP  P GVITR EFDQL+ K DAQVE LKA+CE K    +DGDLGESPFTS
Subjt:  --DNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTS

Query:  DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATI
        D+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQI LT SARLWYRRLPARSISTYSQLR+EF SQFSSRHY++KTATHLATI
Subjt:  DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATI

Query:  RQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSK
        RQKE ETLREYVT FQE+QLKVAH SDDSA+CYFLT L DE LTVKLGEE+PATFAEVLQKAKKVIDGQEL RTK GR E++I + K  ++  KA+ KSK
Subjt:  RQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSK

Query:  DKGSFSSGRAEYRRAENGPTRSRP
        DK       AEYRR+++GP+RSRP
Subjt:  DKGSFSSGRAEYRRAENGPTRSRP

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088132.0e-14393.95Show/hide
Query:  QVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD
        + ESS NPATPAGVITR EFDQLRG+LDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFE LMDFQAASD
Subjt:  QVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASD

Query:  AIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEAL
        AIKCRAF+I LTGSARLWYRRLPA SISTYSQLRREFL+ FSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLADEAL
Subjt:  AIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEAL

Query:  TVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP
        TVKLGEE+PATFAEVLQKAKKVIDGQELLRTK GRPER+IGRG+SGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRP
Subjt:  TVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP

A0A6J1DDS5 uncharacterized protein LOC1110198425.0e-15092.38Show/hide
Query:  KRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS
        +RGSSL+KGQSPSRSHRSSNQQ ESSHNPATPAGVITR EFDQLRGKLDAQVEALKAKCEQKEG LNDGDLGESPFTSDVLEAPIP KFKAPTVKPYDGS
Subjt:  KRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGS

Query:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV
        +DPKDYVEVFEGLMDFQAASD IKCRAFQI LT SARLWYRRLPARSISTYSQLRREFL+QFSSRHYDK+TATHLATIRQKEGETLREYVTRFQE+QLKV
Subjt:  KDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKV

Query:  AHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS
         HCSDDSAMCYFLTGLADEA TVKLGEE+PATFAEVLQKAKKVIDGQELLRTK GRPER+IGRG+SGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Subjt:  AHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS

Query:  RP
        RP
Subjt:  RP

A0A6J1DDW5 uncharacterized protein LOC1110196342.2e-14277.12Show/hide
Query:  AGSRSENRVTHVDLREQRGSHLGPAEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLD
        A SRSENRV   D+ EQRG HLGP ++ HPE  E E +T QRGDLREHLNRKR SSL+KGQSPS SHR+SNQQ ESS+NP TP GVITR EFDQL+ K D
Subjt:  AGSRSENRVTHVDLREQRGSHLGPAEEEHPEDNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLD

Query:  AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSIS
        AQVEALKAKCE+KE   +DGDLGESPFTSD+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLMDFQAA+DAIKCR FQI LTGSARLWYRRLPARSIS
Subjt:  AQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSIS

Query:  TYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQEL
        TYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQE+QLKVAHCSD SAMCYFLT LADE LTVKL EE+PATF EVLQKAKK+IDGQEL
Subjt:  TYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQEL

Query:  LRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRP
        LRTK  RPE++I +G++ KD  K D K++DKG  S S R  YRR++N   RSRP
Subjt:  LRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRP

A0A6J1DS95 uncharacterized protein LOC1110234217.5e-13893.68Show/hide
Query:  GVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELT
        G+ITR EFDQLRG+LDAQVEALKAKCEQK+  LNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDG+KDPKDYVEVFEGLMDFQAASDAIKCRAFQI LT
Subjt:  GVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELT

Query:  GSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATF
        GSARLWYRRLP RSISTYSQLRREFL+QFSSRHYDKKTATHLATIRQKEGETLREYVTRFQE+QLKVAHCSDDSAMCYFLTGLADEALTVKLGEE+PATF
Subjt:  GSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATF

Query:  AEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP
        AEVLQKAKKVIDGQELLRTK GRPER+IGRG+SGKD+E+ADPKSKDKGSFSSGRAEYRRAENGPTRSRP
Subjt:  AEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP

A0A6J1DZJ1 uncharacterized protein LOC1110257381.7e-13766.75Show/hide
Query:  EVGAAVVEGQGHDGLATEPLRRSARISAPVLPPAHPRTSKATRGRGGTSKKGARGPALAPTSAGSRSENRVTHVDLREQRGSHLGPAEEEHPE-------
        EVGA VVEGQ H+GL TEP  RSARI+ P L PAHP+  KA RGRGG S++   G A AP    SR        ++   R   L   EE + E       
Subjt:  EVGAAVVEGQGHDGLATEPLRRSARISAPVLPPAHPRTSKATRGRGGTSKKGARGPALAPTSAGSRSENRVTHVDLREQRGSHLGPAEEEHPE-------

Query:  --DNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTS
           +E      +RGDLR+HL+RKR SSL+KG+SPS SH++SNQQ ESS+NP  P GVITR EFDQL+ K DAQVE LKA+CE K    +DGDLGESPFTS
Subjt:  --DNESEGHTRQRGDLREHLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTS

Query:  DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATI
        D+LEA IP KFK PT+KPYDGSKDPKDYVEVFEGLM FQAA+DAIK RAFQI LT SARLWYRRLPARSISTYSQLR+EF SQFSSRHY++KTATHLATI
Subjt:  DVLEAPIPPKFKAPTVKPYDGSKDPKDYVEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATI

Query:  RQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSK
        RQKE ETLREYVT FQE+QLKVAH SDDSA+CYFLT L DE LTVKLGEE+PATFAEVLQKAKKVIDGQEL RTK GR E++I + K  ++  KA+ KSK
Subjt:  RQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGLADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSK

Query:  DKGSFSSGRAEYRRAENGPTRSRP
        DK       AEYRR+++GP+RSRP
Subjt:  DKGSFSSGRAEYRRAENGPTRSRP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGAATCTCTGCGCCTGTCTTACCACCTGCGCACCC
AAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCTGGCTCCAACAAGCGCAGGGTCCCGATCTGAAAATCGAGTGACGC
ACGTTGACTTACGCGAGCAAAGGGGTTCGCACCTCGGCCCAGCCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAG
CATCTCAATAGAAAGAGAGGCTCATCTCTCCAAAAAGGACAGTCACCATCTCGCTCACACCGAAGCTCCAACCAGCAGGTTGAATCCTCTCACAACCCAGCAACTCCCGC
AGGGGTGATTACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCG
ACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCAAAGTTTAAAGCTCCTACCGTGAAGCCTTATGATGGGTCAAAGGACCCCAAGGATTAT
GTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGAGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACT
GCCAGCCAGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCTCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGAC
AGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGAAACAATTGAAGGTCGCACATTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTA
GCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGTCCCCTGCCACCTTCGCCGAGGTGCTACAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAA
AATCGGCCGACCAGAACGAAGAATTGGCCGGGGAAAAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGT
ATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCAACGGAACCCCTCCGCAGGTCGGCACGAATCTCTGCGCCTGTCTTACCACCTGCGCACCC
AAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCCCGGGGTCCAGCCCTGGCTCCAACAAGCGCAGGGTCCCGATCTGAAAATCGAGTGACGC
ACGTTGACTTACGCGAGCAAAGGGGTTCGCACCTCGGCCCAGCCGAGGAGGAACATCCCGAAGACAACGAGAGCGAGGGACACACTCGCCAGAGGGGAGACCTCCGTGAG
CATCTCAATAGAAAGAGAGGCTCATCTCTCCAAAAAGGACAGTCACCATCTCGCTCACACCGAAGCTCCAACCAGCAGGTTGAATCCTCTCACAACCCAGCAACTCCCGC
AGGGGTGATTACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGATGCTCAGGTTGAGGCCTTAAAGGCCAAATGTGAGCAGAAAGAAGGTCCACTGAACGATGGCG
ACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTGGAAGCACCAATCCCTCCAAAGTTTAAAGCTCCTACCGTGAAGCCTTATGATGGGTCAAAGGACCCCAAGGATTAT
GTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCGAGCTTACTGGCAGCGCGCGATTGTGGTATCGGAGACT
GCCAGCCAGGTCGATCTCGACCTACTCTCAGTTGAGAAGGGAGTTCCTCTCCCAGTTCTCTTCTCGGCATTATGACAAAAAGACAGCGACCCATCTCGCCACCATCAGAC
AGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGAAACAATTGAAGGTCGCACATTGCTCCGATGACTCGGCCATGTGCTATTTTCTCACCGGTCTA
GCCGACGAAGCCCTCACGGTGAAACTTGGAGAGGAGTCCCCTGCCACCTTCGCCGAGGTGCTACAGAAGGCGAAGAAAGTCATCGATGGACAGGAGCTCCTCCGAACCAA
AATCGGCCGACCAGAACGAAGAATTGGCCGGGGAAAAAGTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCCTTTTCCAGCGGCCGAGCTGAGT
ATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCTTAG
Protein sequenceShow/hide protein sequence
MEVGAAVVEGQGHDGLATEPLRRSARISAPVLPPAHPRTSKATRGRGGTSKKGARGPALAPTSAGSRSENRVTHVDLREQRGSHLGPAEEEHPEDNESEGHTRQRGDLRE
HLNRKRGSSLQKGQSPSRSHRSSNQQVESSHNPATPAGVITRAEFDQLRGKLDAQVEALKAKCEQKEGPLNDGDLGESPFTSDVLEAPIPPKFKAPTVKPYDGSKDPKDY
VEVFEGLMDFQAASDAIKCRAFQIELTGSARLWYRRLPARSISTYSQLRREFLSQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEKQLKVAHCSDDSAMCYFLTGL
ADEALTVKLGEESPATFAEVLQKAKKVIDGQELLRTKIGRPERRIGRGKSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP