; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g01720 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g01720
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionUnknown protein
Genome locationchr7:1362521..1364574
RNA-Seq ExpressionMoc07g01720
SyntenyMoc07g01720
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022137317.1 uncharacterized protein LOC111008813 [Momordica charantia]6.8e-14086.93Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG
        VITR EFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDV EAPIPPKFKAPTVKPYDGSKDP+DYVEVFE LMDFQAASDAIKCRAF+IA TG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLADEALT            
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------

Query:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK
                     ELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTP TIPISEILTNIEESGMEKLLKRPEK
Subjt:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPK
        LRGAP+
Subjt:  LRGAPK

XP_022141796.1 uncharacterized protein LOC111012081 [Momordica charantia]1.6e-13686.09Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG
        +ITR EFDQLRG+LDAQ EALKAKCEQKEG LNDGDLGESPFTSDV EAPIPPKFKAPTVKPYDGSKDP+DYVEVFEGLMDFQA SDAIKCRAFQIA TG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLADEALT            
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------

Query:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK
                     ELLRTKTGRPERKI RGRSGKDIEK DPKSKDKGSFSSGR EYRRAENGPTRSRPYE FTP TIPI EILT IEESGMEKLLKRPEK
Subjt:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK

Query:  LR
        LR
Subjt:  LR

XP_022151719.1 uncharacterized protein LOC111019634 [Momordica charantia]9.9e-13165.27Show/hide
Query:  MRTQMRSMEEMYNEMILAAGAGSQSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTHQRGDLR----------------------------------
        MRTQM +ME+MY+EM+ AAGA S+SENR+ R D+ EQRG HLGPV++ HPE  E E +THQRGDLR                                  
Subjt:  MRTQMRSMEEMYNEMILAAGAGSQSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTHQRGDLR----------------------------------

Query:  -----VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQ
             VITR EFDQL+ K DAQVEALKAKCE+KE S +DGDLGESPFTSD+ EA IP KFK PT+KPYDGSKDP+DYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  -----VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT-------
        IA TGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SA+CYFLT LADE LT       
Subjt:  IAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT-------

Query:  ------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKL
                          ELLRTKT RPE+KI +GR+ KD  K D K++DKG  S S R  YRR++N   RSRPYE +TP TIPISEILTNIE++GMEKL
Subjt:  ------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKL

Query:  LKRPEK
        LKRPEK
Subjt:  LKRPEK

XP_022152033.1 uncharacterized protein LOC111019842 [Momordica charantia]5.4e-13779.12Show/hide
Query:  QRGSHL----GPVEEEHPEDNESEGHTHQRGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGS
        QRGS L     P       + ++E   +      VITR EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDV EAPIP KFKAPTVKPYDGS
Subjt:  QRGSHL----GPVEEEHPEDNESEGHTHQRGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGS

Query:  KDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DP+DYVEVFEGLMDFQAASD IKCRAFQIA T SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAVCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS
         HCSDDSA+CYFLTGLADEA T                         ELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Subjt:  AHCSDDSAVCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS

Query:  RPYECFTPITIPISEILTNIEESGMEKLLKRPEKLRGAPK
        RPYE FTP TIPISEILTNIEESGMEKLLKRPEKLRGAP+
Subjt:  RPYECFTPITIPISEILTNIEESGMEKLLKRPEKLRGAPK

XP_022156542.1 uncharacterized protein LOC111023421 [Momordica charantia]3.1e-14086.27Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG
        +ITR EFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDV EAPIPPKFKAPTVKPYDG+KDP+DYVEVFEGLMDFQAASDAIKCRAFQIA TG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------
        SARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLADEALT            
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------

Query:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK
                     ELLRTKTGRPERKIGRGRSGKD+E+ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTP TIPI EILTNIEESGMEKLLKRPEK
Subjt:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPK
        LRGAP+
Subjt:  LRGAPK

TrEMBL top hitse value%identityAlignment
A0A6J1C7X5 uncharacterized protein LOC1110088133.3e-14086.93Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG
        VITR EFDQLRG+LDAQVEALKAKCEQKEG LNDGDLGESPFTSDV EAPIPPKFKAPTVKPYDGSKDP+DYVEVFE LMDFQAASDAIKCRAF+IA TG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------
        SARLWYRRLPA SISTYSQLRREFLA FSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLADEALT            
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------

Query:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK
                     ELLRTKTGRPERKIGRGRSGKDIE ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTP TIPISEILTNIEESGMEKLLKRPEK
Subjt:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPK
        LRGAP+
Subjt:  LRGAPK

A0A6J1CKB3 uncharacterized protein LOC1110120817.6e-13786.09Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG
        +ITR EFDQLRG+LDAQ EALKAKCEQKEG LNDGDLGESPFTSDV EAPIPPKFKAPTVKPYDGSKDP+DYVEVFEGLMDFQA SDAIKCRAFQIA TG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------
        SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLADEALT            
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------

Query:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK
                     ELLRTKTGRPERKI RGRSGKDIEK DPKSKDKGSFSSGR EYRRAENGPTRSRPYE FTP TIPI EILT IEESGMEKLLKRPEK
Subjt:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK

Query:  LR
        LR
Subjt:  LR

A0A6J1DDS5 uncharacterized protein LOC1110198422.6e-13779.12Show/hide
Query:  QRGSHL----GPVEEEHPEDNESEGHTHQRGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGS
        QRGS L     P       + ++E   +      VITR EFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDV EAPIP KFKAPTVKPYDGS
Subjt:  QRGSHL----GPVEEEHPEDNESEGHTHQRGDLRVITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGS

Query:  KDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV
        +DP+DYVEVFEGLMDFQAASD IKCRAFQIA T SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDK+TATHLATIRQKEGETLREYVTRFQEEQLKV
Subjt:  KDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKV

Query:  AHCSDDSAVCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS
         HCSDDSA+CYFLTGLADEA T                         ELLRTKTGRPERKIGRGRSGKDIE+AD KSKDKGSFSS RA YRRAENGPTRS
Subjt:  AHCSDDSAVCYFLTGLADEALT-------------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRS

Query:  RPYECFTPITIPISEILTNIEESGMEKLLKRPEKLRGAPK
        RPYE FTP TIPISEILTNIEESGMEKLLKRPEKLRGAP+
Subjt:  RPYECFTPITIPISEILTNIEESGMEKLLKRPEKLRGAPK

A0A6J1DDW5 uncharacterized protein LOC1110196344.8e-13165.27Show/hide
Query:  MRTQMRSMEEMYNEMILAAGAGSQSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTHQRGDLR----------------------------------
        MRTQM +ME+MY+EM+ AAGA S+SENR+ R D+ EQRG HLGPV++ HPE  E E +THQRGDLR                                  
Subjt:  MRTQMRSMEEMYNEMILAAGAGSQSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTHQRGDLR----------------------------------

Query:  -----VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQ
             VITR EFDQL+ K DAQVEALKAKCE+KE S +DGDLGESPFTSD+ EA IP KFK PT+KPYDGSKDP+DYVEVFEGLMDFQAA+DAIKCR FQ
Subjt:  -----VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQ

Query:  IAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT-------
        IA TGSARLWYRRLPARSISTYSQLR+EF+ QFSSRHYD+KTATHL TIRQKEGETLREYVTRFQEEQLKVAHCSD SA+CYFLT LADE LT       
Subjt:  IAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT-------

Query:  ------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKL
                          ELLRTKT RPE+KI +GR+ KD  K D K++DKG  S S R  YRR++N   RSRPYE +TP TIPISEILTNIE++GMEKL
Subjt:  ------------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFS-SGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKL

Query:  LKRPEK
        LKRPEK
Subjt:  LKRPEK

A0A6J1DS95 uncharacterized protein LOC1110234211.5e-14086.27Show/hide
Query:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG
        +ITR EFDQLRG+LDAQVEALKAKCEQK+ SLNDGDLGESPFTSDV EAPIPPKFKAPTVKPYDG+KDP+DYVEVFEGLMDFQAASDAIKCRAFQIA TG
Subjt:  VITRAEFDQLRGKLDAQVEALKAKCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTG

Query:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------
        SARLWYRRLP RSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSA+CYFLTGLADEALT            
Subjt:  SARLWYRRLPARSISTYSQLRREFLAQFSSRHYDKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALT------------

Query:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK
                     ELLRTKTGRPERKIGRGRSGKD+E+ADPKSKDKGSFSSGRAEYRRAENGPTRSRPYE FTP TIPI EILTNIEESGMEKLLKRPEK
Subjt:  -------------ELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRPYECFTPITIPISEILTNIEESGMEKLLKRPEK

Query:  LRGAPK
        LRGAP+
Subjt:  LRGAPK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCCAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGCCCAAGTATTCAGATCGGTCCGGAGGCCGAGTTCGAGCTGCAATCTGATATACACTGTTGTGCATATCC
TTGCATAAACATTTGGCGCCGTCTATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCA
ACAGAACCCCTCCGCAGGTCGGCCGAATCACCGCGCCTGTTCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCC
CGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAACGAGATGAT
ACTAGCTGCAGGCGCAGGGTCCCAATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACA
ACGAGAGCGAGGGACACACTCACCAGAGAGGAGACCTTCGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCTTTAAAGGCA
AAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTTGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGT
GAAACCTTATGATGGGTCGAAGGATCCCGAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCG
CGCATACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTAT
GACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTC
CGATGACTCGGCCGTGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAA
GTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCTTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCT
TACGAATGCTTCACCCCGATCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCC
GAAAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGCCAAGTATGAGGGCCGAGGTGAACCTGGCCGAGGTCCGCCCAAGTATTCAGATCGGTCCGGAGGCCGAGTTCGAGCTGCAATCTGATATACACTGTTGTGCATATCC
TTGCATAAACATTTGGCGCCGTCTATCGAAGGACTCTAGCTGCCAGCGATGCCCACCAGAGGGAGGTCGGAGCAGCAGTGGTAGAGGGGCAAGGTCACGACGGCCTAGCA
ACAGAACCCCTCCGCAGGTCGGCCGAATCACCGCGCCTGTTCTACCACCTGCGCACCCAAGGACATCCAAGGCCACCCGTGGCCGAGGTGGGACCTCTAAGAAGGGCGCC
CGGGGTCCAGCCCCGGCTCCACCAAGTGAGAACTTTGACGCACTCCAGAGAGAAATGGAGGCAATGCGCACACAAATGCGGTCCATGGAGGAAATGTATAACGAGATGAT
ACTAGCTGCAGGCGCAGGGTCCCAATCTGAAAATCGAATGACGCGCATTGACATACGCGAGCAAAGGGGTTCCCACCTCGGCCCAGTCGAGGAGGAACATCCCGAAGACA
ACGAGAGCGAGGGACACACTCACCAGAGAGGAGACCTTCGGGTGATCACAAGGGCGGAGTTCGACCAGCTGAGGGGCAAGCTCGACGCTCAGGTGGAGGCTTTAAAGGCA
AAATGTGAGCAGAAAGAAGGTTCACTGAACGATGGCGACTTGGGAGAATCGCCCTTCACCTCGGACGTTTTTGAAGCACCGATCCCTCCGAAGTTCAAAGCTCCTACCGT
GAAACCTTATGATGGGTCGAAGGATCCCGAGGATTATGTTGAGGTCTTTGAAGGCCTCATGGATTTCCAAGCGGCATCAGACGCAATCAAATGCCGCGCCTTTCAGATCG
CGCATACTGGCAGCGCGCGATTGTGGTATCGGAGACTGCCAGCCAGGTCGATCTCGACCTACTCTCAGCTGAGAAGGGAGTTCCTCGCCCAGTTCTCTTCTCGGCATTAT
GACAAAAAGACAGCGACCCATCTCGCCACCATCAGACAGAAGGAGGGTGAGACGCTGCGAGAATATGTCACCAGATTCCAGGAGGAGCAGTTGAAGGTCGCACACTGCTC
CGATGACTCGGCCGTGTGCTATTTTCTCACCGGTCTAGCCGACGAGGCCCTCACGGAGCTCCTCCGAACCAAAACCGGCCGACCAGAACGAAAGATCGGCCGGGGCAGAA
GTGGAAAAGATATAGAAAAGGCAGATCCCAAGTCCAAGGACAAGGGATCTTTTTCCAGCGGCCGAGCTGAGTATCGAAGGGCGGAGAACGGACCTACCAGGAGCCGACCT
TACGAATGCTTCACCCCGATCACGATTCCAATTTCCGAGATCCTAACGAACATCGAGGAGTCTGGAATGGAAAAACTACTCAAGCGTCCTGAGAAGCTTCGGGGAGCCCC
GAAAGGCGCAGCAAGGACAAGTATTGCCGCTTCCATCGGGAGCACGGCCATAACACGTCGGACTGCTGGGAGTTGA
Protein sequenceShow/hide protein sequence
MPSMRAEVNLAEVRPSIQIGPEAEFELQSDIHCCAYPCINIWRRLSKDSSCQRCPPEGGRSSSGRGARSRRPSNRTPPQVGRITAPVLPPAHPRTSKATRGRGGTSKKGA
RGPAPAPPSENFDALQREMEAMRTQMRSMEEMYNEMILAAGAGSQSENRMTRIDIREQRGSHLGPVEEEHPEDNESEGHTHQRGDLRVITRAEFDQLRGKLDAQVEALKA
KCEQKEGSLNDGDLGESPFTSDVFEAPIPPKFKAPTVKPYDGSKDPEDYVEVFEGLMDFQAASDAIKCRAFQIAHTGSARLWYRRLPARSISTYSQLRREFLAQFSSRHY
DKKTATHLATIRQKEGETLREYVTRFQEEQLKVAHCSDDSAVCYFLTGLADEALTELLRTKTGRPERKIGRGRSGKDIEKADPKSKDKGSFSSGRAEYRRAENGPTRSRP
YECFTPITIPISEILTNIEESGMEKLLKRPEKLRGAPKGAARTSIAASIGSTAITRRTAGS