CuGenDBv2

Moc07g04590 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g04590
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110412945
Genome location: chr7:3939164..3940305
RNA-Seq Expression: Moc07g04590
Synteny: Moc07g04590
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
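
The genome location string above can be split into chromosome, start and end to get the length of the genomic span. The following is a minimal sketch, not part of the CuGenDB record: the helper names `parse_location` and `span_length` are hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re
from typing import Tuple


def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the span, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


chrom, start, end = parse_location("chr7:3939164..3940305")
print(chrom, span_length(start, end))  # chr7 1142
```

Under that assumption the locus spans 1142 bp, slightly longer than the 1047-nt CDS listed under Sequences.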


Homology
GenBank top hits (hit, e-value, % identity, alignment)
XP_022156835.1 uncharacterized protein LOC111023669 [Momordica charantia], e-value 4.5e-98, identity 66.9%
Query:  MKYFPPSKNAKYRSEINNFQQFAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDP
        MKYFPP KNAKYRSEI NFQQ   ESV+ESWE FK+LLQ CPHHGIPRCIQIE YYK L+DATRL                                 DP
Subjt:  MKYFPPSKNAKYRSEINNFQQFAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDP

Query:  RAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWR
        RA+QG+  KGL ESESY  LNS +ENLT LVMRSM QQ++VGA  G ANV+ IQGISCSFCEG+HHYNN P NPESVYYLGN QNN  N YSNTYNPGWR
Subjt:  RAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWR

Query:  NHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQ
        NHPNFSWSG+QGG+NAGTS+APA+Q K SYPP F NQGQ+  +  SEGS ASLE LMK+ M  ND TVQSQA SLRNL++QVGQ
Subjt:  NHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQ

XP_022157438.1 uncharacterized protein LOC111024136 [Momordica charantia], e-value 3.9e-110, identity 64.86%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQMLQTV +FHGHATEDPHQHLKF MGVCNSFK+EGLS +V+RLKLFP+SLRDEARTWLESLP ESITSWDDLAEKFLMKYFPP+KNAKYR+EINNFQQ
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGES--VSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVN
        F GES   +E++   +R+  S  +H         +++          DP+AVQGKSSK LVESESYTTLNS IENLT LVMRS+ QQS  GA  G  NVN
Subjt:  FAGES--VSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVN

Query:  QIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIA
        QIQGISCSF EGDHHYNNCPGNPES                               SG+QGGHNAGTS+APAFQ K                RQSEGS A
Subjt:  QIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIA

Query:  SLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT
        SLEKLMKQYMANNDATV+ Q + LRNL+LQVGQLATDL S+P+GALPSDT
Subjt:  SLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT

XP_022158598.1 uncharacterized protein LOC111025053 [Momordica charantia], e-value 6.1e-95, identity 55.53%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQM+  V QFHGHATE PHQHLKFFMGV NSFK+EGLS  VLRLKLF YSLR EARTWLESL  E ITSWDDL EKFLMKYF PS              
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDPRAVQGKSSKGLVESESYTTLN
                     KRL Q CP+HGIP  IQIETYYK L++ATRL                                 D RA++GKSSK LVESESYTTLN
Subjt:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDPRAVQGKSSKGLVESESYTTLN

Query:  SNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSA
        S IE LT L                                                      NNRN  YSNTYNP  RNHPNF WSG+QGGHN G S+A
Subjt:  SNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSA

Query:  PAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT
        P FQ KVSYPPGF  QGQMV   QS+GSI SLE +MKQYMANNDATVQSQA SLRNL+LQVGQLA DLKS+PVGALPSDT
Subjt:  PAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT

XP_022158611.1 uncharacterized protein LOC111025065 [Momordica charantia], e-value 9.4e-197, identity 99.71%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQI
        FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQI
Subjt:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQI

Query:  QGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASL
        QGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASL
Subjt:  QGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASL

Query:  EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPV
        EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKP+
Subjt:  EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPV

XP_030510138.1 uncharacterized protein LOC115724905 [Cannabis sativa], e-value 5.5e-88, identity 49.48%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQMLQTV QF G  TEDPH H+  F+ V +SFK +G+S E LRLKLFP+SLRD AR WL +LP +S+T+W+DLAE FL KYFPP++NAK+RSEI +FQQ
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYT-------TLNSN--------------------IEN
           E+ S++WE FK LL+ CPHHGIP CIQ+ET+Y  LN A+R+     V   S+ G + S+SY         + SN                    ++ 
Subjt:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYT-------TLNSN--------------------IEN

Query:  LTVLV-----MRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNR-NNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSS
        LT L      M ++++  ++G     A   Q   ISC +C   H + NCP NP SV Y+GN   NR NN YSN+YNP W++HPNFSW    GG  A +S 
Subjt:  LTVLV-----MRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNR-NNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSS

Query:  APAFQHKVSYPPGFVNQGQMVARRQSEGS-IASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT
        A   Q K S+PPGF  Q       Q +GS  +SLE LM+ YMA NDA +QSQA SLRNL++Q+GQLA DLK++P G LPSDT
Subjt:  APAFQHKVSYPPGFVNQGQMVARRQSEGS-IASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT
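
The % identity values listed with each hit can be reproduced from the alignment blocks by counting columns where the middle line repeats the query residue. The following is a minimal sketch, not the database's own code: `identity_counts` is a hypothetical helper, and it assumes the usual BLAST convention that the middle line shows a letter for an identical residue, `+` for a conservative substitution and a space otherwise. It uses the last block of the XP_022158611.1 alignment; the counts for the three earlier blocks (100 identical columns out of 100 each) are read off the page as displayed.

```python
from typing import List, Tuple


def identity_counts(query: str, midline: str) -> Tuple[int, int]:
    """Return (identical columns, aligned columns) for one alignment block."""
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return identical, len(query)


def percent_identity(blocks: List[Tuple[int, int]]) -> float:
    """Pool per-block counts into a single percentage."""
    identical = sum(i for i, _ in blocks)
    aligned = sum(n for _, n in blocks)
    return round(100.0 * identical / aligned, 2)


# Last block of the XP_022158611.1 alignment, copied from the page.
last_block = identity_counts(
    "EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPV",
    "EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKP+",
)
# The three earlier blocks are 100 columns each and fully identical as displayed.
blocks = [(100, 100), (100, 100), (100, 100), last_block]
print(last_block, percent_identity(blocks))  # (40, 41) 99.71
```

The pooled value, 340 identical columns out of 341, agrees with the 99.71 listed for that hit.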

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A6J1DRG1 uncharacterized protein LOC111023669, e-value 2.2e-98, identity 66.9%
Query:  MKYFPPSKNAKYRSEINNFQQFAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDP
        MKYFPP KNAKYRSEI NFQQ   ESV+ESWE FK+LLQ CPHHGIPRCIQIE YYK L+DATRL                                 DP
Subjt:  MKYFPPSKNAKYRSEINNFQQFAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDP

Query:  RAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWR
        RA+QG+  KGL ESESY  LNS +ENLT LVMRSM QQ++VGA  G ANV+ IQGISCSFCEG+HHYNN P NPESVYYLGN QNN  N YSNTYNPGWR
Subjt:  RAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWR

Query:  NHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQ
        NHPNFSWSG+QGG+NAGTS+APA+Q K SYPP F NQGQ+  +  SEGS ASLE LMK+ M  ND TVQSQA SLRNL++QVGQ
Subjt:  NHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQ

A0A6J1DTD1 uncharacterized protein LOC111024136, e-value 1.9e-110, identity 64.86%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQMLQTV +FHGHATEDPHQHLKF MGVCNSFK+EGLS +V+RLKLFP+SLRDEARTWLESLP ESITSWDDLAEKFLMKYFPP+KNAKYR+EINNFQQ
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGES--VSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVN
        F GES   +E++   +R+  S  +H         +++          DP+AVQGKSSK LVESESYTTLNS IENLT LVMRS+ QQS  GA  G  NVN
Subjt:  FAGES--VSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVN

Query:  QIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIA
        QIQGISCSF EGDHHYNNCPGNPES                               SG+QGGHNAGTS+APAFQ K                RQSEGS A
Subjt:  QIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIA

Query:  SLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT
        SLEKLMKQYMANNDATV+ Q + LRNL+LQVGQLATDL S+P+GALPSDT
Subjt:  SLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT

A0A6J1DWK1 uncharacterized protein LOC111025053, e-value 2.9e-95, identity 55.53%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQM+  V QFHGHATE PHQHLKFFMGV NSFK+EGLS  VLRLKLF YSLR EARTWLESL  E ITSWDDL EKFLMKYF PS              
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDPRAVQGKSSKGLVESESYTTLN
                     KRL Q CP+HGIP  IQIETYYK L++ATRL                                 D RA++GKSSK LVESESYTTLN
Subjt:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRL--------------------------------PDPRAVQGKSSKGLVESESYTTLN

Query:  SNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSA
        S IE LT L                                                      NNRN  YSNTYNP  RNHPNF WSG+QGGHN G S+A
Subjt:  SNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSA

Query:  PAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT
        P FQ KVSYPPGF  QGQMV   QS+GSI SLE +MKQYMANNDATVQSQA SLRNL+LQVGQLA DLKS+PVGALPSDT
Subjt:  PAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT

A0A6J1E1F3 uncharacterized protein LOC111025065, e-value 4.5e-197, identity 99.71%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQI
        FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQI
Subjt:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQI

Query:  QGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASL
        QGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASL
Subjt:  QGISCSFCEGDHHYNNCPGNPESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASL

Query:  EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPV
        EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKP+
Subjt:  EKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPV

A0A6J1G7Q6 uncharacterized protein LOC111451598, e-value 1.3e-79, identity 43.58%
Query:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ
        MFQMLQT+ QFHG +++DPH HLK F+GV +SF+ +G+  +V+RL  F YSLRD A++WL  L L  I SW+ LAEKFL KYFPP+++A++R+EI  FQ+
Subjt:  MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQ

Query:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATR--------------------------------LPDPRAVQGKSSKGLVESESYTTLN
        F  E++SE+WE FK  L+ CPHHG+P CIQIET+Y  LN AT+                                  D R+  GK ++ ++E ++ +++N
Subjt:  FAGESVSESWECFKRLLQSCPHHGIPRCIQIETYYKDLNDATR--------------------------------LPDPRAVQGKSSKGLVESESYTTLN

Query:  SNIENLT-VLVMRSMMQQSSVGALTGTANVN-QIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQ---NNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNA
        + + ++T +L   +  Q S + A   TA V  Q    SC +C   H ++ CP NP S++Y+GN     N + N  SNTYNPGWRNHPNF   G QG +N 
Subjt:  SNIENLT-VLVMRSMMQQSSVGALTGTANVN-QIQGISCSFCEGDHHYNNCPGNPESVYYLGNPQ---NNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNA

Query:  GTSSAPAFQHKVSYPPGFVNQGQMV-----ARRQSEGSIAS-------LEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT
                  K +YPPGF  Q Q+      A  Q EG+  +       LE L+K+YMA NDA +QSQ  SLRNL++QVGQLA +L+++P+G LP+DT
Subjt:  GTSSAPAFQHKVSYPPGFVNQGQMV-----ARRQSEGSIAS-------LEKLMKQYMANNDATVQSQATSLRNLKLQVGQLATDLKSKPVGALPSDT

SwissProt top hits (hit, e-value, % identity, alignment)
No hits found
Arabidopsis top hits (hit, e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTTTCAGATGCTCCAGACAGTGGATCAATTTCATGGACATGCAACGGAGGACCCACATCAGCATCTGAAATTCTTTATGGGAGTTTGCAATTCATTCAAAAACGAAGG
ATTGAGCAATGAAGTGCTGAGGCTTAAGCTATTTCCATATTCACTTAGAGATGAAGCCAGAACATGGTTGGAGTCCCTTCCCTTAGAGTCTATTACAAGTTGGGATGATT
TGGCCGAGAAGTTCTTGATGAAATACTTCCCACCCAGCAAAAATGCTAAGTATCGCAGTGAGATCAACAATTTTCAACAATTTGCTGGGGAATCAGTCAGTGAATCCTGG
GAGTGTTTCAAACGATTATTGCAGAGCTGTCCTCACCATGGGATCCCAAGGTGCATACAGATAGAGACATATTACAAAGATCTGAATGATGCCACACGCCTACCTGATCC
CAGAGCTGTTCAAGGAAAATCAAGTAAGGGGCTAGTTGAGTCAGAATCATACACTACATTGAATTCAAACATTGAGAATCTGACGGTCTTGGTAATGAGAAGTATGATGC
AGCAAAGTTCAGTTGGAGCATTAACTGGTACGGCTAATGTCAACCAAATTCAAGGGATTTCATGTTCTTTTTGCGAGGGAGATCACCACTACAATAACTGCCCTGGAAAT
CCGGAGTCAGTTTATTATTTGGGGAACCCGCAGAATAATAGAAACAATCTGTATTCGAATACGTACAATCCTGGCTGGAGGAATCACCCCAATTTTAGTTGGAGTGGTGA
TCAAGGAGGACACAATGCGGGAACATCTAGTGCTCCAGCATTTCAGCATAAGGTAAGTTATCCTCCTGGTTTTGTGAATCAAGGACAAATGGTAGCACGAAGGCAATCAG
AAGGATCAATTGCATCTTTGGAAAAGCTGATGAAGCAATACATGGCCAATAATGATGCCACTGTGCAAAGCCAAGCTACATCATTAAGGAATTTAAAATTACAAGTAGGC
CAATTAGCTACGGATTTGAAGAGCAAACCGGTTGGAGCATTACCTAGCGATACATAA
mRNA sequence
ATGTTTCAGATGCTCCAGACAGTGGATCAATTTCATGGACATGCAACGGAGGACCCACATCAGCATCTGAAATTCTTTATGGGAGTTTGCAATTCATTCAAAAACGAAGG
ATTGAGCAATGAAGTGCTGAGGCTTAAGCTATTTCCATATTCACTTAGAGATGAAGCCAGAACATGGTTGGAGTCCCTTCCCTTAGAGTCTATTACAAGTTGGGATGATT
TGGCCGAGAAGTTCTTGATGAAATACTTCCCACCCAGCAAAAATGCTAAGTATCGCAGTGAGATCAACAATTTTCAACAATTTGCTGGGGAATCAGTCAGTGAATCCTGG
GAGTGTTTCAAACGATTATTGCAGAGCTGTCCTCACCATGGGATCCCAAGGTGCATACAGATAGAGACATATTACAAAGATCTGAATGATGCCACACGCCTACCTGATCC
CAGAGCTGTTCAAGGAAAATCAAGTAAGGGGCTAGTTGAGTCAGAATCATACACTACATTGAATTCAAACATTGAGAATCTGACGGTCTTGGTAATGAGAAGTATGATGC
AGCAAAGTTCAGTTGGAGCATTAACTGGTACGGCTAATGTCAACCAAATTCAAGGGATTTCATGTTCTTTTTGCGAGGGAGATCACCACTACAATAACTGCCCTGGAAAT
CCGGAGTCAGTTTATTATTTGGGGAACCCGCAGAATAATAGAAACAATCTGTATTCGAATACGTACAATCCTGGCTGGAGGAATCACCCCAATTTTAGTTGGAGTGGTGA
TCAAGGAGGACACAATGCGGGAACATCTAGTGCTCCAGCATTTCAGCATAAGGTAAGTTATCCTCCTGGTTTTGTGAATCAAGGACAAATGGTAGCACGAAGGCAATCAG
AAGGATCAATTGCATCTTTGGAAAAGCTGATGAAGCAATACATGGCCAATAATGATGCCACTGTGCAAAGCCAAGCTACATCATTAAGGAATTTAAAATTACAAGTAGGC
CAATTAGCTACGGATTTGAAGAGCAAACCGGTTGGAGCATTACCTAGCGATACATAA
Protein sequence
MFQMLQTVDQFHGHATEDPHQHLKFFMGVCNSFKNEGLSNEVLRLKLFPYSLRDEARTWLESLPLESITSWDDLAEKFLMKYFPPSKNAKYRSEINNFQQFAGESVSESW
ECFKRLLQSCPHHGIPRCIQIETYYKDLNDATRLPDPRAVQGKSSKGLVESESYTTLNSNIENLTVLVMRSMMQQSSVGALTGTANVNQIQGISCSFCEGDHHYNNCPGN
PESVYYLGNPQNNRNNLYSNTYNPGWRNHPNFSWSGDQGGHNAGTSSAPAFQHKVSYPPGFVNQGQMVARRQSEGSIASLEKLMKQYMANNDATVQSQATSLRNLKLQVG
QLATDLKSKPVGALPSDT
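
The protein sequence above can be checked against the CDS by translating it codon by codon. The following is a minimal sketch, not taken from CuGenDB: it assumes the standard genetic code, the `translate` helper is hypothetical, and only the first 30 and last 15 nucleotides of the CDS are used to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 nucleotides of the CDS listed above.
print(translate("ATGTTTCAGATGCTCCAGACAGTGGATCAA"))  # MFQMLQTVDQ
print(translate("CCTAGCGATACATAA"))                 # PSDT
```

Both fragments agree with the start (MFQMLQTVDQ) and end (PSDT) of the listed protein sequence. The full 1047-nt CDS can be passed the same way, since whitespace and line breaks are stripped before translation.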