; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc08g39010 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc08g39010
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionVat protein
Genome locationchr8:29441740..29443020
RNA-Seq ExpressionMoc08g39010
SyntenyMoc08g39010
Gene Ontology termsNA
InterPro domainsIPR032675 - Leucine-rich repeat domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0051446.1 putative disease resistance protein [Cucumis melo var. makuwa]2.1e-7350.31Show/hide
Query:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY
        M  S+ E    +DG KLFS+LK L+L GS  Y  +HLP+ IV+I+HN+E FE+R+   +E+FP E+   NVEE++N R++ S L LFELPKL++ W    
Subjt:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY

Query:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS
        +  SS+ +NL  L + GCGIL M VPSS+SFRNL+ L V KCH++T+LLNPSVARTLVQL+ LVL +CKRM TVI E VEE N+EI+F+RL  + L D+ 
Subjt:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS

Query:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT
        KLTSFHSGKC IRFP LD++ I  CP+M  FSLGI+ST  LL  ++ ++             +  +E   ++ED   I      +I   I+Q WED+YDT
Subjt:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT

Query:  GIQYLFTEKNLEENQSDHSSSCVE
         ++YLF E+N E+NQ D SS   E
Subjt:  GIQYLFTEKNLEENQSDHSSSCVE

XP_008441731.1 PREDICTED: probable disease resistance protein At4g27220 [Cucumis melo]2.1e-7350.31Show/hide
Query:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY
        M  S+ E    +DG KLFS+LK L+L GS  Y  +HLP+ IV+I+HN+E FE+R+   +E+FP E+   NVEE++N R++ S L LFELPKL++ W    
Subjt:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY

Query:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS
        +  SS+ +NL  L + GCGIL M VPSS+SFRNL+ L V KCH++T+LLNPSVARTLVQL+ LVL +CKRM TVI E VEE N+EI+F+RL  + L D+ 
Subjt:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS

Query:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT
        KLTSFHSGKC IRFP LD++ I  CP+M  FSLGI+ST  LL  ++ ++             +  +E   ++ED   I      +I   I+Q WED+YDT
Subjt:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT

Query:  GIQYLFTEKNLEENQSDHSSSCVE
         ++YLF E+N E+NQ D SS   E
Subjt:  GIQYLFTEKNLEENQSDHSSSCVE

XP_022149891.1 probable disease resistance protein At1g61300 [Momordica charantia]2.2e-17999.69Show/hide
Query:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYRNIRFQPSDLSLFELPKLKHFWKDDYK
        MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYRNIRF+PSDLSLFELPKLKHFWKDDYK
Subjt:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYRNIRFQPSDLSLFELPKLKHFWKDDYK

Query:  STSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKL
        STSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKL
Subjt:  STSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKL

Query:  TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGI
        TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGI
Subjt:  TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGI

Query:  QYLFTEKNLEENQSDHSSSCVEE
        QYLFTEKNLEENQSDHSSSCVEE
Subjt:  QYLFTEKNLEENQSDHSSSCVEE

XP_022150758.1 uncharacterized protein LOC111018819 [Momordica charantia]4.7e-10267.08Show/hide
Query:  SQVETTQLQDGLKLFSKLKSLKLSGSLIYNSS-HLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYR-NIRFQPSDLSLFELPKLKHFWKDDYKS
        SQ  TTQLQDGL+LF KLK+LKL G L YNS+ HLP+E+ R+VHNLE FE RRM +KEIFPNE+L+NVEE + N RF+PS L L+ELPKLKHFWKD++KS
Subjt:  SQVETTQLQDGLKLFSKLKSLKLSGSLIYNSS-HLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYR-NIRFQPSDLSLFELPKLKHFWKDDYKS

Query:  TSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIA-EVVE-EGNEEIVFSRLKYLFLEDLSK
        +SSL+ L  LIISGCG+LDMLVPSSVSF NL +LEV+KCHRLTHLLNPSVA+TLVQL  L LK+CKRMTTVIA EVVE +GN+EIVF +L  L LEDLSK
Subjt:  TSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIA-EVVE-EGNEEIVFSRLKYLFLEDLSK

Query:  LTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGY-EDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT
        LTSFHSG C IRFP L  V I +CP+M+ FS GI ST NLLV D++       SRYG   SN  +   ++VVEDINGIIR           Q WED+YD 
Subjt:  LTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGY-EDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT

Query:  GIQYLFTEKNLEENQ-SDHSSS
        GIQYLFTEKNLEE+Q SDH SS
Subjt:  GIQYLFTEKNLEENQ-SDHSSS

XP_038890456.1 probable disease resistance protein At4g27220 isoform X1 [Benincasa hispida]1.3e-7050.47Show/hide
Query:  SQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKL-INVEEYRNIRFQPSDLSLFELPKLKHFWKDD-YKS
        S+ E  QL+DGL LF KL++LKL GSL    + LPIEIV+I+HNLE FE+R+ L++E+F +E+L  ++E+++N +   S LSL+ELPKL+H   +D  KS
Subjt:  SQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKL-INVEEYRNIRFQPSDLSLFELPKLKHFWKDD-YKS

Query:  TSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVE-EGNEEIVFSRLKYLFLEDLSKL
        +S L+NL  L + GCGIL+M++PSS+ F NL++L V+ CH+LT+LLNPS+ R LV L  L ++ CKRMTTVIA  +E E N+EI+F+RL  L L+D SKL
Subjt:  TSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVE-EGNEEIVFSRLKYLFLEDLSKL

Query:  TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSK--VVEDINGIIRQAWEDIYGIIQQAWEDNYDT
        TSFHSGKC IRFP L  + +  CP+M  FSLGI+ST  LL   + +       R  Y+   Y  +DS+  +VEDIN  IR           Q WEDNY T
Subjt:  TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSK--VVEDINGIIRQAWEDIYGIIQQAWEDNYDT

Query:  GIQYLFTEKNLEENQSDHS
         +QYLF E+NLEE+ +  S
Subjt:  GIQYLFTEKNLEENQSDHS

TrEMBL top hitse value%identityAlignment
A0A1S3B439 probable disease resistance protein At4g272201.0e-7350.31Show/hide
Query:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY
        M  S+ E    +DG KLFS+LK L+L GS  Y  +HLP+ IV+I+HN+E FE+R+   +E+FP E+   NVEE++N R++ S L LFELPKL++ W    
Subjt:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY

Query:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS
        +  SS+ +NL  L + GCGIL M VPSS+SFRNL+ L V KCH++T+LLNPSVARTLVQL+ LVL +CKRM TVI E VEE N+EI+F+RL  + L D+ 
Subjt:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS

Query:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT
        KLTSFHSGKC IRFP LD++ I  CP+M  FSLGI+ST  LL  ++ ++             +  +E   ++ED   I      +I   I+Q WED+YDT
Subjt:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT

Query:  GIQYLFTEKNLEENQSDHSSSCVE
         ++YLF E+N E+NQ D SS   E
Subjt:  GIQYLFTEKNLEENQSDHSSSCVE

A0A5A7UB75 Putative disease resistance protein1.0e-7350.31Show/hide
Query:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY
        M  S+ E    +DG KLFS+LK L+L GS  Y  +HLP+ IV+I+HN+E FE+R+   +E+FP E+   NVEE++N R++ S L LFELPKL++ W    
Subjt:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDY

Query:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS
        +  SS+ +NL  L + GCGIL M VPSS+SFRNL+ L V KCH++T+LLNPSVARTLVQL+ LVL +CKRM TVI E VEE N+EI+F+RL  + L D+ 
Subjt:  KSTSSL-KNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLS

Query:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT
        KLTSFHSGKC IRFP LD++ I  CP+M  FSLGI+ST  LL  ++ ++             +  +E   ++ED   I      +I   I+Q WED+YDT
Subjt:  KLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT

Query:  GIQYLFTEKNLEENQSDHSSSCVE
         ++YLF E+N E+NQ D SS   E
Subjt:  GIQYLFTEKNLEENQSDHSSSCVE

A0A5D3DG21 Putative disease resistance protein5.3e-6750.34Show/hide
Query:  NSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDYKSTSSL-KNLASLIISGCGILDMLVPSSVSFR
        + +HLP+ IV+I+HN+E FE+R+   +E+FP E+   NVEE++N R++ S L LFELPKL++ W    +  SS+ +NL  L + GCGIL M VPSS+SFR
Subjt:  NSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEK-LINVEEYRNIRFQPSDLSLFELPKLKHFWKDDYKSTSSL-KNLASLIISGCGILDMLVPSSVSFR

Query:  NLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFS
        NL+ L V KCH++T+LLNPSVARTLVQL+ LVL +CKRM TVI E VEE N+EI+F+RL  + L D+ KLTSFHSGKC IRFP LD++ I  CP+M  FS
Subjt:  NLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKLTSFHSGKCIIRFPYLDDVTIWTCPKMEVFS

Query:  LGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGIQYLFTEKNLEENQSDHSSSCVE
        LGI+ST  LL  ++ ++             +  +E   ++ED   I      +I   I+Q WED+YDT ++YLF E+N E+NQ D SS   E
Subjt:  LGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGIQYLFTEKNLEENQSDHSSSCVE

A0A6J1D9T1 probable disease resistance protein At1g613001.8e-17999.69Show/hide
Query:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYRNIRFQPSDLSLFELPKLKHFWKDDYK
        MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYRNIRF+PSDLSLFELPKLKHFWKDDYK
Subjt:  MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYRNIRFQPSDLSLFELPKLKHFWKDDYK

Query:  STSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKL
        STSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKL
Subjt:  STSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKL

Query:  TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGI
        TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGI
Subjt:  TSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGI

Query:  QYLFTEKNLEENQSDHSSSCVEE
        QYLFTEKNLEENQSDHSSSCVEE
Subjt:  QYLFTEKNLEENQSDHSSSCVEE

A0A6J1DAA6 uncharacterized protein LOC1110188192.3e-10267.08Show/hide
Query:  SQVETTQLQDGLKLFSKLKSLKLSGSLIYNSS-HLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYR-NIRFQPSDLSLFELPKLKHFWKDDYKS
        SQ  TTQLQDGL+LF KLK+LKL G L YNS+ HLP+E+ R+VHNLE FE RRM +KEIFPNE+L+NVEE + N RF+PS L L+ELPKLKHFWKD++KS
Subjt:  SQVETTQLQDGLKLFSKLKSLKLSGSLIYNSS-HLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYR-NIRFQPSDLSLFELPKLKHFWKDDYKS

Query:  TSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIA-EVVE-EGNEEIVFSRLKYLFLEDLSK
        +SSL+ L  LIISGCG+LDMLVPSSVSF NL +LEV+KCHRLTHLLNPSVA+TLVQL  L LK+CKRMTTVIA EVVE +GN+EIVF +L  L LEDLSK
Subjt:  TSSLKNLASLIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIA-EVVE-EGNEEIVFSRLKYLFLEDLSK

Query:  LTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGY-EDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT
        LTSFHSG C IRFP L  V I +CP+M+ FS GI ST NLLV D++       SRYG   SN  +   ++VVEDINGIIR           Q WED+YD 
Subjt:  LTSFHSGKCIIRFPYLDDVTIWTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGY-EDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDT

Query:  GIQYLFTEKNLEENQ-SDHSSS
        GIQYLFTEKNLEE+Q SDH SS
Subjt:  GIQYLFTEKNLEENQ-SDHSSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTGGATTCCCAGGTTGAGACTACACAATTGCAAGATGGTCTGAAGTTGTTTTCCAAGCTTAAAAGTCTGAAACTATCTGGTTCTCTCATCTACAACTCATCGCATTT
GCCAATAGAAATTGTACGAATAGTACACAATCTTGAACGGTTTGAATTAAGAAGGATGCTTGTTAAAGAAATATTCCCAAATGAGAAATTGATCAATGTGGAAGAATATC
GTAATATAAGATTTCAGCCTTCGGATTTGAGTCTATTCGAATTGCCCAAGCTTAAGCATTTCTGGAAGGATGACTACAAAAGCACCTCATCACTTAAAAATTTGGCTTCT
CTAATCATATCAGGATGTGGCATATTGGATATGTTAGTGCCATCATCAGTGTCTTTTAGAAACTTGTCCAAGCTTGAGGTGGATAAATGTCATAGACTGACTCATTTGCT
AAATCCTTCGGTGGCCAGAACATTGGTGCAGCTTAAACGGTTGGTCTTAAAAGATTGCAAAAGGATGACCACTGTAATTGCAGAAGTTGTTGAAGAAGGAAATGAAGAAA
TTGTATTTAGCAGATTAAAGTATTTATTCCTCGAGGATTTGTCCAAATTAACAAGCTTTCATTCGGGAAAATGCATCATAAGATTTCCATACTTGGATGATGTAACTATT
TGGACTTGTCCTAAAATGGAAGTTTTCTCTCTTGGAATCATAAGCACGACTAACTTACTAGTTAGAGATCTTAGAATACATCATGGAATTAAAGGTTCACGATATGGGTA
TGAAGATTCAAATTATGGCTATGAAGATTCAAAAGTTGTTGAAGATATCAATGGTATTATCCGACAAGCTTGGGAGGACATTTATGGCATTATCCAACAAGCTTGGGAGG
ACAATTATGACACTGGGATTCAATATTTGTTTACAGAAAAGAATTTGGAGGAGAACCAATCTGATCATTCATCTTCATGTGTTGAGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTTGGATTCCCAGGTTGAGACTACACAATTGCAAGATGGTCTGAAGTTGTTTTCCAAGCTTAAAAGTCTGAAACTATCTGGTTCTCTCATCTACAACTCATCGCATTT
GCCAATAGAAATTGTACGAATAGTACACAATCTTGAACGGTTTGAATTAAGAAGGATGCTTGTTAAAGAAATATTCCCAAATGAGAAATTGATCAATGTGGAAGAATATC
GTAATATAAGATTTCAGCCTTCGGATTTGAGTCTATTCGAATTGCCCAAGCTTAAGCATTTCTGGAAGGATGACTACAAAAGCACCTCATCACTTAAAAATTTGGCTTCT
CTAATCATATCAGGATGTGGCATATTGGATATGTTAGTGCCATCATCAGTGTCTTTTAGAAACTTGTCCAAGCTTGAGGTGGATAAATGTCATAGACTGACTCATTTGCT
AAATCCTTCGGTGGCCAGAACATTGGTGCAGCTTAAACGGTTGGTCTTAAAAGATTGCAAAAGGATGACCACTGTAATTGCAGAAGTTGTTGAAGAAGGAAATGAAGAAA
TTGTATTTAGCAGATTAAAGTATTTATTCCTCGAGGATTTGTCCAAATTAACAAGCTTTCATTCGGGAAAATGCATCATAAGATTTCCATACTTGGATGATGTAACTATT
TGGACTTGTCCTAAAATGGAAGTTTTCTCTCTTGGAATCATAAGCACGACTAACTTACTAGTTAGAGATCTTAGAATACATCATGGAATTAAAGGTTCACGATATGGGTA
TGAAGATTCAAATTATGGCTATGAAGATTCAAAAGTTGTTGAAGATATCAATGGTATTATCCGACAAGCTTGGGAGGACATTTATGGCATTATCCAACAAGCTTGGGAGG
ACAATTATGACACTGGGATTCAATATTTGTTTACAGAAAAGAATTTGGAGGAGAACCAATCTGATCATTCATCTTCATGTGTTGAGGAATAA
Protein sequenceShow/hide protein sequence
MLDSQVETTQLQDGLKLFSKLKSLKLSGSLIYNSSHLPIEIVRIVHNLERFELRRMLVKEIFPNEKLINVEEYRNIRFQPSDLSLFELPKLKHFWKDDYKSTSSLKNLAS
LIISGCGILDMLVPSSVSFRNLSKLEVDKCHRLTHLLNPSVARTLVQLKRLVLKDCKRMTTVIAEVVEEGNEEIVFSRLKYLFLEDLSKLTSFHSGKCIIRFPYLDDVTI
WTCPKMEVFSLGIISTTNLLVRDLRIHHGIKGSRYGYEDSNYGYEDSKVVEDINGIIRQAWEDIYGIIQQAWEDNYDTGIQYLFTEKNLEENQSDHSSSCVEE