CuGenDBv2

Moc03g07280 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g07280
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Unknown protein
Genome location: chr3:5108604..5111431
RNA-Seq Expression: Moc03g07280
Synteny: Moc03g07280
Gene Ontology terms: NA
InterPro domains: NA
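
The genome location above can be split into its parts programmatically. The following is a minimal sketch, not part of the database record: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re


def parse_location(location):
    """Split a 'chrom:start..end' string into (chrom, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. The record itself does not state this.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return chrom, start, end, end - start + 1


if __name__ == "__main__":
    print(parse_location("chr3:5108604..5111431"))
```

Under that assumption the gene spans 2828 bp on chr3.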


Homology
GenBank top hits (e value, % identity, alignment)
XP_022147182.1 uncharacterized protein LOC111016193 [Momordica charantia] (e value: 1.1e-35; identity: 37.05%)
Query:  MRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARV
        MRF++E SS+G+K++V++ S+ C DR L++AS+FV  P S +++ ID   +    S H+A+++K++LD R+ +   ERE  S  LE ATTL+ EL +A+ 
Subjt:  MRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARV

Query:  ENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFS
        E ++L+++++AK    + E E  +   ++ + I KGLE EKF+L++  D L +  +   + +  L  E++  K +L++G LLEE+F+ H +FD F  DFS
Subjt:  ENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFS

Query:  DVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT
        D  FKFLMKGI                                +SLVD+YVR+L+S    D EEED  +Q+   V TT
Subjt:  DVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT

XP_022150343.1 uncharacterized protein LOC111018538 [Momordica charantia] (e value: 5.8e-37; identity: 36.94%)
Query:  IYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVA-TTLERELKE
        I  + +IEPSS+G++++V++ S+   DR L++ASKFV  P S +++ IDY  +    S  +A+ +K++LD R+++   E+E FS ALE A +T++ EL +
Subjt:  IYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVA-TTLERELKE

Query:  ARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVN
        A  E E LK+++E++ +  + E + +Q   ++ + I +GLE EKF+L++  D + +  +    E++    E+E  K +LSNGVLLEEAF+ H DFD F  
Subjt:  ARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVN

Query:  DFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEE
        DFSD  FKFLMKGI                                ++LVD+YVRDL+S   D +E++
Subjt:  DFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEE

XP_022150867.1 uncharacterized protein LOC111018913 [Momordica charantia] (e value: 5.3e-147; identity: 98.95%)
Query:  MRDVETSLEVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQAS
        MRDVETSLEVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQAS
Subjt:  MRDVETSLEVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQAS

Query:  KFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYV
        KFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYV
Subjt:  KFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYV

Query:  IVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIMESLVD
        IVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIME  +D
Subjt:  IVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIMESLVD

XP_022152119.1 uncharacterized protein LOC111019909 [Momordica charantia] (e value: 1.5e-40; identity: 38.81%)
Query:  VGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLE
        +G TFD+  RF++EPSS+G+K++V++ S+ C DR L++ASKFV  P S +++ ID   +    S H+AI++K++LD R+ +   ERE  S ALE ATTL+
Subjt:  VGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLE

Query:  RELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDF
         EL +A+ E  +L+++++AK +  + E E  +   ++ + I KGLE EKF+L++  D L +  +   + +  L  E++  K +L+NG LLEE+F+ HLDF
Subjt:  RELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDF

Query:  DVFVNDFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT
        D F  DFSD  FKFLMKGI                                +SLV +YVR+L+S    D EEED  +Q+ + + TT
Subjt:  DVFVNDFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT

XP_022159252.1 uncharacterized protein LOC111025665 [Momordica charantia] (e value: 5.2e-46; identity: 37.28%)
Query:  EVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRS
        E  DVSPL EV+ +SP  + +  K+ + SS+             L +DP+AR+  T ++ MRF +EPSS+G+K++V++ S+ C DR L++ASKFV  P S
Subjt:  EVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRS

Query:  GIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENE
         +++ ID   +    S H A+++K++LD R+ +   ERE    ALE ATTL+ EL +A+ E ++L+++++AK    + E E  +   ++ + I KGLE E
Subjt:  GIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENE

Query:  KFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIM----------------------------
        KF+L++  D L +  +   + +  L  E++  K +L+NG LLEE+F+ H DFD F  DFSD  FKFLMKGI                             
Subjt:  KFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIM----------------------------

Query:  ---ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT
           +SLVD+YVR+L+S    D EEED  +Q+   V TT
Subjt:  ---ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D1N9 uncharacterized protein LOC111016193 (e value: 5.3e-36; identity: 37.05%)
Query:  MRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARV
        MRF++E SS+G+K++V++ S+ C DR L++AS+FV  P S +++ ID   +    S H+A+++K++LD R+ +   ERE  S  LE ATTL+ EL +A+ 
Subjt:  MRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARV

Query:  ENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFS
        E ++L+++++AK    + E E  +   ++ + I KGLE EKF+L++  D L +  +   + +  L  E++  K +L++G LLEE+F+ H +FD F  DFS
Subjt:  ENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFS

Query:  DVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT
        D  FKFLMKGI                                +SLVD+YVR+L+S    D EEED  +Q+   V TT
Subjt:  DVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT

A0A6J1D971 uncharacterized protein LOC111018538 (e value: 2.8e-37; identity: 36.94%)
Query:  IYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVA-TTLERELKE
        I  + +IEPSS+G++++V++ S+   DR L++ASKFV  P S +++ IDY  +    S  +A+ +K++LD R+++   E+E FS ALE A +T++ EL +
Subjt:  IYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVA-TTLERELKE

Query:  ARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVN
        A  E E LK+++E++ +  + E + +Q   ++ + I +GLE EKF+L++  D + +  +    E++    E+E  K +LSNGVLLEEAF+ H DFD F  
Subjt:  ARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVN

Query:  DFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEE
        DFSD  FKFLMKGI                                ++LVD+YVRDL+S   D +E++
Subjt:  DFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEE

A0A6J1DBX9 uncharacterized protein LOC111018913 (e value: 2.6e-147; identity: 98.95%)
Query:  MRDVETSLEVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQAS
        MRDVETSLEVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQAS
Subjt:  MRDVETSLEVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQAS

Query:  KFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYV
        KFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYV
Subjt:  KFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYV

Query:  IVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIMESLVD
        IVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIME  +D
Subjt:  IVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIMESLVD

A0A6J1DF31 uncharacterized protein LOC111019909 (e value: 7.2e-41; identity: 38.81%)
Query:  VGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLE
        +G TFD+  RF++EPSS+G+K++V++ S+ C DR L++ASKFV  P S +++ ID   +    S H+AI++K++LD R+ +   ERE  S ALE ATTL+
Subjt:  VGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLE

Query:  RELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDF
         EL +A+ E  +L+++++AK +  + E E  +   ++ + I KGLE EKF+L++  D L +  +   + +  L  E++  K +L+NG LLEE+F+ HLDF
Subjt:  RELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDF

Query:  DVFVNDFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT
        D F  DFSD  FKFLMKGI                                +SLV +YVR+L+S    D EEED  +Q+ + + TT
Subjt:  DVFVNDFSDVDFKFLMKGIM-------------------------------ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT

A0A6J1DZB3 uncharacterized protein LOC111025665 (e value: 2.5e-46; identity: 37.28%)
Query:  EVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRS
        E  DVSPL EV+ +SP  + +  K+ + SS+             L +DP+AR+  T ++ MRF +EPSS+G+K++V++ S+ C DR L++ASKFV  P S
Subjt:  EVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIVPRS

Query:  GIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENE
         +++ ID   +    S H A+++K++LD R+ +   ERE    ALE ATTL+ EL +A+ E ++L+++++AK    + E E  +   ++ + I KGLE E
Subjt:  GIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENE

Query:  KFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIM----------------------------
        KF+L++  D L +  +   + +  L  E++  K +L+NG LLEE+F+ H DFD F  DFSD  FKFLMKGI                             
Subjt:  KFKLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIM----------------------------

Query:  ---ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT
           +SLVD+YVR+L+S    D EEED  +Q+   V TT
Subjt:  ---ESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
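
The % identity values listed for the hits above are conventionally the share of alignment columns in which query and subject carry the same residue. In the alignment blocks, the middle line repeats the residue letter at identical positions, shows `+` for a similar residue, and is blank at mismatches and gaps. The sketch below counts identities for one block from its query and middle lines. The function name `percent_identity` is hypothetical, and a per-block figure is not expected to equal the listed value, which covers the whole alignment.

```python
def percent_identity(query, midline):
    """Percent of alignment columns marked identical in one block.

    `query` and `midline` are the residue strings only, with the
    'Query:' label and leading padding already removed. Trailing
    blanks of the midline are often lost in copied text, so it is
    padded back to the query length.
    """
    if not query:
        raise ValueError("empty alignment block")
    if len(midline) > len(query):
        raise ValueError("midline is longer than the query line")
    midline = midline.ljust(len(query))
    identical = sum(
        1 for q, m in zip(query, midline) if m.isalpha() and m == q
    )
    return 100.0 * identical / len(query)


if __name__ == "__main__":
    # First nine columns of the first GenBank alignment block.
    print(round(percent_identity("MRFKIEPSS", "MRF++E SS"), 2))
```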

Sequences
CDS sequence
ATGAGGGATGTCGAGACCTCACTTGAGGTATTTGATGTTTCACCTCTTCAAGAGGTTCAAAGGAAGTCTCCATCCAATAAATCTAAGAATAATAAGAGAAAAACT
ATCTCTTCTGATGATGTCGTGCCCGAGGTGAGGGTAGACGGCATTACTGGTCTTGCGAACGATCCCAAAGCTAGGGTAGGGGCAACGTTCGACATTTATATGAGG
TTCAAAATTGAACCTTCAAGCGCTGGTATCAAGGAGAAGGTGGCAAAAACGTCCAGCGTCTGCTTTGATCGCGGCTTGCAGCAGGCGTCAAAATTTGTCATCGTC
CCTAGGTCGGGCATCAAGAAGATTATCGACTACACAGTCAAGGTGCATGCGGTCAGCTGCCATGCCGCCATCATTATGAAGTCTAAACTGGACGACCGTGACCTC
ATCATGGTAAACGAACGCGAGGCCTTCTCTGTGGCTTTAGAGGTTGCTACTACTCTCGAAAGGGAGCTAAAAGAAGCAAGGGTCGAGAACGAGGTCCTCAAGTCT
AAACTGGAGGCTAAGTTCAAAAGTGATGAGAATGAGGTGGAGCACCAACAAGAGCTATTCAAGTCCACATATGTCATCGTCAAAGGGTTGGAGAATGAAAAATTC
AAGCTGATGAGGCGCAATGATCATCTTACTCGTGATGCCAAGACTCACCAGTCCGAGGTAAAAGAGTTGAAGGTTGAAGTGGAGCTACACAAGGCAAAACTCAGC
AATGGTGTTCTTCTTGAAGAAGCCTTCCAAGCCCATCTTGACTTCGATGTGTTTGTGAACGACTTCAGCGATGTCGACTTCAAATTTCTAATGAAAGGCATCATG
GAGAGTTTGGTGGACAGGTATGTGAGGGATCTCAACTCGAAGGCTGAGGACGATGACGAGGAGGAAGATGACCTAGCCCAGAAAAGCGATGTTGTTCGCACCACA
CCATTGGTGAAGATGGGGTGTTCTTTATGCTCCTGCTGCTCCCTCGTCTCAAGAAGCTGGCCTTTGAGACTCCAAAGAGCTTCACTTCCTAGCTTCCCAGGCAGA
GCTCGGGTCGCACCTCAAAAACTCCTGATTGACCCCCAACTATCTTATGACCAGATATCCGACTTGGAAACTCCTAGGTTGGACTCGTGCATTGTAGTGTCTTGT
CATTCGGTTCTGTTAATCCTCGCTGTAGGCATTCCTATCTCGATTGACACCATAAACTCAGAATCGAAAGTCAAGGAGAAGGGGGTTTCTGCAGTCGACTCTCAT
GGAGGAATTGAAGAATCTATAATGGCTAAGCTTAGTACTGCAAAGACTGGTGATGAACGATGCTGGAGCCTGAATCATAGAGTGTCTATTCGGGAGCCTGTGGAA
CTTTGGATGCTCTAG
mRNA sequence
ATGAGGGATGTCGAGACCTCACTTGAGGTATTTGATGTTTCACCTCTTCAAGAGGTTCAAAGGAAGTCTCCATCCAATAAATCTAAGAATAATAAGAGAAAAACT
ATCTCTTCTGATGATGTCGTGCCCGAGGTGAGGGTAGACGGCATTACTGGTCTTGCGAACGATCCCAAAGCTAGGGTAGGGGCAACGTTCGACATTTATATGAGG
TTCAAAATTGAACCTTCAAGCGCTGGTATCAAGGAGAAGGTGGCAAAAACGTCCAGCGTCTGCTTTGATCGCGGCTTGCAGCAGGCGTCAAAATTTGTCATCGTC
CCTAGGTCGGGCATCAAGAAGATTATCGACTACACAGTCAAGGTGCATGCGGTCAGCTGCCATGCCGCCATCATTATGAAGTCTAAACTGGACGACCGTGACCTC
ATCATGGTAAACGAACGCGAGGCCTTCTCTGTGGCTTTAGAGGTTGCTACTACTCTCGAAAGGGAGCTAAAAGAAGCAAGGGTCGAGAACGAGGTCCTCAAGTCT
AAACTGGAGGCTAAGTTCAAAAGTGATGAGAATGAGGTGGAGCACCAACAAGAGCTATTCAAGTCCACATATGTCATCGTCAAAGGGTTGGAGAATGAAAAATTC
AAGCTGATGAGGCGCAATGATCATCTTACTCGTGATGCCAAGACTCACCAGTCCGAGGTAAAAGAGTTGAAGGTTGAAGTGGAGCTACACAAGGCAAAACTCAGC
AATGGTGTTCTTCTTGAAGAAGCCTTCCAAGCCCATCTTGACTTCGATGTGTTTGTGAACGACTTCAGCGATGTCGACTTCAAATTTCTAATGAAAGGCATCATG
GAGAGTTTGGTGGACAGGTATGTGAGGGATCTCAACTCGAAGGCTGAGGACGATGACGAGGAGGAAGATGACCTAGCCCAGAAAAGCGATGTTGTTCGCACCACA
CCATTGGTGAAGATGGGGTGTTCTTTATGCTCCTGCTGCTCCCTCGTCTCAAGAAGCTGGCCTTTGAGACTCCAAAGAGCTTCACTTCCTAGCTTCCCAGGCAGA
GCTCGGGTCGCACCTCAAAAACTCCTGATTGACCCCCAACTATCTTATGACCAGATATCCGACTTGGAAACTCCTAGGTTGGACTCGTGCATTGTAGTGTCTTGT
CATTCGGTTCTGTTAATCCTCGCTGTAGGCATTCCTATCTCGATTGACACCATAAACTCAGAATCGAAAGTCAAGGAGAAGGGGGTTTCTGCAGTCGACTCTCAT
GGAGGAATTGAAGAATCTATAATGGCTAAGCTTAGTACTGCAAAGACTGGTGATGAACGATGCTGGAGCCTGAATCATAGAGTGTCTATTCGGGAGCCTGTGGAA
CTTTGGATGCTCTAG
Protein sequence
MRDVETSLEVFDVSPLQEVQRKSPSNKSKNNKRKTISSDDVVPEVRVDGITGLANDPKARVGATFDIYMRFKIEPSSAGIKEKVAKTSSVCFDRGLQQASKFVIV
PRSGIKKIIDYTVKVHAVSCHAAIIMKSKLDDRDLIMVNEREAFSVALEVATTLERELKEARVENEVLKSKLEAKFKSDENEVEHQQELFKSTYVIVKGLENEKF
KLMRRNDHLTRDAKTHQSEVKELKVEVELHKAKLSNGVLLEEAFQAHLDFDVFVNDFSDVDFKFLMKGIMESLVDRYVRDLNSKAEDDDEEEDDLAQKSDVVRTT
PLVKMGCSLCSCCSLVSRSWPLRLQRASLPSFPGRARVAPQKLLIDPQLSYDQISDLETPRLDSCIVVSCHSVLLILAVGIPISIDTINSESKVKEKGVSAVDSH
GGIEESIMAKLSTAKTGDERCWSLNHRVSIREPVELWML
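
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch, assuming the standard genetic code (the record does not name a translation table); `translate` is a hypothetical helper, not a CuGenDB tool.

```python
from itertools import product

# Standard genetic code in the usual T, C, A, G ordering
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(cds):
    """Translate a CDS string, stopping at the first in-frame stop codon.

    Whitespace and line breaks are removed first, so the multi-line
    sequence can be pasted in as it appears in the record.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    print(translate("ATGAGGGATGTCGAGACC"))  # first six codons of the CDS
    print(translate("CTTTGGATGCTCTAG"))     # last line of the CDS
```

The first call gives MRDVET and the second gives LWML, matching the start and end of the protein sequence listed above.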