; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc03g14230 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc03g14230
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr3:9591996..9598564
RNA-Seq ExpressionMoc03g14230
SyntenyMoc03g14230
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ERM93404.1 hypothetical protein AMTR_s04947p00003620 [Amborella trichopoda]1.9e-4747.6Show/hide
Query:  LNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEA
        +N ++LAD+  R +R YAAP F   +P I  PEI  P+FELKPVMFQMLQTVGQF G PTED H HL                     LKLFP+SL+D A
Subjt:  LNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEA

Query:  DTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNELDDAIRLVIDALANGALQA
         +WL +LP +S+T+W D+AEK L KYFPP++NAK+RS+I  F Q   ES                         +E +YN L+ A R+V+DA ANGA+ +
Subjt:  DTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNELDDAIRLVIDALANGALQA

Query:  KPYAKAFNILERISSNNHSWSNPRAIQSR
        K Y +AF ILE I+SNN+ WSN RA  SR
Subjt:  KPYAKAFNILERISSNNHSWSNPRAIQSR

XP_022150863.1 uncharacterized protein LOC111018910 [Momordica charantia]2.1e-8345.59Show/hide
Query:  MNPPNPNMRQPILPNVRIEEIADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDL
        MNPPNPN+ QPI PNVRIEEI DG P+A N EV VP LN VLLA  IDRE+RAYAAP FYNF+PVITE EI  PKFELK                  E  
Subjt:  MNPPNPNMRQPILPNVRIEEIADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDL

Query:  HSHLNMLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNEL
        +  +  LKLF +SL+DEA TWL SLPSESITSW+D+AE  LMKYFPPSKNAKYRSDIN F QF+ ES                         IEMYYN L
Subjt:  HSHLNMLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNEL

Query:  DDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSRGAER-----SGSIDWTVATDLKSIPYVALSSDTKVPMRDGKKQ----------
        DDA RLV     N AL AKPYA+AFNILERISSN HS S+ RAIQ RG +R     S S   +   ++  +   +++  + V    GK            
Subjt:  DDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSRGAER-----SGSIDWTVATDLKSIPYVALSSDTKVPMRDGKKQ----------

Query:  -------------------CKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLA----KNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTS
                           C   T  S +    ++   P + + P  +   E  L     ++            ++I+ + R+ Q+    E  PV    S
Subjt:  -------------------CKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLA----KNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTS

Query:  GVELTHIRMPEKRKQPKHADALAKYRLTPSYPKRLQNKERNVQLKRFLNVLKQLHVNTPLVEALE
            T IR+ +KRKQ +H DALA+Y+L P YPKR Q KE NVQ  +FL+VLKQLHVN PLVEALE
Subjt:  GVELTHIRMPEKRKQPKHADALAKYRLTPSYPKRLQNKERNVQLKRFLNVLKQLHVNTPLVEALE

XP_030483210.1 uncharacterized protein LOC115699807 [Cannabis sativa]3.5e-4647.95Show/hide
Query:  LADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLNM-------------------LKLFPYSLKDEADTWLQ
        +AD+ D+ +R YAAP F   +P I  PEI  P+FELKPVMFQMLQTVGQF G PTED H HL +                   LKLFPYSL+D+A  WL 
Subjt:  LADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLNM-------------------LKLFPYSLKDEADTWLQ

Query:  SLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRESI-------------------------EMYYNELDDAIRLVIDALANGALQAKPYAK
        SLPS S+T+W+++AE+ LMKYFPP+KNAK R +I  F QF  ES+                         E +YN L+   R+V+DA ANGAL AK Y +
Subjt:  SLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRESI-------------------------EMYYNELDDAIRLVIDALANGALQAKPYAK

Query:  AFNILERISSNNHSWSNPR
        A++I+ERIS+NN+ W   R
Subjt:  AFNILERISSNNHSWSNPR

XP_030497803.1 uncharacterized protein LOC115713460 [Cannabis sativa]6.4e-4830.03Show/hide
Query:  IADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN---------------
        +ADGF I      +    N + LAD+  R +R YAAP F   +P I  PEI  P FELKPVMFQMLQTVGQF G PTED H H+                
Subjt:  IADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN---------------

Query:  ----MLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNELD
             LKLFP+SL+D A  WL +LP +S+T+W D+AEK L KYFPP++NAK+RS+I  F Q   E+                         +E +YN L+
Subjt:  ----MLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNELD

Query:  DAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSR--------------------------GAERSGSI--------------------
         A R+V+DA ANGA+ +K Y +AF ILERI+SNN+ WS  RA  SR                               GS+                    
Subjt:  DAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSR--------------------------GAERSGSI--------------------

Query:  -----------------------------------------DW---------------------------------------------------------
                                                  W                                                         
Subjt:  -----------------------------------------DW---------------------------------------------------------

Query:  -----TVATDLKSIPYVALSSDTKVPMRDGKKQCKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLAKNNEPAETTLPTSPVQIMMEPRKVQDIIKA
              +A DLK+ P   L SDT+ P RDGK+ CKA+TLRSGK +                    ES +A                   EP  +Q   + 
Subjt:  -----TVATDLKSIPYVALSSDTKVPMRDGKKQCKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLAKNNEPAETTLPTSPVQIMMEPRKVQDIIKA

Query:  EGTPVRINTSGVELTHIRMPEKRKQPKHADA-LAKYRLTPSYPKRLQNKERNVQLKRFLNVLKQLHVNTPLVEALE
        +  P    TS VE+     P      +H+ A  +  +  P +P+R + ++ + Q +RFL+VLKQLH+N PLVEALE
Subjt:  EGTPVRINTSGVELTHIRMPEKRKQPKHADA-LAKYRLTPSYPKRLQNKERNVQLKRFLNVLKQLHVNTPLVEALE

XP_030508936.1 uncharacterized protein LOC115723589 [Cannabis sativa]2.2e-4842.86Show/hide
Query:  EIERTFHRNRHEQIRAQAEANMNPPNPNMRQPILPNVRIEEIADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELK
        EIERTF + R EQ +A+   NM                    ADGF +      +    N + LAD+  R +R YAAP F   +P I  PEI  P FELK
Subjt:  EIERTFHRNRHEQIRAQAEANMNPPNPNMRQPILPNVRIEEIADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELK

Query:  PVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYF
        PVMFQMLQTVGQF G PTED H H+                     LKLFP+SL+D A  WL +LP +S+T+W D+AEK L KYFPP++NAK+RS+I  F
Subjt:  PVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYF

Query:  HQFSRES-------------------------IEMYYNELDDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSR
         Q   E+                         +E +YN L+ A R+V+DA ANGA+ +K Y +AF ILERI+SNN+ WS  RA  SR
Subjt:  HQFSRES-------------------------IEMYYNELDDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSR

TrEMBL top hitse value%identityAlignment
A0A6J1DAK9 uncharacterized protein LOC1110189101.0e-8345.59Show/hide
Query:  MNPPNPNMRQPILPNVRIEEIADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDL
        MNPPNPN+ QPI PNVRIEEI DG P+A N EV VP LN VLLA  IDRE+RAYAAP FYNF+PVITE EI  PKFELK                  E  
Subjt:  MNPPNPNMRQPILPNVRIEEIADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDL

Query:  HSHLNMLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNEL
        +  +  LKLF +SL+DEA TWL SLPSESITSW+D+AE  LMKYFPPSKNAKYRSDIN F QF+ ES                         IEMYYN L
Subjt:  HSHLNMLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNEL

Query:  DDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSRGAER-----SGSIDWTVATDLKSIPYVALSSDTKVPMRDGKKQ----------
        DDA RLV     N AL AKPYA+AFNILERISSN HS S+ RAIQ RG +R     S S   +   ++  +   +++  + V    GK            
Subjt:  DDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSRGAER-----SGSIDWTVATDLKSIPYVALSSDTKVPMRDGKKQ----------

Query:  -------------------CKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLA----KNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTS
                           C   T  S +    ++   P + + P  +   E  L     ++            ++I+ + R+ Q+    E  PV    S
Subjt:  -------------------CKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLA----KNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTS

Query:  GVELTHIRMPEKRKQPKHADALAKYRLTPSYPKRLQNKERNVQLKRFLNVLKQLHVNTPLVEALE
            T IR+ +KRKQ +H DALA+Y+L P YPKR Q KE NVQ  +FL+VLKQLHVN PLVEALE
Subjt:  GVELTHIRMPEKRKQPKHADALAKYRLTPSYPKRLQNKERNVQLKRFLNVLKQLHVNTPLVEALE

A0A6J1DWK1 uncharacterized protein LOC1110250534.2e-4536.48Show/hide
Query:  MFQMLQTVGQFHGHPTEDLHSHLNM-------------------LKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQ
        MFQM+  VGQFHGH TE  H HL                     LKLF YSL+ EA TWL+SL SE ITSW+D+ EK LMKYF PSK    R    Y   
Subjt:  MFQMLQTVGQFHGHPTEDLHSHLNM-------------------LKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQ

Query:  FSRESIEMYYNELDDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSRGA--------------------------------------
             IE YY  LD+A RLVIDA  NGAL  KPYAKA NILERISS+NHSWS+ RAI+ + +                                      
Subjt:  FSRESIEMYYNELDDAIRLVIDALANGALQAKPYAKAFNILERISSNNHSWSNPRAIQSRGA--------------------------------------

Query:  --------------------------------------------ERSGSI--------------DWTV-----------------ATDLKSIPYVALSSD
                                                    +  GSI              D TV                 A DLKS P  AL SD
Subjt:  --------------------------------------------ERSGSI--------------DWTV-----------------ATDLKSIPYVALSSD

Query:  TKVPMRDGKKQCKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLAKNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTSGVELTHIRMPEK
        T+VP RD K+QC ALTLRSGKALP  H NAP    EP      E Q  +++EPAE  +P  P QI  +P++ Q+  K    PV         +   +PEK
Subjt:  TKVPMRDGKKQCKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLAKNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTSGVELTHIRMPEK

Query:  RKQ
          +
Subjt:  RKQ

A0A6J1E1F3 uncharacterized protein LOC1110250652.5e-4234.45Show/hide
Query:  MFQMLQTVGQFHGHPTEDLHSHLNM-------------------LKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQ
        MFQMLQTV QFHGH TED H HL                     LKLFPYSL+DEA TWL+SLP ESITSW+D+AEK LMKYFPPSKNAKYRS+IN F Q
Subjt:  MFQMLQTVGQFHGHPTEDLHSHLNM-------------------LKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQ

Query:  FSRES-------------------------IEMYYNELDDAIRL-----VIDALANGALQAKPYAKAFNILERIS-------------------------
        F+ ES                         IE YY +L+DA RL     V    + G ++++ Y    + +E ++                         
Subjt:  FSRES-------------------------IEMYYNELDDAIRL-----VIDALANGALQAKPYAKAFNILERIS-------------------------

Query:  --------SNNHSWS----NPRAIQSRG---AERSGSIDWTVATDLKSIPYVALSSD--------TKVPMRDGKKQCKALTLRSGKALPLAHLNAPRAVS
                  +H ++    NP ++   G     R+     T     ++ P  + S D        +  P    K       +  G+ +         A  
Subjt:  --------SNNHSWS----NPRAIQSRG---AERSGSIDWTVATDLKSIPYVALSSD--------TKVPMRDGKKQCKALTLRSGKALPLAHLNAPRAVS

Query:  EPTLKESRESQLAKNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTSGVELTHIRMPEKRKQPKHADALAKYRLTPSYPKRLQNKERNVQLKRF
        E  +K     Q   NN+    +  TS   + ++  ++   +K++               IR+PEKRKQ +H +A A+Y   P YPKRLQ KERNVQ  +F
Subjt:  EPTLKESRESQLAKNNEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTSGVELTHIRMPEKRKQPKHADALAKYRLTPSYPKRLQNKERNVQLKRF

Query:  LNVLKQLHVNTPLVEALE
        L+VLKQLHVN PLVEALE
Subjt:  LNVLKQLHVNTPLVEALE

A0A6J1H7E4 uncharacterized protein LOC1114611684.8e-4143.3Show/hide
Query:  NAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEAD
        NA+ LAD+ +R +RAYA PA    +P I  PE+    FELKPVMFQMLQT+GQFHG P+ED H HL                     L LFPYSL+D A 
Subjt:  NAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEAD

Query:  TWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRESI-------------------------EMYYNELDDAIRLVIDALANGALQAK
        +WL +L   +I SW  +AEK L+KYFPP++NA++R++I  F QF  E++                         E +YN L+ A + V+DA ANGA+ +K
Subjt:  TWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRESI-------------------------EMYYNELDDAIRLVIDALANGALQAK

Query:  PYAKAFNILERISSNNHSWSNPRA
         Y +A+ ILERI+SNN  W++ R+
Subjt:  PYAKAFNILERISSNNHSWSNPRA

U5CUI2 Retrotrans_gag domain-containing protein9.0e-4847.6Show/hide
Query:  LNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEA
        +N ++LAD+  R +R YAAP F   +P I  PEI  P+FELKPVMFQMLQTVGQF G PTED H HL                     LKLFP+SL+D A
Subjt:  LNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTVGQFHGHPTEDLHSHLN-------------------MLKLFPYSLKDEA

Query:  DTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNELDDAIRLVIDALANGALQA
         +WL +LP +S+T+W D+AEK L KYFPP++NAK+RS+I  F Q   ES                         +E +YN L+ A R+V+DA ANGA+ +
Subjt:  DTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRES-------------------------IEMYYNELDDAIRLVIDALANGALQA

Query:  KPYAKAFNILERISSNNHSWSNPRAIQSR
        K Y +AF ILE I+SNN+ WSN RA  SR
Subjt:  KPYAKAFNILERISSNNHSWSNPRAIQSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCACAATAGTGTATTTCAGATTGCAGCTCGAACTCGGCTTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTC
AGTATAGGGTATTCTCTACGTAACGTCTCCGTGTTTGGTCGGTCGATCGATCCTGATCGATCTCGGTCATTCTTCTCTTTCCTAAAGACGGCAGTGTGCTTGATC
GGCGCGAGTAGAGACGATAGGCTTGCTTCTGGTGGCAAAACCCAATGGTGCATGAACGATTTTGAGAGCTTGGAATACATTATCTATCATGAGATAGAACGAACT
TTCCACAGAAATCGACATGAACAAATAAGAGCCCAAGCTGAAGCAAATATGAATCCACCCAACCCGAATATGCGTCAACCTATTCTACCGAATGTGAGGATCGAG
GAAATAGCAGATGGGTTTCCTATTGCTGCTAACCCTGAGGTATCAGTGCCCCCTCTCAATGCTGTATTACTAGCAGATAACATCGACAGAGAAGTCAGGGCGTAT
GCGGCTCCAGCCTTTTATAATTTCGACCCAGTAATCACGGAGCCGGAAATTGCTACCCCAAAGTTTGAACTAAAACCCGTAATGTTTCAGATGCTCCAGACAGTG
GGTCAGTTCCACGGTCATCCTACTGAAGATCTGCATTCACACCTGAATATGCTCAAGTTGTTCCCCTATTCACTTAAAGACGAAGCCGATACATGGTTACAGTCA
TTGCCGTCAGAATCTATTACAAGTTGGGAGGATATAGCCGAGAAATTATTGATGAAGTACTTCCCGCCCAGCAAGAACGCTAAGTACAGAAGCGATATTAATTAC
TTTCATCAATTTTCTAGGGAGTCGATTGAAATGTACTACAATGAATTGGATGACGCTATACGTCTGGTCATTGATGCCTTGGCAAATGGCGCATTGCAAGCAAAA
CCTTATGCTAAAGCATTCAATATCTTGGAGAGAATATCATCGAACAATCATTCATGGTCAAACCCTAGAGCCATTCAAAGTAGAGGAGCAGAGCGCAGTGGGAGC
ATCGACTGGACAGTAGCAACCGATTTAAAGAGCATACCTTATGTAGCATTGTCGAGCGACACTAAAGTACCGATGAGAGATGGTAAAAAGCAATGTAAAGCCCTC
ACACTGCGAAGTGGTAAGGCATTACCTCTAGCACACCTGAATGCTCCAAGGGCCGTGAGCGAGCCCACTCTGAAAGAATCAAGAGAATCTCAATTAGCGAAGAAT
AATGAGCCAGCAGAGACAACTTTACCCACTTCCCCAGTGCAGATCATGATGGAACCTAGGAAAGTTCAAGACATCATTAAAGCTGAGGGCACCCCAGTAAGAATC
AACACTTCCGGGGTAGAATTAACACATATTAGAATGCCTGAAAAAAGAAAGCAGCCAAAGCATGCAGATGCTCTAGCAAAATATAGGCTAACACCATCATATCCT
AAGCGGTTGCAGAATAAAGAGCGGAACGTTCAGTTAAAAAGGTTCCTAAATGTGCTGAAGCAATTGCATGTTAACACACCTTTGGTGGAAGCTCTAGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCACAATAGTGTATTTCAGATTGCAGCTCGAACTCGGCTTCCGGACCGATCTGAATACTTGGGCGGACCTGCACAAAAAGGTGAGCACTCCGACGATCAAGTC
AGTATAGGGTATTCTCTACGTAACGTCTCCGTGTTTGGTCGGTCGATCGATCCTGATCGATCTCGGTCATTCTTCTCTTTCCTAAAGACGGCAGTGTGCTTGATC
GGCGCGAGTAGAGACGATAGGCTTGCTTCTGGTGGCAAAACCCAATGGTGCATGAACGATTTTGAGAGCTTGGAATACATTATCTATCATGAGATAGAACGAACT
TTCCACAGAAATCGACATGAACAAATAAGAGCCCAAGCTGAAGCAAATATGAATCCACCCAACCCGAATATGCGTCAACCTATTCTACCGAATGTGAGGATCGAG
GAAATAGCAGATGGGTTTCCTATTGCTGCTAACCCTGAGGTATCAGTGCCCCCTCTCAATGCTGTATTACTAGCAGATAACATCGACAGAGAAGTCAGGGCGTAT
GCGGCTCCAGCCTTTTATAATTTCGACCCAGTAATCACGGAGCCGGAAATTGCTACCCCAAAGTTTGAACTAAAACCCGTAATGTTTCAGATGCTCCAGACAGTG
GGTCAGTTCCACGGTCATCCTACTGAAGATCTGCATTCACACCTGAATATGCTCAAGTTGTTCCCCTATTCACTTAAAGACGAAGCCGATACATGGTTACAGTCA
TTGCCGTCAGAATCTATTACAAGTTGGGAGGATATAGCCGAGAAATTATTGATGAAGTACTTCCCGCCCAGCAAGAACGCTAAGTACAGAAGCGATATTAATTAC
TTTCATCAATTTTCTAGGGAGTCGATTGAAATGTACTACAATGAATTGGATGACGCTATACGTCTGGTCATTGATGCCTTGGCAAATGGCGCATTGCAAGCAAAA
CCTTATGCTAAAGCATTCAATATCTTGGAGAGAATATCATCGAACAATCATTCATGGTCAAACCCTAGAGCCATTCAAAGTAGAGGAGCAGAGCGCAGTGGGAGC
ATCGACTGGACAGTAGCAACCGATTTAAAGAGCATACCTTATGTAGCATTGTCGAGCGACACTAAAGTACCGATGAGAGATGGTAAAAAGCAATGTAAAGCCCTC
ACACTGCGAAGTGGTAAGGCATTACCTCTAGCACACCTGAATGCTCCAAGGGCCGTGAGCGAGCCCACTCTGAAAGAATCAAGAGAATCTCAATTAGCGAAGAAT
AATGAGCCAGCAGAGACAACTTTACCCACTTCCCCAGTGCAGATCATGATGGAACCTAGGAAAGTTCAAGACATCATTAAAGCTGAGGGCACCCCAGTAAGAATC
AACACTTCCGGGGTAGAATTAACACATATTAGAATGCCTGAAAAAAGAAAGCAGCCAAAGCATGCAGATGCTCTAGCAAAATATAGGCTAACACCATCATATCCT
AAGCGGTTGCAGAATAAAGAGCGGAACGTTCAGTTAAAAAGGTTCCTAAATGTGCTGAAGCAATTGCATGTTAACACACCTTTGGTGGAAGCTCTAGAATAA
Protein sequenceShow/hide protein sequence
MHNSVFQIAARTRLPDRSEYLGGPAQKGEHSDDQVSIGYSLRNVSVFGRSIDPDRSRSFFSFLKTAVCLIGASRDDRLASGGKTQWCMNDFESLEYIIYHEIERT
FHRNRHEQIRAQAEANMNPPNPNMRQPILPNVRIEEIADGFPIAANPEVSVPPLNAVLLADNIDREVRAYAAPAFYNFDPVITEPEIATPKFELKPVMFQMLQTV
GQFHGHPTEDLHSHLNMLKLFPYSLKDEADTWLQSLPSESITSWEDIAEKLLMKYFPPSKNAKYRSDINYFHQFSRESIEMYYNELDDAIRLVIDALANGALQAK
PYAKAFNILERISSNNHSWSNPRAIQSRGAERSGSIDWTVATDLKSIPYVALSSDTKVPMRDGKKQCKALTLRSGKALPLAHLNAPRAVSEPTLKESRESQLAKN
NEPAETTLPTSPVQIMMEPRKVQDIIKAEGTPVRINTSGVELTHIRMPEKRKQPKHADALAKYRLTPSYPKRLQNKERNVQLKRFLNVLKQLHVNTPLVEALE