; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc05g12080 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc05g12080
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionRibonuclease H
Genome locationchr5:9447194..9451762
RNA-Seq ExpressionMoc05g12080
SyntenyMoc05g12080
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022152110.1 uncharacterized protein LOC111019899 [Momordica charantia]5.2e-5138.37Show/hide
Query:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGR---------TDKRSDQRKS---GGQDIRKSNSKPSLPNP---------PRAD
        MCYFL GLADE L VKL EEAP+T A +LQK KKVIDGQELL TK+G+         TD +S  + S   G  + R++ + P+   P         P ++
Subjt:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGR---------TDKRSDQRKS---GGQDIRKSNSKPSLPNP---------PRAD

Query:  LTTGDQ-------TRTPAE--GGPMRA-------------------YMGSGKVQQ-------RQYVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIF
        + T  +        + P +  G P R                    +    +++        +++VGK  +S A+KKEER RSRTPPRR DRPAVINTIF
Subjt:  LTTGDQ-------TRTPAE--GGPMRA-------------------YMGSGKVQQ-------RQYVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIF

Query:  GGPSGGRSGNKRKELAREARRE------------------------------------------------------------------------------
        GGPSGG+SG+KRK+LAR ARRE                                                                              
Subjt:  GGPSGGRSGNKRKELAREARRE------------------------------------------------------------------------------

Query:  -----------------------------MAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLE
                                     MAEF+V++GRSAYN IFG+PIIHS RAIPS LHQV+KY T NGVG +RGEQ  SRECY S LKG++VC LE
Subjt:  -----------------------------MAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLE

Query:  ELANPLSPTVQTSMDELPKTSQREVAAPTEELELVSLLSSEKQ
         L +    T++   D LP    RE AAP EELELV LLS EKQ
Subjt:  ELANPLSPTVQTSMDELPKTSQREVAAPTEELELVSLLSSEKQ

XP_022154405.1 uncharacterized protein LOC111021682 [Momordica charantia]3.2e-5641.12Show/hide
Query:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNP----PRADLTTGD---------QTR
        MCYFL GLADETL VKLGEEAP+T A +LQK KKVIDGQEL+ TK GR +K+ DQRK   Q+ RK +SK     P     RAD    D         +  
Subjt:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNP----PRADLTTGD---------QTR

Query:  TPA------------EGGPMRAYMGSGKVQQ----------------------------------------RQYVGKSGSSLAKKKEERNRSRTPPRRDD
        TP             E G  +      K+++                                        +++V K   +  +KKEE+ RSRT PRRDD
Subjt:  TPA------------EGGPMRAYMGSGKVQQ----------------------------------------RQYVGKSGSSLAKKKEERNRSRTPPRRDD

Query:  RPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------MA
        RP +IN IFGGP+GG+S NKRKELAREA+RE                                                                   MA
Subjt:  RPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------MA

Query:  EFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEE
        EF+VI+GRSAYN IFG+PIIHS R +PS +HQV+KY T NGVG +RGEQKT RECY SALKGS+VC L + AN     +  S  +LPK  +R+ +APTEE
Subjt:  EFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEE

Query:  LELVSLLSSEK
        LELV LLS E+
Subjt:  LELVSLLSSEK

XP_022154846.1 uncharacterized protein LOC111022006 [Momordica charantia]1.0e-5438.99Show/hide
Query:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSG--------------------GQDIRKSNSKPSLPNP-----
        M YFLIGLADETL ++LGEEAP T A +LQK KKVIDGQELL TK GR +K+ DQ+K G                      + R++ S P+   P     
Subjt:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSG--------------------GQDIRKSNSKPSLPNP-----

Query:  ----PRADLTTGDQ-------TRTPAE--GGPMRAYMGSGKVQQRQ------------YVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGG
            P +++ T  +        + P +  G P +    +    +RQ            +VGK  S+  +KKEER RSRTPPRRDDRPAVINTIFGGPSGG
Subjt:  ----PRADLTTGDQ-------TRTPAE--GGPMRAYMGSGKVQQRQ------------YVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGG

Query:  RSGNKRKELAREARRE------------------------------------------------------------------------------------
        + GNKR +LAR  RRE                                                                                    
Subjt:  RSGNKRKELAREARRE------------------------------------------------------------------------------------

Query:  -----------------------MAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPL
                               MAEF+VI+ +SAYN IFG+PIIHS  A+ S LHQV+KY T+NGVG +RGEQKTSR+CY S LKG AVCTLEE  N  
Subjt:  -----------------------MAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPL

Query:  SPTVQTSMDELPKTSQREVAAPTEELELVSLLSSEK
           +Q S  +LPK S+R+ + PTEELELV LLS EK
Subjt:  SPTVQTSMDELPKTSQREVAAPTEELELVSLLSSEK

XP_022157676.1 uncharacterized protein LOC111024332 [Momordica charantia]1.4e-5139.61Show/hide
Query:  LADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRTPAEGGPMRAY---------
        +ADE L VKLGEEAP+T A +LQK KKVIDGQELL TK GR +++  + +SG  +     SK       +   ++G      AE GP R+          
Subjt:  LADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRTPAEGGPMRAY---------

Query:  ------------MGSGKVQQR--------QYVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGGRSGNKRKELAREARRE------------
                     G  K+ +R        +   K  +S A+KKEER RSRTPPRR DRPAVINTIFGGPSGG+SG+KRKELAREARRE            
Subjt:  ------------MGSGKVQQR--------QYVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGGRSGNKRKELAREARRE------------

Query:  -----------------------------------------------------------------------------------------------MAEFM
                                                                                                       M EF+
Subjt:  -----------------------------------------------------------------------------------------------MAEFM

Query:  VINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEELEL
        V++GRS YN IFG+PIIHS R IPS LHQV+KY T NGVG +RGEQ  SRECY +ALKGS+VC LE L +    T++   D LP+   +E AAPTEELEL
Subjt:  VINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEELEL

Query:  VSLLSSEKQ
        V LLS EKQ
Subjt:  VSLLSSEKQ

XP_022158414.1 uncharacterized protein LOC111024904 [Momordica charantia]2.1e-6343.34Show/hide
Query:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRT---PAEGGPMRA
        MCYFL  LADETL VKLGEEAP+T   +LQK KKVIDGQELL TK GR +K+ DQ+K   Q+ RK++SK S      +  +  +  R    P+   P   
Subjt:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRT---PAEGGPMRA

Query:  YMGS---------------------------GKVQQR------------------------------------QYVGKSGSSLAKKKEERNRSRTPPRRD
        Y  S                           G +++R                                    ++VGK  S+  +KKEER RSRTPPRR+
Subjt:  YMGS---------------------------GKVQQR------------------------------------QYVGKSGSSLAKKKEERNRSRTPPRRD

Query:  DRPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------M
        DRPAVINTIFGGP+GG+SGNKRKELAREARRE                                                                   M
Subjt:  DRPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------M

Query:  AEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTE
        AEF+VI+GRSAYN IFG+PIIHS RA+PS LHQV+KY T N VGM+RGEQKTSRECY SALKGSAVC LEE  N     +Q S  +LPK  +R+   PTE
Subjt:  AEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTE

Query:  ELELVSLLSSEKQ
        ELELV LLS E+Q
Subjt:  ELELVSLLSSEKQ

TrEMBL top hitse value%identityAlignment
A0A6J1DJI4 uncharacterized protein LOC1110216821.5e-5641.12Show/hide
Query:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNP----PRADLTTGD---------QTR
        MCYFL GLADETL VKLGEEAP+T A +LQK KKVIDGQEL+ TK GR +K+ DQRK   Q+ RK +SK     P     RAD    D         +  
Subjt:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNP----PRADLTTGD---------QTR

Query:  TPA------------EGGPMRAYMGSGKVQQ----------------------------------------RQYVGKSGSSLAKKKEERNRSRTPPRRDD
        TP             E G  +      K+++                                        +++V K   +  +KKEE+ RSRT PRRDD
Subjt:  TPA------------EGGPMRAYMGSGKVQQ----------------------------------------RQYVGKSGSSLAKKKEERNRSRTPPRRDD

Query:  RPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------MA
        RP +IN IFGGP+GG+S NKRKELAREA+RE                                                                   MA
Subjt:  RPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------MA

Query:  EFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEE
        EF+VI+GRSAYN IFG+PIIHS R +PS +HQV+KY T NGVG +RGEQKT RECY SALKGS+VC L + AN     +  S  +LPK  +R+ +APTEE
Subjt:  EFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEE

Query:  LELVSLLSSEK
        LELV LLS E+
Subjt:  LELVSLLSSEK

A0A6J1DPX9 uncharacterized protein LOC1110220064.9e-5538.99Show/hide
Query:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSG--------------------GQDIRKSNSKPSLPNP-----
        M YFLIGLADETL ++LGEEAP T A +LQK KKVIDGQELL TK GR +K+ DQ+K G                      + R++ S P+   P     
Subjt:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSG--------------------GQDIRKSNSKPSLPNP-----

Query:  ----PRADLTTGDQ-------TRTPAE--GGPMRAYMGSGKVQQRQ------------YVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGG
            P +++ T  +        + P +  G P +    +    +RQ            +VGK  S+  +KKEER RSRTPPRRDDRPAVINTIFGGPSGG
Subjt:  ----PRADLTTGDQ-------TRTPAE--GGPMRAYMGSGKVQQRQ------------YVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGG

Query:  RSGNKRKELAREARRE------------------------------------------------------------------------------------
        + GNKR +LAR  RRE                                                                                    
Subjt:  RSGNKRKELAREARRE------------------------------------------------------------------------------------

Query:  -----------------------MAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPL
                               MAEF+VI+ +SAYN IFG+PIIHS  A+ S LHQV+KY T+NGVG +RGEQKTSR+CY S LKG AVCTLEE  N  
Subjt:  -----------------------MAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPL

Query:  SPTVQTSMDELPKTSQREVAAPTEELELVSLLSSEK
           +Q S  +LPK S+R+ + PTEELELV LLS EK
Subjt:  SPTVQTSMDELPKTSQREVAAPTEELELVSLLSSEK

A0A6J1DWS1 uncharacterized protein LOC1110247912.5e-5146.1Show/hide
Query:  CYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIR---KSNSKPSLPNPPRADLTTGDQTRTPAEGGPMRAY
        CYF  GLADETL VKLGEEA +T A +LQK KKVIDGQELL  K GR +K+ DQ+K+  +  R   KS  K S  +  RA+    D    P+   P   Y
Subjt:  CYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIR---KSNSKPSLPNPPRADLTTGDQTRTPAEGGPMRAY

Query:  MGS---------------------------GKVQQR------------------------------------QYVGKSGSSLAKKKEERNRSRTPPRRDD
          +                           G +++R                                    ++VGKS S+L +KKEER RSRTPPR+DD
Subjt:  MGS---------------------------GKVQQR------------------------------------QYVGKSGSSLAKKKEERNRSRTPPRRDD

Query:  RPAVINTIFGGPSGGRSGNKRKELAREARREMAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKG
        RPAVINTIFGG SGG+S NKRKELARE+RR+MAEF+VI+G+SAYN IFG+PIIHS RA+PS LHQV+KY T NGVG +R  +K          KG
Subjt:  RPAVINTIFGGPSGGRSGNKRKELAREARREMAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKG

A0A6J1DYW5 uncharacterized protein LOC1110243326.7e-5239.61Show/hide
Query:  LADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRTPAEGGPMRAY---------
        +ADE L VKLGEEAP+T A +LQK KKVIDGQELL TK GR +++  + +SG  +     SK       +   ++G      AE GP R+          
Subjt:  LADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRTPAEGGPMRAY---------

Query:  ------------MGSGKVQQR--------QYVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGGRSGNKRKELAREARRE------------
                     G  K+ +R        +   K  +S A+KKEER RSRTPPRR DRPAVINTIFGGPSGG+SG+KRKELAREARRE            
Subjt:  ------------MGSGKVQQR--------QYVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGGRSGNKRKELAREARRE------------

Query:  -----------------------------------------------------------------------------------------------MAEFM
                                                                                                       M EF+
Subjt:  -----------------------------------------------------------------------------------------------MAEFM

Query:  VINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEELEL
        V++GRS YN IFG+PIIHS R IPS LHQV+KY T NGVG +RGEQ  SRECY +ALKGS+VC LE L +    T++   D LP+   +E AAPTEELEL
Subjt:  VINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEELEL

Query:  VSLLSSEKQ
        V LLS EKQ
Subjt:  VSLLSSEKQ

A0A6J1DZB9 uncharacterized protein LOC1110249049.9e-6443.34Show/hide
Query:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRT---PAEGGPMRA
        MCYFL  LADETL VKLGEEAP+T   +LQK KKVIDGQELL TK GR +K+ DQ+K   Q+ RK++SK S      +  +  +  R    P+   P   
Subjt:  MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRT---PAEGGPMRA

Query:  YMGS---------------------------GKVQQR------------------------------------QYVGKSGSSLAKKKEERNRSRTPPRRD
        Y  S                           G +++R                                    ++VGK  S+  +KKEER RSRTPPRR+
Subjt:  YMGS---------------------------GKVQQR------------------------------------QYVGKSGSSLAKKKEERNRSRTPPRRD

Query:  DRPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------M
        DRPAVINTIFGGP+GG+SGNKRKELAREARRE                                                                   M
Subjt:  DRPAVINTIFGGPSGGRSGNKRKELAREARRE-------------------------------------------------------------------M

Query:  AEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTE
        AEF+VI+GRSAYN IFG+PIIHS RA+PS LHQV+KY T N VGM+RGEQKTSRECY SALKGSAVC LEE  N     +Q S  +LPK  +R+   PTE
Subjt:  AEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGVGMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTE

Query:  ELELVSLLSSEKQ
        ELELV LLS E+Q
Subjt:  ELELVSLLSSEKQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTGCTACTTCCTTATCGGCCTGGCTGACGAAACCTTGATGGTTAAGCTGGGCGAAGAAGCACCGTCAACCTTGGCTGTGATGTTGCAGAAGACCAAGAAGGTC
ATAGATGGGCAGGAGCTGCTCATAACCAAGCTAGGTCGGACAGATAAGAGGTCCGACCAGAGAAAGTCGGGCGGCCAAGACATAAGGAAGTCCAATTCTAAGCCC
AGTCTTCCCAACCCACCTAGGGCTGATCTGACTACAGGAGATCAGACTCGGACTCCAGCCGAAGGGGGCCCTATGAGAGCTTACATGGGATCTGGAAAAGTGCAG
CAAAGACAGTATGTTGGAAAGTCGGGCTCGAGCTTGGCAAAGAAAAAAGAAGAGAGGAACCGTTCAAGAACGCCACCTCGGAGGGATGACCGGCCTGCGGTCATC
AACACCATCTTTGGCGGCCCTAGTGGCGGCCGGTCAGGGAACAAAAGGAAGGAGCTTGCCAGGGAGGCTCGGCGTGAAATGGCCGAGTTCATGGTAATAAATGGA
AGGTCGGCCTACAATACCATCTTCGGACAACCTATAATTCATTCACTTCGTGCCATCCCATCGAATTTGCACCAAGTCATGAAGTACTTTACAGCCAATGGGGTT
GGCATGATCCGAGGTGAGCAGAAAACATCCCGCGAGTGCTATACCTCGGCACTCAAGGGATCGGCAGTCTGCACCCTGGAAGAACTAGCAAATCCTCTAAGTCCA
ACAGTTCAAACTTCCATGGACGAGCTGCCCAAGACTAGCCAGCGAGAAGTGGCTGCGCCTACAGAGGAGTTGGAGCTTGTCTCTTTACTCAGTTCAGAGAAGCAA
ACTTATGAAATTAGTCAGATTCCGAGATCGCAAAACTCCAATGCCGATGCCTTGGCCAAATTAGAGCTAGCCTACGAGACCGACCTGACGAGATCAATCCCAGTG
GAAATCTTGGACAACCCTTCCATTTTGGAGTCGGATATGATGGAAGTTAATTCCATTAAAGGGCACCGTATCATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTGCTACTTCCTTATCGGCCTGGCTGACGAAACCTTGATGGTTAAGCTGGGCGAAGAAGCACCGTCAACCTTGGCTGTGATGTTGCAGAAGACCAAGAAGGTC
ATAGATGGGCAGGAGCTGCTCATAACCAAGCTAGGTCGGACAGATAAGAGGTCCGACCAGAGAAAGTCGGGCGGCCAAGACATAAGGAAGTCCAATTCTAAGCCC
AGTCTTCCCAACCCACCTAGGGCTGATCTGACTACAGGAGATCAGACTCGGACTCCAGCCGAAGGGGGCCCTATGAGAGCTTACATGGGATCTGGAAAAGTGCAG
CAAAGACAGTATGTTGGAAAGTCGGGCTCGAGCTTGGCAAAGAAAAAAGAAGAGAGGAACCGTTCAAGAACGCCACCTCGGAGGGATGACCGGCCTGCGGTCATC
AACACCATCTTTGGCGGCCCTAGTGGCGGCCGGTCAGGGAACAAAAGGAAGGAGCTTGCCAGGGAGGCTCGGCGTGAAATGGCCGAGTTCATGGTAATAAATGGA
AGGTCGGCCTACAATACCATCTTCGGACAACCTATAATTCATTCACTTCGTGCCATCCCATCGAATTTGCACCAAGTCATGAAGTACTTTACAGCCAATGGGGTT
GGCATGATCCGAGGTGAGCAGAAAACATCCCGCGAGTGCTATACCTCGGCACTCAAGGGATCGGCAGTCTGCACCCTGGAAGAACTAGCAAATCCTCTAAGTCCA
ACAGTTCAAACTTCCATGGACGAGCTGCCCAAGACTAGCCAGCGAGAAGTGGCTGCGCCTACAGAGGAGTTGGAGCTTGTCTCTTTACTCAGTTCAGAGAAGCAA
ACTTATGAAATTAGTCAGATTCCGAGATCGCAAAACTCCAATGCCGATGCCTTGGCCAAATTAGAGCTAGCCTACGAGACCGACCTGACGAGATCAATCCCAGTG
GAAATCTTGGACAACCCTTCCATTTTGGAGTCGGATATGATGGAAGTTAATTCCATTAAAGGGCACCGTATCATTTAA
Protein sequenceShow/hide protein sequence
MCYFLIGLADETLMVKLGEEAPSTLAVMLQKTKKVIDGQELLITKLGRTDKRSDQRKSGGQDIRKSNSKPSLPNPPRADLTTGDQTRTPAEGGPMRAYMGSGKVQ
QRQYVGKSGSSLAKKKEERNRSRTPPRRDDRPAVINTIFGGPSGGRSGNKRKELAREARREMAEFMVINGRSAYNTIFGQPIIHSLRAIPSNLHQVMKYFTANGV
GMIRGEQKTSRECYTSALKGSAVCTLEELANPLSPTVQTSMDELPKTSQREVAAPTEELELVSLLSSEKQTYEISQIPRSQNSNADALAKLELAYETDLTRSIPV
EILDNPSILESDMMEVNSIKGHRII