CuGenDBv2

Moc07g10660 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc07g10660
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: MuDRA-like transposase
Genome location: chr7:8148952..8151751
RNA-Seq Expression: Moc07g10660
Synteny: Moc07g10660
Gene Ontology terms: GO:0008270 - zinc ion binding (molecular function)
InterPro domains: IPR004332 - Transposase, MuDR, plant
                  IPR006564 - Zinc finger, PMZ-type
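
The genome location above is written as chromosome:start..end. The following is a minimal sketch of how such a string could be parsed and its span computed. It assumes the coordinates are 1-based and inclusive, which is a common convention but is not stated on this page; the function names `parse_location` and `span_length` are hypothetical.

```python
import re

def parse_location(location):
    """Split a string such as 'chr7:8148952..8151751' into (chromosome, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

def span_length(start, end):
    # Assumption: coordinates are 1-based and inclusive (not confirmed by the page).
    return end - start + 1

chrom, start, end = parse_location("chr7:8148952..8151751")
length = span_length(start, end)
```

Under that assumption, the locus covers 2,800 bp on chr7.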


Homology
GenBank top hits (e value, % identity, alignment)
XP_022145820.1 uncharacterized protein LOC111015181 [Momordica charantia] (e value: 1.4e-217; % identity: 77.71)
Query:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN
        M+KNFQFKVKKST ELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAH CYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPK+IIQDMRKEYGVN
Subjt:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN

Query:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN
        LSYDRA RSSEEALRLIRGD ASSYGLL AYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVD AHLKGKFRGVLLSASGVDAN
Subjt:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN

Query:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI
        NQIYPVAF IVDGE                           RHQTICK ID               KVNLIAKFKNDAKAV ELFLKAAKAYRESYFNSI
Subjt:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI

Query:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------
        WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVN LFRHARKLPVTALLDHIR                                     
Subjt:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------

Query:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR
            DNIDQFH QV+D NLDGIVDLNAMTC+CREFDYFKI CS+ I A TMRNINPYSLCDE YT NSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEP 
Subjt:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR

Query:  RLFQELG
        +    +G
Subjt:  RLFQELG

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] (e value: 1.1e-238; % identity: 71.29)
Query:  EEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKK
        EEG  + E+ N ++DDALD+E E DVEQVH EI RDE AV+  GC+GLTG  N E LQLIVQSS TNDV EG+VFD KK+LSL+MHLV MR NFQFKVKK
Subjt:  EEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKK

Query:  STSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE
        ST ELYIL CV   CTWRLRATKL++C LFKIKKY + H C GG LK DHRQAKSWVVGHLVQ KFTDVSRTYRPKDIIQDMRKEYGVNLSYD+AWRSSE
Subjt:  STSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE

Query:  EALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIV
        EALRLIRGD ASSYGLL  YGEALKIMNPGTIFELEL+GGKYFKYVFM LG+SIRGFL CIRPVLVVD AHLKGKF GVLL ASG DANNQIYPVAF IV
Subjt:  EALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIV

Query:  DGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVR
        DGE                           RH TICK IDKVFP AFHCF IQHIK+NL+AKFK DAKA+EELFLKAAKAYRESYFNSIWAQL AYPGVR
Subjt:  DGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVR

Query:  EYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRD-----------------------------------------NIDQFH
        EYLDDIGKERWARCFQT+LRYTQMT+N AESVNALFRHARKLPVTALLDHIR                                          NIDQFH
Subjt:  EYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRD-----------------------------------------NIDQFH

Query:  LQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGR
        +QVRDGNLDGIVD N+ TC+CREFDYFKI CS+ I    MRNINPY+LCDE YT NSW++AYAEPIFP+GHVSTWNSSP+FV+ PVE   +   +G  R
Subjt:  LQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGR

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia] (e value: 5.8e-195; % identity: 61.16)
Query:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG
        GHD+ GLTPL SDVVPCNLGDDRVC W++PG+WNDN+DES ESYDSL +SEEG  + E+ N ++DDA D++ E DVEQV  EIRRDE  V   GC+GL G
Subjt:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG

Query:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH
         PNDEKLQLIVQSS TNDV EG VFD KK+LSL+ HLVAM  NFQFKVKKST ELYILRCV + CTWRLRA KL +C LFKIKKY + H C G  LK DH
Subjt:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH

Query:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL
        RQAK+WVV HLVQ KFTDVS TYRPKDIIQDMRKEYGVNLSYD+AW+S+EEALRLIRGD  +SYGLL AY                              
Subjt:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL

Query:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVD------------GERHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKN
             GFL CIRPVLVVD AHLKGKFR VLL+ASG DANNQIYPVAFVIVD             +RHQTICK IDKVF  AFHCF IQHIK NL+AKFK 
Subjt:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVD------------GERHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKN

Query:  DAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRDNIDQFHLQVRDG
        DAKA+EELFLKAAKAY+ESYFNSIWAQL AYPG+REYLDDIGKERW RCFQT+LRYTQMT+N AESVNALFRHA  LPVTALLDHIR             
Subjt:  DAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRDNIDQFHLQVRDG

Query:  NLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGRQLGFLPR
                                                                 EPIFP+ HVSTW SSP+FV+IP E   L   +G  + +   PR
Subjt:  NLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGRQLGFLPR

Query:  ARYDK
        AR+ K
Subjt:  ARYDK

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia] (e value: 3.1e-204; % identity: 88.02)
Query:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG
        GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYD LA SEEGHSQ EYGN EHDDALDDELE DVEQVHTEIRRDEEAVR PGCNGLTG
Subjt:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG

Query:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH
        DPNDEKLQLIVQSS TNDVNEGDVFDNKK+LSLKMHLVAMRKNFQFKVKKST +LYILRCVHADCTWRLRATKLKECTLFKIKKYCA H CYGGALKHDH
Subjt:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH

Query:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL
        RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGD ASSYGLL AYG+ALKIMNPGTIFELELEGGKYFKYVFMTL
Subjt:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL

Query:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCF
        GQSIRGFL CIRPVLVVD AHLKGKFRGVL SASGVDANNQIYPVAF IVDGE                           RHQ ICKTIDKVFPAAFHCF
Subjt:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCF

Query:  YIQHIKVNL
         IQHIKVN+
Subjt:  YIQHIKVNL

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia] (e value: 7.8e-200; % identity: 72.58)
Query:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN
        MRKNFQFKVKKST ELYILRCVHADCTWRLRATKLKECTLFKI KYCAAH CYGGALKHDHRQ KSWVVGHLVQEKFTDVSRTYRPKDIIQDMR EYGVN
Subjt:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN

Query:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN
        LSYDRAWRSSEEALRLIRGD ASSYGLL AYGEALKIMNPGTIFEL+LEGGKYFKYVFMTLGQSIRGFLGC RPVLVVD AHLKGKFRGVLLSASGVDAN
Subjt:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN

Query:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI
        NQIY VAF I+DGE                           RHQTICK IDKVF  AFHCF  QHIKVNLIAKFK+DAKAVEELFLKAAKAYRESYFNSI
Subjt:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI

Query:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------
        WAQLRAYP                                 S+NALFRH RKLPVTALLDHIR                                     
Subjt:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------

Query:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR
            DNIDQFH QVRDGNLDGIVDLNAM CSCREFDYFKI CS+ I A TMRNINPYSLCDE YT NSWILAYAEPIFPVGH+STWNSSPEFVNIPVEP 
Subjt:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR

Query:  RLFQELG
        +    +G
Subjt:  RLFQELG

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CVL4 uncharacterized protein LOC111015181 (e value: 6.9e-218; % identity: 77.71)
Query:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN
        M+KNFQFKVKKST ELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAH CYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPK+IIQDMRKEYGVN
Subjt:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN

Query:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN
        LSYDRA RSSEEALRLIRGD ASSYGLL AYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVD AHLKGKFRGVLLSASGVDAN
Subjt:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN

Query:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI
        NQIYPVAF IVDGE                           RHQTICK ID               KVNLIAKFKNDAKAV ELFLKAAKAYRESYFNSI
Subjt:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI

Query:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------
        WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVN LFRHARKLPVTALLDHIR                                     
Subjt:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------

Query:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR
            DNIDQFH QV+D NLDGIVDLNAMTC+CREFDYFKI CS+ I A TMRNINPYSLCDE YT NSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEP 
Subjt:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR

Query:  RLFQELG
        +    +G
Subjt:  RLFQELG

A0A6J1DJT1 uncharacterized protein LOC111020715 (e value: 5.4e-239; % identity: 71.29)
Query:  EEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKK
        EEG  + E+ N ++DDALD+E E DVEQVH EI RDE AV+  GC+GLTG  N E LQLIVQSS TNDV EG+VFD KK+LSL+MHLV MR NFQFKVKK
Subjt:  EEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKK

Query:  STSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE
        ST ELYIL CV   CTWRLRATKL++C LFKIKKY + H C GG LK DHRQAKSWVVGHLVQ KFTDVSRTYRPKDIIQDMRKEYGVNLSYD+AWRSSE
Subjt:  STSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE

Query:  EALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIV
        EALRLIRGD ASSYGLL  YGEALKIMNPGTIFELEL+GGKYFKYVFM LG+SIRGFL CIRPVLVVD AHLKGKF GVLL ASG DANNQIYPVAF IV
Subjt:  EALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIV

Query:  DGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVR
        DGE                           RH TICK IDKVFP AFHCF IQHIK+NL+AKFK DAKA+EELFLKAAKAYRESYFNSIWAQL AYPGVR
Subjt:  DGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVR

Query:  EYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRD-----------------------------------------NIDQFH
        EYLDDIGKERWARCFQT+LRYTQMT+N AESVNALFRHARKLPVTALLDHIR                                          NIDQFH
Subjt:  EYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRD-----------------------------------------NIDQFH

Query:  LQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGR
        +QVRDGNLDGIVD N+ TC+CREFDYFKI CS+ I    MRNINPY+LCDE YT NSW++AYAEPIFP+GHVSTWNSSP+FV+ PVE   +   +G  R
Subjt:  LQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGR

A0A6J1DP00 uncharacterized protein LOC111022954 (e value: 2.8e-195; % identity: 61.16)
Query:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG
        GHD+ GLTPL SDVVPCNLGDDRVC W++PG+WNDN+DES ESYDSL +SEEG  + E+ N ++DDA D++ E DVEQV  EIRRDE  V   GC+GL G
Subjt:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG

Query:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH
         PNDEKLQLIVQSS TNDV EG VFD KK+LSL+ HLVAM  NFQFKVKKST ELYILRCV + CTWRLRA KL +C LFKIKKY + H C G  LK DH
Subjt:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH

Query:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL
        RQAK+WVV HLVQ KFTDVS TYRPKDIIQDMRKEYGVNLSYD+AW+S+EEALRLIRGD  +SYGLL AY                              
Subjt:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL

Query:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVD------------GERHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKN
             GFL CIRPVLVVD AHLKGKFR VLL+ASG DANNQIYPVAFVIVD             +RHQTICK IDKVF  AFHCF IQHIK NL+AKFK 
Subjt:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVD------------GERHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKN

Query:  DAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRDNIDQFHLQVRDG
        DAKA+EELFLKAAKAY+ESYFNSIWAQL AYPG+REYLDDIGKERW RCFQT+LRYTQMT+N AESVNALFRHA  LPVTALLDHIR             
Subjt:  DAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRDNIDQFHLQVRDG

Query:  NLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGRQLGFLPR
                                                                 EPIFP+ HVSTW SSP+FV+IP E   L   +G  + +   PR
Subjt:  NLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGRQLGFLPR

Query:  ARYDK
        AR+ K
Subjt:  ARYDK

A0A6J1DTG5 uncharacterized protein LOC111023843 (e value: 1.5e-204; % identity: 88.02)
Query:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG
        GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYD LA SEEGHSQ EYGN EHDDALDDELE DVEQVHTEIRRDEEAVR PGCNGLTG
Subjt:  GHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDALDDELESDVEQVHTEIRRDEEAVRPPGCNGLTG

Query:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH
        DPNDEKLQLIVQSS TNDVNEGDVFDNKK+LSLKMHLVAMRKNFQFKVKKST +LYILRCVHADCTWRLRATKLKECTLFKIKKYCA H CYGGALKHDH
Subjt:  DPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDH

Query:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL
        RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGD ASSYGLL AYG+ALKIMNPGTIFELELEGGKYFKYVFMTL
Subjt:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTL

Query:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCF
        GQSIRGFL CIRPVLVVD AHLKGKFRGVL SASGVDANNQIYPVAF IVDGE                           RHQ ICKTIDKVFPAAFHCF
Subjt:  GQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCF

Query:  YIQHIKVNL
         IQHIKVN+
Subjt:  YIQHIKVNL

A0A6J1DYC4 uncharacterized protein LOC111025678 (e value: 3.8e-200; % identity: 72.58)
Query:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN
        MRKNFQFKVKKST ELYILRCVHADCTWRLRATKLKECTLFKI KYCAAH CYGGALKHDHRQ KSWVVGHLVQEKFTDVSRTYRPKDIIQDMR EYGVN
Subjt:  MRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVN

Query:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN
        LSYDRAWRSSEEALRLIRGD ASSYGLL AYGEALKIMNPGTIFEL+LEGGKYFKYVFMTLGQSIRGFLGC RPVLVVD AHLKGKFRGVLLSASGVDAN
Subjt:  LSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDAN

Query:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI
        NQIY VAF I+DGE                           RHQTICK IDKVF  AFHCF  QHIKVNLIAKFK+DAKAVEELFLKAAKAYRESYFNSI
Subjt:  NQIYPVAFVIVDGE---------------------------RHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSI

Query:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------
        WAQLRAYP                                 S+NALFRH RKLPVTALLDHIR                                     
Subjt:  WAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIR-------------------------------------

Query:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR
            DNIDQFH QVRDGNLDGIVDLNAM CSCREFDYFKI CS+ I A TMRNINPYSLCDE YT NSWILAYAEPIFPVGH+STWNSSPEFVNIPVEP 
Subjt:  ----DNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPR

Query:  RLFQELG
        +    +G
Subjt:  RLFQELG

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase (e value: 9.1e-21; % identity: 20.69)
Query:  GDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSR
        G  F +  ++   +   ++++  +  ++++  ++Y++ C    C W + A++ +E  LF+I +    H CY   L     +   + +     E+   V  
Subjt:  GDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSR

Query:  TYRPKDIIQDMRKEYGVNL-------SYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKY------FKYVFMTLGQSIRGFL
        T    ++ +   K++G  L       S      +  +A++   GD   S+ L+      L   N G + + + +   +      F+ +F    QSI+GF 
Subjt:  TYRPKDIIQDMRKEYGVNL-------SYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELEGGKY------FKYVFMTLGQSIRGFL

Query:  GCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVI-----VDGER------------HQTIC------KTIDKVF---------PAAFHCFYIQ
         C RP++VVD  +L GK++  L+ AS  DA NQ +P+AF +     VD  R             Q IC        I  V          P A+H F + 
Subjt:  GCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVI-----VDGER------------HQTIC------KTIDKVF---------PAAFHCFYIQ

Query:  HIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSIWAQLR-AYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVT-------
        H+   L +        +  L  +A  + ++  F+S   +++   P   ++LD     +WA       RY  M  +  E++ A+ +  RK+ +        
Subjt:  HIKVNLIAKFKNDAKAVEELFLKAAKAYRESYFNSIWAQLR-AYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVT-------

Query:  ----------------------ALLDHIRDNIDQFH------------------------------LQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNV
                                 +H+ + +++F                               +   + +  GIV LN  TC+C EF   K  C + 
Subjt:  ----------------------ALLDHIRDNIDQFH------------------------------LQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNV

Query:  IVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIP
        +       INP    D+ YT+  +   Y+    PV  +S W   PE   +P
Subjt:  IVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIP

AT1G64255.1 MuDR family transposase (e value: 9.7e-15; % identity: 22.6)
Query:  NDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKF
        +D+  G  F +  +L   +   +++   +  V+++  + YI  C+   C W L A ++K+  L +I KY   H C+    +    + ++  +   V+   
Subjt:  NDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKF

Query:  TDVSRTYRPKDIIQDM----RKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELE-----GGKYFKYVFMTLGQSIRGF
              Y P   I ++    +K+ G  L       + E+A++ + GD   S+        AL   N G + + + +         F  VF    QSI GF
Subjt:  TDVSRTYRPKDIIQDM----RKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELE-----GGKYFKYVFMTLGQSIRGF

Query:  LGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVD---------------------------GERHQTICKTIDK-----VFPAAFHCFYI
          C RP++VVD  +L  +++  L+ ASGVDA N+ +P+AF +                                H  I   +++       P A+H F +
Subjt:  LGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVD---------------------------GERHQTICKTIDK-----VFPAAFHCFYI

Query:  QHIKVNLIAKFKND--AKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFR---------HARK
         H        F +      +      + K    SY N I       P  R++LD   + RWA       RY  M  N      ALF          H   
Subjt:  QHIKVNLIAKFKND--AKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFR---------HARK

Query:  LPVTALLDHIRDNIDQ
          V  L D +R   D+
Subjt:  LPVTALLDHIRDNIDQ

AT1G64260.1 MuDR family transposase (e value: 3.1e-29; % identity: 22.72)
Query:  NDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKF
        +D++ G  F ++ +L   +    +R+     V+++  E+Y   CV   C W LRA +++E  L +I KY   H C      H++             E+ 
Subjt:  NDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECTLFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKF

Query:  TDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELE-----GGKYFKYVFMTLGQSIRGFLGCI
          +  T    ++ +  +++ G  L   +      E ++ + GD   S+ ++     A    N G + + + +         F+ VF +  QSI GF  C 
Subjt:  TDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELE-----GGKYFKYVFMTLGQSIRGFLGCI

Query:  RPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVDGERHQT----ICKTIDKVF----------------------------PAAFHCFYIQHIK
        RP++VVD   L GK++  L+ ASGVDA N+ +P+AF +       +      K  +KV                             P A H F + H++
Subjt:  RPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVDGERHQT----ICKTIDKVF----------------------------PAAFHCFYIQHIK

Query:  VNLIAKFK--NDAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRY--------------------------------T
           +  F+  N    VE+      K   +SY N I       P   ++LD I + +WA    + LRY                                 
Subjt:  VNLIAKFK--NDAKAVEELFLKAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRY--------------------------------T

Query:  QMTTNIAESVNALFRHARKLPV------TALLDHIRDNI---------DQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSL
        ++ ++  +S+++++    +  V        L + + D+I         D F +       + IV LN  TC+CR+F  +K  C + +       INP   
Subjt:  QMTTNIAESVNALFRHARKLPV------TALLDHIRDNI---------DQFHLQVRDGNLDGIVDLNAMTCSCREFDYFKILCSNVIVAVTMRNINPYSL

Query:  CDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIP
         DE YT+  +   YA    PV  V+ W   PE   +P
Subjt:  CDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIP
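
The alignment blocks above follow the usual BLAST-style layout, where the middle row carries the comparison: a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The following is a minimal sketch of how identities could be counted from one such block. The function name `midline_identity` is hypothetical, and the result is only a per-block figure; the % identity values listed for each hit may have been computed differently by the database.

```python
def midline_identity(query, midline, subject):
    """Return (identical, aligned_columns, percent) for one alignment block."""
    width = len(query)
    # Trailing spaces of the middle row are often trimmed, so pad it back.
    midline = midline.ljust(width)
    if len(subject) != width or len(midline) != width:
        raise ValueError("alignment rows must have the same length")
    # Letters in the middle row mark identical residues; '+' and ' ' do not count.
    identical = sum(1 for ch in midline if ch.isalpha())
    percent = 100.0 * identical / width if width else 0.0
    return identical, width, percent

# Last block of the first GenBank hit: query, middle row, subject.
identical, columns, percent = midline_identity("RLFQELG", "+    +G", "RLFQELG")
```

For that seven-column block only the final G is marked as identical, so the per-block identity is 1 of 7 columns.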


Sequences
CDS sequence
ATGTATCTCGGACCAGTCTTGGTCCGAGACATGTCCTGGACTAATCTCGAAGCATATTTAGTCCGGGACATGTCTTGGACCAGTCTTGCTTCGAGATTTGTCCGGGACAT
GTCTCGGACCAAGACTGGTCATGATGTAGAGGGTTTAACACCATTGGGGTCAGATGTTGTTCCATGTAATCTAGGAGATGATAGGGTATGTGATTGGGATGTGCCGGGAG
TATGGAATGATAACGAAGATGAAAGTGGTGAATCATATGACTCGTTGGCAGAGTCTGAAGAAGGACACTCTCAAGTAGAATATGGGAACGCAGAGCATGACGATGCGCTT
GATGATGAGCTTGAGTCTGATGTCGAACAAGTGCACACTGAGATTCGTAGGGATGAAGAAGCGGTCCGGCCACCGGGATGTAATGGTCTTACAGGAGACCCTAATGATGA
GAAATTGCAACTCATAGTACAGTCTTCTAGGACAAATGATGTTAATGAGGGCGATGTATTTGATAATAAGAAGAAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGA
AGAATTTTCAGTTTAAAGTAAAGAAGTCGACGTCGGAGCTATATATACTGCGGTGCGTTCATGCTGATTGCACGTGGAGACTTCGAGCTACCAAGCTAAAGGAATGCACT
TTGTTCAAGATAAAAAAATATTGTGCTGCCCATATGTGCTATGGTGGAGCTTTAAAACATGATCATAGGCAAGCCAAAAGTTGGGTGGTAGGACATCTTGTGCAAGAGAA
GTTCACAGACGTCTCCCGCACGTATAGACCAAAGGACATTATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGATAGAGCATGGCGTTCTAGTGAAGAAG
CACTCCGGCTTATTAGAGGTGATCTAGCATCGTCATATGGTCTACTTGCAGCTTATGGTGAAGCTTTGAAAATCATGAACCCAGGTACTATTTTCGAATTAGAACTAGAA
GGTGGCAAGTATTTCAAATATGTATTTATGACACTGGGTCAATCGATTCGAGGTTTTCTGGGTTGTATTAGACCAGTGTTGGTTGTTGACAGGGCCCACCTAAAGGGGAA
GTTCAGAGGGGTATTGTTATCAGCTTCTGGTGTCGATGCGAATAACCAGATTTACCCGGTAGCATTTGTGATTGTCGACGGTGAGAGACATCAAACCATTTGCAAGACAA
TCGACAAGGTATTTCCTGCTGCATTTCATTGCTTTTATATACAGCATATCAAGGTTAACTTGATAGCAAAATTTAAAAACGATGCGAAGGCAGTCGAGGAACTATTTTTA
AAGGCTGCAAAGGCGTATCGCGAGTCATATTTCAACTCGATCTGGGCCCAACTTCGTGCATACCCCGGTGTACGGGAATATCTAGACGATATTGGGAAGGAGCGTTGGGC
TCGTTGTTTCCAAACTCAATTGAGGTACACACAGATGACTACAAATATCGCAGAGTCTGTAAATGCCCTCTTCAGGCACGCCCGTAAGTTGCCGGTTACCGCCTTACTTG
ACCACATTAGAGACAACATTGACCAGTTCCATCTCCAGGTACGGGATGGCAACCTTGACGGGATTGTTGATTTGAACGCTATGACGTGTAGTTGTCGGGAGTTTGATTAC
TTTAAGATTCTGTGCTCTAATGTTATTGTGGCGGTGACGATGCGAAATATAAATCCATACAGTTTGTGCGACGAGACATATACGATGAACTCCTGGATATTGGCTTATGC
AGAACCCATATTTCCAGTCGGACACGTCTCGACATGGAACAGTTCGCCAGAGTTTGTCAACATACCGGTGGAACCACGAAGACTGTTCCAAGAGTTGGGAGGAGGAAGAC
AACTAGGATTCCTTCCACGTGCGAGGTACGACAAACACGTAAGTGTGGTCGATGTGGTGCGTGGGGACACAATCGCAAAACATGTAGCGAACCCCTTACCACATTGTGAA
TGTATGCTTCTGTTGTTCATTTACTTTTAA
mRNA sequence
ATGTATCTCGGACCAGTCTTGGTCCGAGACATGTCCTGGACTAATCTCGAAGCATATTTAGTCCGGGACATGTCTTGGACCAGTCTTGCTTCGAGATTTGTCCGGGACAT
GTCTCGGACCAAGACTGGTCATGATGTAGAGGGTTTAACACCATTGGGGTCAGATGTTGTTCCATGTAATCTAGGAGATGATAGGGTATGTGATTGGGATGTGCCGGGAG
TATGGAATGATAACGAAGATGAAAGTGGTGAATCATATGACTCGTTGGCAGAGTCTGAAGAAGGACACTCTCAAGTAGAATATGGGAACGCAGAGCATGACGATGCGCTT
GATGATGAGCTTGAGTCTGATGTCGAACAAGTGCACACTGAGATTCGTAGGGATGAAGAAGCGGTCCGGCCACCGGGATGTAATGGTCTTACAGGAGACCCTAATGATGA
GAAATTGCAACTCATAGTACAGTCTTCTAGGACAAATGATGTTAATGAGGGCGATGTATTTGATAATAAGAAGAAGTTGAGTTTGAAAATGCATTTAGTTGCAATGCGGA
AGAATTTTCAGTTTAAAGTAAAGAAGTCGACGTCGGAGCTATATATACTGCGGTGCGTTCATGCTGATTGCACGTGGAGACTTCGAGCTACCAAGCTAAAGGAATGCACT
TTGTTCAAGATAAAAAAATATTGTGCTGCCCATATGTGCTATGGTGGAGCTTTAAAACATGATCATAGGCAAGCCAAAAGTTGGGTGGTAGGACATCTTGTGCAAGAGAA
GTTCACAGACGTCTCCCGCACGTATAGACCAAAGGACATTATACAAGACATGAGGAAGGAGTATGGTGTCAATTTAAGTTATGATAGAGCATGGCGTTCTAGTGAAGAAG
CACTCCGGCTTATTAGAGGTGATCTAGCATCGTCATATGGTCTACTTGCAGCTTATGGTGAAGCTTTGAAAATCATGAACCCAGGTACTATTTTCGAATTAGAACTAGAA
GGTGGCAAGTATTTCAAATATGTATTTATGACACTGGGTCAATCGATTCGAGGTTTTCTGGGTTGTATTAGACCAGTGTTGGTTGTTGACAGGGCCCACCTAAAGGGGAA
GTTCAGAGGGGTATTGTTATCAGCTTCTGGTGTCGATGCGAATAACCAGATTTACCCGGTAGCATTTGTGATTGTCGACGGTGAGAGACATCAAACCATTTGCAAGACAA
TCGACAAGGTATTTCCTGCTGCATTTCATTGCTTTTATATACAGCATATCAAGGTTAACTTGATAGCAAAATTTAAAAACGATGCGAAGGCAGTCGAGGAACTATTTTTA
AAGGCTGCAAAGGCGTATCGCGAGTCATATTTCAACTCGATCTGGGCCCAACTTCGTGCATACCCCGGTGTACGGGAATATCTAGACGATATTGGGAAGGAGCGTTGGGC
TCGTTGTTTCCAAACTCAATTGAGGTACACACAGATGACTACAAATATCGCAGAGTCTGTAAATGCCCTCTTCAGGCACGCCCGTAAGTTGCCGGTTACCGCCTTACTTG
ACCACATTAGAGACAACATTGACCAGTTCCATCTCCAGGTACGGGATGGCAACCTTGACGGGATTGTTGATTTGAACGCTATGACGTGTAGTTGTCGGGAGTTTGATTAC
TTTAAGATTCTGTGCTCTAATGTTATTGTGGCGGTGACGATGCGAAATATAAATCCATACAGTTTGTGCGACGAGACATATACGATGAACTCCTGGATATTGGCTTATGC
AGAACCCATATTTCCAGTCGGACACGTCTCGACATGGAACAGTTCGCCAGAGTTTGTCAACATACCGGTGGAACCACGAAGACTGTTCCAAGAGTTGGGAGGAGGAAGAC
AACTAGGATTCCTTCCACGTGCGAGGTACGACAAACACGTAAGTGTGGTCGATGTGGTGCGTGGGGACACAATCGCAAAACATGTAGCGAACCCCTTACCACATTGTGAA
TGTATGCTTCTGTTGTTCATTTACTTTTAA
Protein sequence
MYLGPVLVRDMSWTNLEAYLVRDMSWTSLASRFVRDMSRTKTGHDVEGLTPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDSLAESEEGHSQVEYGNAEHDDAL
DDELESDVEQVHTEIRRDEEAVRPPGCNGLTGDPNDEKLQLIVQSSRTNDVNEGDVFDNKKKLSLKMHLVAMRKNFQFKVKKSTSELYILRCVHADCTWRLRATKLKECT
LFKIKKYCAAHMCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEALRLIRGDLASSYGLLAAYGEALKIMNPGTIFELELE
GGKYFKYVFMTLGQSIRGFLGCIRPVLVVDRAHLKGKFRGVLLSASGVDANNQIYPVAFVIVDGERHQTICKTIDKVFPAAFHCFYIQHIKVNLIAKFKNDAKAVEELFL
KAAKAYRESYFNSIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTQMTTNIAESVNALFRHARKLPVTALLDHIRDNIDQFHLQVRDGNLDGIVDLNAMTCSCREFDY
FKILCSNVIVAVTMRNINPYSLCDETYTMNSWILAYAEPIFPVGHVSTWNSSPEFVNIPVEPRRLFQELGGGRQLGFLPRARYDKHVSVVDVVRGDTIAKHVANPLPHCE
CMLLLFIYF
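
The sequences above are wrapped across several lines. The following is a minimal sketch of how the wrapped lines could be joined and how a CDS could be checked for length consistency with its protein: one codon per residue, plus an optional terminal stop codon. The helper names are hypothetical, the stop codons assume the standard genetic code, and the example uses toy data rather than the full sequences.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # standard genetic code (assumed)

def join_wrapped(lines):
    """Concatenate wrapped sequence lines into one string without whitespace."""
    return "".join(line.strip() for line in lines)

def cds_matches_protein_length(cds, protein):
    """True if the CDS has one codon per residue, ignoring a terminal stop codon."""
    cds = cds.upper()
    if len(cds) % 3 != 0:
        return False
    codons = len(cds) // 3
    if cds[-3:] in STOP_CODONS:
        codons -= 1
    return codons == len(protein)

# Toy example: ATG TAT TAA encodes M, Y and a stop.
toy_cds = join_wrapped(["ATGTAT", "TAA"])
toy_ok = cds_matches_protein_length(toy_cds, "MY")
```

The same check could be applied to the full CDS and protein sequences after pasting their lines into lists.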