CuGenDBv2

Moc08g32840 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g32840
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: MuDRA-like transposase
Genome location: chr8:23829048..23837452
RNA-Seq Expression: Moc08g32840
Synteny: Moc08g32840
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR038765 - Papain-like cysteine peptidase superfamily


Homology
GenBank top hits
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] (e value: 1.5e-156; % identity: 58.09)
Query:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE+VTIHNTMAEYPVD VHE  SNR+TGQSE DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSS
        STP+LYL+RC+DPTCTWRLR TKIRDCNLFKIKKYIA+HS CNGA+MKQDHRQAKS VVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVNMSYDK WRSS
Subjt:  STPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSS

Query:  EEALRLIRGDPASSYGLLSAYGEA----------------------------------------------------------------------------
        EEALRLIRGDPASSY LL AYGEA                                                                            
Subjt:  EEALRLIRGDPASSYGLLSAYGEA----------------------------------------------------------------------------

Query:  -----------------SVVDTVQNLVFISDRHAVICKAIDE----------------------------------------------------------
                         SVVDTVQNLVFISDRHA ICKAIDE                                                          
Subjt:  -----------------SVVDTVQNLVFISDRHAVICKAIDE----------------------------------------------------------

Query:  -------------------------------------------------------RWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF
                                                               RWFYEHRTLASSRQSTLSDYAEEMIAEA DNARRHIVMNIDQFNF
Subjt:  -------------------------------------------------------RWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS

XP_022145820.1 uncharacterized protein LOC111015181 [Momordica charantia] (e value: 3.3e-111; % identity: 46.12)
Query:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        ++KNFQF+VKKST ELY+LRCV   CTWRLRATK+++C LFKIKKY A H+ C G  +K DHRQAKS VVGHLVQ KFTDVSRTYRPK+IIQD+R+EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------
        N+SYD+  RSSEEALRLIRGDPASSYGLL AYGEA                                                                 
Subjt:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------

Query:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------
                 ++VD                    V NLVF+SDRH  ICKAID                                                
Subjt:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------

Query:  ----------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
                                                            + WFY+ RTLA+SR +TLSDYAE M AE S++ RRH+V NIDQF+F+V
Subjt:  ----------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

Query:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTI
        +D NL+G VDL + TC CREFDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  +TW SSP FVNI V+PPK V RVGRR+T  I
Subjt:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTI

Query:  PSTGEVRPPRKCSRCG
        PSTGEVR  RKC RCG
Subjt:  PSTGEVRPPRKCSRCG

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] (e value: 6.9e-133; % identity: 45.02)
Query:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +      AV +   + LTGQ   + LQ +VQS+GTNDVKEG+VFD+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK

Query:  KSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRS
        KSTPELY+L CVD +CTWRLRATK+RDCNLFKIKKY +IH+ CNG ++KQDHRQAKS VVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN+SYDK WRS
Subjt:  KSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRS

Query:  SEEALRLIRGDPASSYGLLSAYGEA--------------------------------------------------------------------------S
        SEEALRLIRGDPASSYGLL  YGEA                                                                          +
Subjt:  SEEALRLIRGDPASSYGLLSAYGEA--------------------------------------------------------------------------S

Query:  VVD-------------------TVQNLVFISDRHAVICKAID----------------------------------------------------------
        +VD                    V NLVF+S+RH  ICKAID                                                          
Subjt:  VVD-------------------TVQNLVFISDRHAVICKAID----------------------------------------------------------

Query:  ---------------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ
                                                                 + WFY+ RTLASSR +TLS YAE  +AE SDNARRH+V+NIDQ
Subjt:  ---------------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  +TW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRR

Query:  QTVTIPSTGEVRPPRKCSRCGLSKKGKIIRNQRQGNESIETS
        +TV IPSTGEVR  RKC RCG S       N +  NE + T+
Subjt:  QTVTIPSTGEVRPPRKCSRCGLSKKGKIIRNQRQGNESIETS

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia] (e value: 1.6e-110; % identity: 41.92)
Query:  RVFISFGEEWKDIEKDYVGGRTRGLTVDSKITYAEFLGHVCRLSSINRLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYISTVPKKSV
        RVFI+FG EW D EKDYVGGR RGLTVDS+   +  +   C + S  R  + I                                   P + S+    S 
Subjt:  RVFISFGEEWKDIEKDYVGGRTRGLTVDSKITYAEFLGHVCRLSSINRLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYISTVPKKSV

Query:  SSVLVVEPLFFPPTTPYFGHIGHDIASLAPLGSNVVPCNLGDDRAYDWDVPGLWNGSENVDEDSDESYRLMTDTEEGDDEREYGNEHASDRLDVQHEHEE
        S     +P        Y+GH+GHDIA L PL S+VVPCNLGDDR   W++PGLWN ++   ++SDESY  +  +EEGD E E+ N+   D  D + E + 
Subjt:  SSVLVVEPLFFPPTTPYFGHIGHDIASLAPLGSNVVPCNLGDDRAYDWDVPGLWNGSENVDEDSDESYRLMTDTEEGDDEREYGNEHASDRLDVQHEHEE

Query:  VTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIR
          +   +    V  V +   + L GQ   ++LQ +VQS+GTNDVKEG VFD+KKEL ++ HL+A+  NFQF+VKKSTPELY+LRCVD +CTWRLRA K+ 
Subjt:  VTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIR

Query:  DCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSSEEALRLIRGDPASSYGLLSAYG---
        DCNLFKIKKY +IH+ CNG ++KQDHRQAK+ VV HLVQ+KFTDVS TYRPKDIIQD+R+EYGVN+SYDK W+S+EEALRLIRGDP +SYGLL AYG   
Subjt:  DCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSSEEALRLIRGDPASSYGLLSAYG---

Query:  ----------------------------------------EASVVDTVQNLVFISDRHAVICKAIDERW--------FYEHRTLASSRQSTLSDYAEEMI
                                                  S +D V NLVF+SDRH  ICKAID+ +            +T   ++    +   EE+ 
Subjt:  ----------------------------------------EASVVDTVQNLVFISDRHAVICKAIDERW--------FYEHRTLASSRQSTLSDYAEEMI

Query:  AEASDNARRHIVMNI-DQFNFEVRDGNLNGDVDLQSQT-CTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSP
         +A+   +     +I  Q            D+  +  T C   E  Y ++  ++A       S+N       +  V + +    EPIFP+   +TWKSSP
Subjt:  AEASDNARRHIVMNI-DQFNFEVRDGNLNGDVDLQSQT-CTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSP

Query:  GFVNIDVQPPKKVVRVGRRQTVTIP
         FV+I  + P  V RVG+RQ+V IP
Subjt:  GFVNIDVQPPKKVVRVGRRQTVTIP

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia] (e value: 8.4e-115; % identity: 48.5)
Query:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        +RKNFQF+VKKST ELY+LRCV   CTWRLRATK+++C LFKI KY A H+ C G  +K DHRQ KS VVGHLVQ KFTDVSRTYRPKDIIQD+R EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------
        N+SYD+ WRSSEEALRLIRGDPASSYGLL AYGEA                                                                 
Subjt:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------

Query:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------
                 +++D                    V NLVF+SDRH  ICKAID                                                
Subjt:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------

Query:  -----------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCT
                                           + WFY+ RTLA+SR +TLSDYAE M AE SD+ARRH+V NIDQF+F+VRDGNL+G VDL +  C+
Subjt:  -----------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCT

Query:  CREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTIPSTGEVRPPRKCSRCG
        CREFDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  +TW SSP FVNI V+PPK V RVGRR+TV IPSTGEVR  RKC RCG
Subjt:  CREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTIPSTGEVRPPRKCSRCG

TrEMBL top hits
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like (e value: 7.4e-157; % identity: 58.09)
Query:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE+VTIHNTMAEYPVD VHE  SNR+TGQSE DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSS
        STP+LYL+RC+DPTCTWRLR TKIRDCNLFKIKKYIA+HS CNGA+MKQDHRQAKS VVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVNMSYDK WRSS
Subjt:  STPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSS

Query:  EEALRLIRGDPASSYGLLSAYGEA----------------------------------------------------------------------------
        EEALRLIRGDPASSY LL AYGEA                                                                            
Subjt:  EEALRLIRGDPASSYGLLSAYGEA----------------------------------------------------------------------------

Query:  -----------------SVVDTVQNLVFISDRHAVICKAIDE----------------------------------------------------------
                         SVVDTVQNLVFISDRHA ICKAIDE                                                          
Subjt:  -----------------SVVDTVQNLVFISDRHAVICKAIDE----------------------------------------------------------

Query:  -------------------------------------------------------RWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF
                                                               RWFYEHRTLASSRQSTLSDYAEEMIAEA DNARRHIVMNIDQFNF
Subjt:  -------------------------------------------------------RWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS

A0A6J1CVL4 uncharacterized protein LOC111015181 (e value: 1.6e-111; % identity: 46.12)
Query:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        ++KNFQF+VKKST ELY+LRCV   CTWRLRATK+++C LFKIKKY A H+ C G  +K DHRQAKS VVGHLVQ KFTDVSRTYRPK+IIQD+R+EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------
        N+SYD+  RSSEEALRLIRGDPASSYGLL AYGEA                                                                 
Subjt:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------

Query:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------
                 ++VD                    V NLVF+SDRH  ICKAID                                                
Subjt:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------

Query:  ----------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
                                                            + WFY+ RTLA+SR +TLSDYAE M AE S++ RRH+V NIDQF+F+V
Subjt:  ----------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

Query:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTI
        +D NL+G VDL + TC CREFDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  +TW SSP FVNI V+PPK V RVGRR+T  I
Subjt:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTI

Query:  PSTGEVRPPRKCSRCG
        PSTGEVR  RKC RCG
Subjt:  PSTGEVRPPRKCSRCG

A0A6J1DJT1 uncharacterized protein LOC111020715 (e value: 3.3e-133; % identity: 45.02)
Query:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +      AV +   + LTGQ   + LQ +VQS+GTNDVKEG+VFD+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK

Query:  KSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRS
        KSTPELY+L CVD +CTWRLRATK+RDCNLFKIKKY +IH+ CNG ++KQDHRQAKS VVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN+SYDK WRS
Subjt:  KSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRS

Query:  SEEALRLIRGDPASSYGLLSAYGEA--------------------------------------------------------------------------S
        SEEALRLIRGDPASSYGLL  YGEA                                                                          +
Subjt:  SEEALRLIRGDPASSYGLLSAYGEA--------------------------------------------------------------------------S

Query:  VVD-------------------TVQNLVFISDRHAVICKAID----------------------------------------------------------
        +VD                    V NLVF+S+RH  ICKAID                                                          
Subjt:  VVD-------------------TVQNLVFISDRHAVICKAID----------------------------------------------------------

Query:  ---------------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ
                                                                 + WFY+ RTLASSR +TLS YAE  +AE SDNARRH+V+NIDQ
Subjt:  ---------------------------------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  +TW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRR

Query:  QTVTIPSTGEVRPPRKCSRCGLSKKGKIIRNQRQGNESIETS
        +TV IPSTGEVR  RKC RCG S       N +  NE + T+
Subjt:  QTVTIPSTGEVRPPRKCSRCGLSKKGKIIRNQRQGNESIETS

A0A6J1DP00 uncharacterized protein LOC111022954 (e value: 8.0e-111; % identity: 41.92)
Query:  RVFISFGEEWKDIEKDYVGGRTRGLTVDSKITYAEFLGHVCRLSSINRLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYISTVPKKSV
        RVFI+FG EW D EKDYVGGR RGLTVDS+   +  +   C + S  R  + I                                   P + S+    S 
Subjt:  RVFISFGEEWKDIEKDYVGGRTRGLTVDSKITYAEFLGHVCRLSSINRLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYISTVPKKSV

Query:  SSVLVVEPLFFPPTTPYFGHIGHDIASLAPLGSNVVPCNLGDDRAYDWDVPGLWNGSENVDEDSDESYRLMTDTEEGDDEREYGNEHASDRLDVQHEHEE
        S     +P        Y+GH+GHDIA L PL S+VVPCNLGDDR   W++PGLWN ++   ++SDESY  +  +EEGD E E+ N+   D  D + E + 
Subjt:  SSVLVVEPLFFPPTTPYFGHIGHDIASLAPLGSNVVPCNLGDDRAYDWDVPGLWNGSENVDEDSDESYRLMTDTEEGDDEREYGNEHASDRLDVQHEHEE

Query:  VTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIR
          +   +    V  V +   + L GQ   ++LQ +VQS+GTNDVKEG VFD+KKEL ++ HL+A+  NFQF+VKKSTPELY+LRCVD +CTWRLRA K+ 
Subjt:  VTIHNTMAEYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIR

Query:  DCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSSEEALRLIRGDPASSYGLLSAYG---
        DCNLFKIKKY +IH+ CNG ++KQDHRQAK+ VV HLVQ+KFTDVS TYRPKDIIQD+R+EYGVN+SYDK W+S+EEALRLIRGDP +SYGLL AYG   
Subjt:  DCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSSEEALRLIRGDPASSYGLLSAYG---

Query:  ----------------------------------------EASVVDTVQNLVFISDRHAVICKAIDERW--------FYEHRTLASSRQSTLSDYAEEMI
                                                  S +D V NLVF+SDRH  ICKAID+ +            +T   ++    +   EE+ 
Subjt:  ----------------------------------------EASVVDTVQNLVFISDRHAVICKAIDERW--------FYEHRTLASSRQSTLSDYAEEMI

Query:  AEASDNARRHIVMNI-DQFNFEVRDGNLNGDVDLQSQT-CTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSP
         +A+   +     +I  Q            D+  +  T C   E  Y ++  ++A       S+N       +  V + +    EPIFP+   +TWKSSP
Subjt:  AEASDNARRHIVMNI-DQFNFEVRDGNLNGDVDLQSQT-CTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSP

Query:  GFVNIDVQPPKKVVRVGRRQTVTIP
         FV+I  + P  V RVG+RQ+V IP
Subjt:  GFVNIDVQPPKKVVRVGRRQTVTIP

A0A6J1DYC4 uncharacterized protein LOC111025678 (e value: 4.1e-115; % identity: 48.5)
Query:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        +RKNFQF+VKKST ELY+LRCV   CTWRLRATK+++C LFKI KY A H+ C G  +K DHRQ KS VVGHLVQ KFTDVSRTYRPKDIIQD+R EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAIHSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------
        N+SYD+ WRSSEEALRLIRGDPASSYGLL AYGEA                                                                 
Subjt:  NMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEA-----------------------------------------------------------------

Query:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------
                 +++D                    V NLVF+SDRH  ICKAID                                                
Subjt:  ---------SVVD-------------------TVQNLVFISDRHAVICKAID------------------------------------------------

Query:  -----------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCT
                                           + WFY+ RTLA+SR +TLSDYAE M AE SD+ARRH+V NIDQF+F+VRDGNL+G VDL +  C+
Subjt:  -----------------------------------ERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCT

Query:  CREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTIPSTGEVRPPRKCSRCG
        CREFDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  +TW SSP FVNI V+PPK V RVGRR+TV IPSTGEVR  RKC RCG
Subjt:  CREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTIPSTGEVRPPRKCSRCG

SwissProt top hits
No hits found

Arabidopsis top hits
AT1G49920.1 MuDR family transposase (e value: 4.3e-08; % identity: 33.68)
Query:  NGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVN----IDVQPPKKVVRVGRRQ
        +G V L   TCTC EF   K PC HA+A      INP    D+ YTV  +   Y+    PV   + W  + G       +   PP KV   G+ +
Subjt:  NGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPGFVN----IDVQPPKKVVRVGRRQ

AT1G64255.1 MuDR family transposase (e value: 5.3e-06; % identity: 30.43)
Query:  HIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPG
        +IV  +D   F+V      G+  V L   +CTC +F  +K PC HA+A       NP    D+ YT+      YA     V   + W  + G
Subjt:  HIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTWKSSPG

AT1G64260.1 MuDR family transposase (e value: 3.9e-09; % identity: 33.33)
Query:  HIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTW
        +++  +++ +F+V + +   +  V L   TCTCR+F  +K PC HA+A      INP    DE YTV  +   YA    PV     W
Subjt:  HIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSTTW

AT5G45570.1 Ulp1 protease family protein (e value: 2.0e-05; % identity: 33.02)
Query:  TMDHQLRIKENDRFPAQATSMSHLSNVSRL--IKDKLTADQLDMFRRRT--IFGRFVDLEMMFCSGVVHHFLSREVAGSSDDSMSFLIGGNVLTFSKDQF
        T  + LR+ E  + P Q  SM+H   VS++  IKD L AD  D  ++ T  +F +F +   ++ +  VH FL+ ++   +   M  LI    + FS  +F
Subjt:  TMDHQLRIKENDRFPAQATSMSHLSNVSRL--IKDKLTADQLDMFRRRT--IFGRFVDLEMMFCSGVVHHFLSREVAGSSDDSMSFLIGGNVLTFSKDQF

Query:  MLITGL
          ITGL
Subjt:  MLITGL


Sequences
CDS sequence
ATGCCTCGTGTTTTTATATCATTCGGTGAAGAATGGAAAGATATTGAAAAGGATTATGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTAT
GCTGAATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCGATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAATG
GAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGTCCGTCAGTTCCGTT
CTCGTCGTCGAACCCCTCTTCTTCCCGCCCACCACCCCCTACTTTGGTCATATTGGTCATGATATAGCATCTCTCGCACCGTTAGGGTCAAATGTTGTTCCTTGT
AATTTGGGAGATGATAGGGCATATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATCGTCTAATGACCGAC
ACAGAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCTAGTGATCGACTTGATGTGCAACATGAGCATGAAGAGGTAACAATTCATAATACAATGGCT
GAATATCCTGTAGATGCCGTCCATGAAACGACAAGCAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACTAATGAT
GTTAAGGAGGGTGATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACGCCG
GAACTATACTTGCTGCGATGCGTCGATCCTACTTGCACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAATC
CATTCTAATTGCAATGGTGCCCTTATGAAACAGGATCATCGTCAGGCGAAGAGTTGCGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACG
TACAGGCCGAAGGACATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGTCTGGCGTTCGAGCGAAGAAGCACTCCGACTTATCAGA
GGGGATCCAGCTTCATCGTACGGGCTACTATCCGCTTATGGGGAAGCCAGTGTTGTCGACACCGTTCAGAATTTGGTCTTCATTTCTGATCGACATGCCGTCATC
TGTAAAGCGATTGATGAGAGGTGGTTCTACGAACATCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCG
GATAATGCACGGAGACACATTGTTATGAACATCGACCAGTTTAATTTTGAGGTACGCGACGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACT
TGTCGGGAGTTCGATTATTTTAAAGTCCCGTGCTCCCATGCTATTGCTGCAGCCAGTTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTC
AACAGCTGGATGTTGGCGTATGCAGAACCAATATTTCCAGTGGGTTCATCCACAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAG
AAGGTCGTTAGGGTTGGACGGCGACAGACGGTGACGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGGCTTTCGAAGAAGGGGAAA
ATAATACGTAATCAGCGCCAGGGAAATGAATCGATTGAGACTAGCCAAAGAGCCAAAAATGAGATCGAGAAATACAGTGAACATGAAAGGTGCACAATGGACCAT
CAGTTGAGGATTAAGGAGAATGACCGTTTTCCGGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAGCAGGCTTATCAAGGATAAACTCACAGCGGACCAA
CTTGATATGTTCCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGGTTGCTGGG
AGCAGTGACGACAGCATGAGTTTCTTAATTGGTGGCAACGTGTTGACATTCTCGAAGGATCAATTCATGCTTATAACGGGATTGTGGCGGCTGCCCGGTAAGGTG
GTTCAGAAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAGCAATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGAGCAAGAGCAAGTCGAAGGTT
GACATCGACTTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGGACTGGGGTTCTGATGTCTGGAGTAGGACAGTCAACGGTCTGAAGCGTGCGATG
AATGGAAACGTTGCGCTATACAAGAACAAAGTAATCAATACGTATGCATACTCGTTGTCCTTGTTTGTGAAAACTTTTTTCAGGTGTGAAACTAAACTGAAATTT
TATATTCGGGTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGCGAGACCGCCATTCCCCGGATAATTCGGTATTCGTGCAGT
AGAGTCGTCGGTACAAAAGATCTGGGGGAGGAGGTCATTGGTTCAGCGGTGTTGGTCATATCGTATCCACTCGTGGAGACGGAGCTGGATAAGGACTACCAGCGG
TGTCCATTGGACGAAAGAGAGGTGGTTGATTTAACTGCGCCTGGGTGTTCCACCTCCGACAGTGATGATGGACACAATCCTTTGCCCATCACCGACAATCTTGGC
GCCGAAGACGATCTCCCACTCGACGATGCGCATTCGTTGGAAACGAATGTACAGGCAATGCCGGACGAGTCTTCGGATATGCCACGTACAAAGGCCGCATCTGAA
GGTGGGCAACGGACATCGGTCGAAGTACTTCGACGAAGTACTTCTATTCGGTCGAATGTGGGCCAAAGCCCGCGGCAATCACCGCGAGCGACGTCACGCGCTGCT
TGCCCTACACAACGGCACGACACCCGTCGATCGAATGATAGATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCGGACATGGCGGAGGTGAAGACGGAT
TTGGCGGAGGTCAAGTCCGACTTGAGTGAAATGAAACTCCTGCTTCAACGGTTGTGCCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCGGATACA
GTCCAAGTGTCACATCCATTGGTATCCAATGTTATCCCCGAGCATGATGGGGATGCTGATGACCACCAACCTGGAGGTTCCGAGGGTGGAAAGGAGGACGATGTG
GTTCCTGTAGAAGCATCGTTGCATGAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCAATCTTGGAGATGCAGAACTAGCCAACCCCGCATGTATT
GTCAATTCGGTGGAGTTGGACGTTGCAGTGGTGACACCCATTGTTTCGACAGAGATGGTGGAACTCGAAATGGCACCGCCAATAGTACAGGATCCACAATCCCAT
ACGACGACGTCCGATCCAACCTTCGAGCCTCCTGCCTCAACCAACATTGATGGTCCGTGTGGCATGATCCATGGGCCTCGTCAAGCCGAGCATATTGAGTTGGCC
CTTACACCAGCGGATACGAGTCCCACAACTCAGTCGGCCAGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAGGACCGAGCGTAAGCGGACGGAAACGAAA
CCATTCAGTCTGGAGGACACGCGTCGGCAAAAGAAGAAGCAGAAGGTGGTGGATGTGGATCCCGTACCTGCCAGCCAGGTTAGGTCATTCCGTCCCAAATACAAC
CCGTTGCATAACTTTCCGGATGCCAAGTTTAGGGAGATGATGTGTTGGATACGGGACCCTGAGAATGACAAAACAACGCGGCCGTCTACAACTTGGAATGTGGAG
AGCGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCATGTGGAGAAGGTGGAAGACCCGGAAGCTGCTGGCATTCTATATTTCATTATGAGGAAGCTCGAT
AGTCGGCCGCACCTGTGCGTTCATAAGTTCTTTGTCCTGGACCCACTACAAATGCAAGTTCTTGCCGCTGCAGGTGGTCCCTACGCACGAATCAAGGGGAAGGTC
GTCCAGGACACGATCAATGCTTGGGACGAGTATAAGGAGTGCATGGATGCCGTGTTGGGTCTTGTGGAAGATTTCATTCCAGCCTGGGTGGACGTCGACGTAGTG
TACAGCCCGCTCTGTATCAAGGATCACTGGGTCCTGGCTGCGATGGATATGACCCAGTCCGAGATTTTTGTATACGACTCATTGCCAGGCCACATTTCCACGTCG
AAGTTGCTGACAGACATGCGGCCGTTGAGTCATACAATCTCATCGCTTTTGTATGCATGTGGGCTGATGGATACGGTCGATTGCAAGCTGAAGAGGACTCCTTGG
CGTGTATACCGTCCTACGACCGACACGAGGCAGAAAGAAGCAAACATGTCAACATCTTACGAGTCGGACGGTTCCTCTTCCGATGGAGCAACCAGTCCTATCTCA
AACCATCTCTCCTCCTTCGCGGAGAACGGAGTGCCAATTAACGGTCCTCTTCAACAAGCACGGGAAGAAAACGACCAGCTCAGGAGAGAGCTACGTCGAACGCAA
CACGAGCTCAACAACACGAGGTATAAGTTAGCCCGGGTTGAAGAAATGCGGGACTTGCTGGAGGGACTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAG
GACAGGGTGGATCAGTTACTGGCCCGTCTACGCTGA
mRNA sequence
ATGCCTCGTGTTTTTATATCATTCGGTGAAGAATGGAAAGATATTGAAAAGGATTATGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTAT
GCTGAATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCGATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAATG
GAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGTCCGTCAGTTCCGTT
CTCGTCGTCGAACCCCTCTTCTTCCCGCCCACCACCCCCTACTTTGGTCATATTGGTCATGATATAGCATCTCTCGCACCGTTAGGGTCAAATGTTGTTCCTTGT
AATTTGGGAGATGATAGGGCATATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATCGTCTAATGACCGAC
ACAGAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCTAGTGATCGACTTGATGTGCAACATGAGCATGAAGAGGTAACAATTCATAATACAATGGCT
GAATATCCTGTAGATGCCGTCCATGAAACGACAAGCAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGACTAATGAT
GTTAAGGAGGGTGATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACGCCG
GAACTATACTTGCTGCGATGCGTCGATCCTACTTGCACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAATC
CATTCTAATTGCAATGGTGCCCTTATGAAACAGGATCATCGTCAGGCGAAGAGTTGCGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACG
TACAGGCCGAAGGACATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGTCTGGCGTTCGAGCGAAGAAGCACTCCGACTTATCAGA
GGGGATCCAGCTTCATCGTACGGGCTACTATCCGCTTATGGGGAAGCCAGTGTTGTCGACACCGTTCAGAATTTGGTCTTCATTTCTGATCGACATGCCGTCATC
TGTAAAGCGATTGATGAGAGGTGGTTCTACGAACATCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCG
GATAATGCACGGAGACACATTGTTATGAACATCGACCAGTTTAATTTTGAGGTACGCGACGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACT
TGTCGGGAGTTCGATTATTTTAAAGTCCCGTGCTCCCATGCTATTGCTGCAGCCAGTTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTC
AACAGCTGGATGTTGGCGTATGCAGAACCAATATTTCCAGTGGGTTCATCCACAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAG
AAGGTCGTTAGGGTTGGACGGCGACAGACGGTGACGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGGCTTTCGAAGAAGGGGAAA
ATAATACGTAATCAGCGCCAGGGAAATGAATCGATTGAGACTAGCCAAAGAGCCAAAAATGAGATCGAGAAATACAGTGAACATGAAAGGTGCACAATGGACCAT
CAGTTGAGGATTAAGGAGAATGACCGTTTTCCGGCTCAAGCTACCAGCATGTCTCACTTGAGCAATGTCAGCAGGCTTATCAAGGATAAACTCACAGCGGACCAA
CTTGATATGTTCCGTAGAAGAACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGGTTGCTGGG
AGCAGTGACGACAGCATGAGTTTCTTAATTGGTGGCAACGTGTTGACATTCTCGAAGGATCAATTCATGCTTATAACGGGATTGTGGCGGCTGCCCGGTAAGGTG
GTTCAGAAAAAGATTGGAAAGAATAGGTTGCGGAGGAAGTACTTCAGCAATGAAGCCTCCATGCTGCTCGAAGAGTTTGTGGAGAGCAAGAGCAAGTCGAAGGTT
GACATCGACTTGTACAACCAAGTCGATGACTTGGACTACTTCAACCATTTGGACTGGGGTTCTGATGTCTGGAGTAGGACAGTCAACGGTCTGAAGCGTGCGATG
AATGGAAACGTTGCGCTATACAAGAACAAAGTAATCAATACGTATGCATACTCGTTGTCCTTGTTTGTGAAAACTTTTTTCAGGTGTGAAACTAAACTGAAATTT
TATATTCGGGTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTCAATCGTTTGAGCGAGACCGCCATTCCCCGGATAATTCGGTATTCGTGCAGT
AGAGTCGTCGGTACAAAAGATCTGGGGGAGGAGGTCATTGGTTCAGCGGTGTTGGTCATATCGTATCCACTCGTGGAGACGGAGCTGGATAAGGACTACCAGCGG
TGTCCATTGGACGAAAGAGAGGTGGTTGATTTAACTGCGCCTGGGTGTTCCACCTCCGACAGTGATGATGGACACAATCCTTTGCCCATCACCGACAATCTTGGC
GCCGAAGACGATCTCCCACTCGACGATGCGCATTCGTTGGAAACGAATGTACAGGCAATGCCGGACGAGTCTTCGGATATGCCACGTACAAAGGCCGCATCTGAA
GGTGGGCAACGGACATCGGTCGAAGTACTTCGACGAAGTACTTCTATTCGGTCGAATGTGGGCCAAAGCCCGCGGCAATCACCGCGAGCGACGTCACGCGCTGCT
TGCCCTACACAACGGCACGACACCCGTCGATCGAATGATAGATTCGGGGCTATGGAGAGAAGGCTGGATCTTTTAGCTTCGGACATGGCGGAGGTGAAGACGGAT
TTGGCGGAGGTCAAGTCCGACTTGAGTGAAATGAAACTCCTGCTTCAACGGTTGTGCCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCGGATACA
GTCCAAGTGTCACATCCATTGGTATCCAATGTTATCCCCGAGCATGATGGGGATGCTGATGACCACCAACCTGGAGGTTCCGAGGGTGGAAAGGAGGACGATGTG
GTTCCTGTAGAAGCATCGTTGCATGAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCAATCTTGGAGATGCAGAACTAGCCAACCCCGCATGTATT
GTCAATTCGGTGGAGTTGGACGTTGCAGTGGTGACACCCATTGTTTCGACAGAGATGGTGGAACTCGAAATGGCACCGCCAATAGTACAGGATCCACAATCCCAT
ACGACGACGTCCGATCCAACCTTCGAGCCTCCTGCCTCAACCAACATTGATGGTCCGTGTGGCATGATCCATGGGCCTCGTCAAGCCGAGCATATTGAGTTGGCC
CTTACACCAGCGGATACGAGTCCCACAACTCAGTCGGCCAGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAGGACCGAGCGTAAGCGGACGGAAACGAAA
CCATTCAGTCTGGAGGACACGCGTCGGCAAAAGAAGAAGCAGAAGGTGGTGGATGTGGATCCCGTACCTGCCAGCCAGGTTAGGTCATTCCGTCCCAAATACAAC
CCGTTGCATAACTTTCCGGATGCCAAGTTTAGGGAGATGATGTGTTGGATACGGGACCCTGAGAATGACAAAACAACGCGGCCGTCTACAACTTGGAATGTGGAG
AGCGGATATTCCAGAAGATTCTTCATTAACATCCTCAATCATGTGGAGAAGGTGGAAGACCCGGAAGCTGCTGGCATTCTATATTTCATTATGAGGAAGCTCGAT
AGTCGGCCGCACCTGTGCGTTCATAAGTTCTTTGTCCTGGACCCACTACAAATGCAAGTTCTTGCCGCTGCAGGTGGTCCCTACGCACGAATCAAGGGGAAGGTC
GTCCAGGACACGATCAATGCTTGGGACGAGTATAAGGAGTGCATGGATGCCGTGTTGGGTCTTGTGGAAGATTTCATTCCAGCCTGGGTGGACGTCGACGTAGTG
TACAGCCCGCTCTGTATCAAGGATCACTGGGTCCTGGCTGCGATGGATATGACCCAGTCCGAGATTTTTGTATACGACTCATTGCCAGGCCACATTTCCACGTCG
AAGTTGCTGACAGACATGCGGCCGTTGAGTCATACAATCTCATCGCTTTTGTATGCATGTGGGCTGATGGATACGGTCGATTGCAAGCTGAAGAGGACTCCTTGG
CGTGTATACCGTCCTACGACCGACACGAGGCAGAAAGAAGCAAACATGTCAACATCTTACGAGTCGGACGGTTCCTCTTCCGATGGAGCAACCAGTCCTATCTCA
AACCATCTCTCCTCCTTCGCGGAGAACGGAGTGCCAATTAACGGTCCTCTTCAACAAGCACGGGAAGAAAACGACCAGCTCAGGAGAGAGCTACGTCGAACGCAA
CACGAGCTCAACAACACGAGGTATAAGTTAGCCCGGGTTGAAGAAATGCGGGACTTGCTGGAGGGACTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAG
GACAGGGTGGATCAGTTACTGGCCCGTCTACGCTGA
Protein sequence
MPRVFISFGEEWKDIEKDYVGGRTRGLTVDSKITYAEFLGHVCRLSSINRLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYISTVPKKSVSSV
LVVEPLFFPPTTPYFGHIGHDIASLAPLGSNVVPCNLGDDRAYDWDVPGLWNGSENVDEDSDESYRLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMA
EYPVDAVHETTSNRLTGQSEADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKKSTPELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAI
HSNCNGALMKQDHRQAKSCVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKVWRSSEEALRLIRGDPASSYGLLSAYGEASVVDTVQNLVFISDRHAVI
CKAIDERWFYEHRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAIAAASSRSINPYTLCDEAYTV
NSWMLAYAEPIFPVGSSTTWKSSPGFVNIDVQPPKKVVRVGRRQTVTIPSTGEVRPPRKCSRCGLSKKGKIIRNQRQGNESIETSQRAKNEIEKYSEHERCTMDH
QLRIKENDRFPAQATSMSHLSNVSRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSREVAGSSDDSMSFLIGGNVLTFSKDQFMLITGLWRLPGKV
VQKKIGKNRLRRKYFSNEASMLLEEFVESKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRTVNGLKRAMNGNVALYKNKVINTYAYSLSLFVKTFFRCETKLKF
YIRVWIYEVVPSLITPGVNRLSETAIPRIIRYSCSRVVGTKDLGEEVIGSAVLVISYPLVETELDKDYQRCPLDEREVVDLTAPGCSTSDSDDGHNPLPITDNLG
AEDDLPLDDAHSLETNVQAMPDESSDMPRTKAASEGGQRTSVEVLRRSTSIRSNVGQSPRQSPRATSRAACPTQRHDTRRSNDRFGAMERRLDLLASDMAEVKTD
LAEVKSDLSEMKLLLQRLCQIDRREVNIGVSPSDTVQVSHPLVSNVIPEHDGDADDHQPGGSEGGKEDDVVPVEASLHEKATDGVEMTIPPSNLGDAELANPACI
VNSVELDVAVVTPIVSTEMVELEMAPPIVQDPQSHTTTSDPTFEPPASTNIDGPCGMIHGPRQAEHIELALTPADTSPTTQSARKIEVSYPDETRRTERKRTETK
PFSLEDTRRQKKKQKVVDVDPVPASQVRSFRPKYNPLHNFPDAKFREMMCWIRDPENDKTTRPSTTWNVESGYSRRFFINILNHVEKVEDPEAAGILYFIMRKLD
SRPHLCVHKFFVLDPLQMQVLAAAGGPYARIKGKVVQDTINAWDEYKECMDAVLGLVEDFIPAWVDVDVVYSPLCIKDHWVLAAMDMTQSEIFVYDSLPGHISTS
KLLTDMRPLSHTISSLLYACGLMDTVDCKLKRTPWRVYRPTTDTRQKEANMSTSYESDGSSSDGATSPISNHLSSFAENGVPINGPLQQAREENDQLRRELRRTQ
HELNNTRYKLARVEEMRDLLEGLLKEEKEERLRLEDRVDQLLARLR