; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc07g00810 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc07g00810
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
DescriptionMuDRA-like transposase
Genome locationchr7:534107..543669
RNA-Seq ExpressionMoc07g00810
SyntenyMoc07g00810
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR003653 - Ulp1 protease family, C-terminal catalytic domain
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR038765 - Papain-like cysteine peptidase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]6.5e-18865.29Show/hide
Query:  EGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKK
        EGDDE EY NE+ SDRLDVQHEHE+VTIHNTMAEYPV+ VHE ASNR+TGQSE DRLQAMVQS  T+DVKE DV+DSKKELV+KMHLLALRKNFQF+VKK
Subjt:  EGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKK

Query:  STLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSS
        ST +LYL+RC+DPTCTWRLR TKIRDCNLFKIKKYIAVHS CN A+MKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVNMSYDKAWRSS
Subjt:  STLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSS

Query:  EEALRLIKGDPTSSYGLLPACGKA----------------------------------------------------------------------------
        EEALRLI+GDP SSY LLPA G+A                                                                            
Subjt:  EEALRLIKGDPTSSYGLLPACGKA----------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVRE
                                                                                   AAKAFRESYFNENWVQLCAHPGVRE
Subjt:  ---------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVRE

Query:  YLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF
        YLEAIGKERWA CF TKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAEEMIAEA DNARRHIVMNIDQFNF
Subjt:  YLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTCREFDYFKVPCSHAI AA SRS N YTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNS

XP_022142677.1 uncharacterized protein LOC111012733 [Momordica charantia]1.8e-14561.57Show/hide
Query:  MAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFK
        MAEYPV+ VHE A+NR+TGQSE DRLQAMVQS GT+DVKEGDV+DSKKELV+KMH  ALRKNFQFRVKKST ELYLLRC+DPTCTWRLRATKIRDCNLFK
Subjt:  MAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFK

Query:  IKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-------
        IKKYIAVHS CN A+MKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVNMSYDKAWR SEEALRLI+GDP SSY LLPA G+A       
Subjt:  IKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESV
                                                    AAKAFRESYFNENWVQLCAHP VREYLEAIGKERWA CF TKLRYSQMTTNIAESV
Subjt:  --------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
        NALFRHARKLPVTALLDHIR VLQRWFYERRTLASSRQSTLSDYAEEMI+EASDNARRHIVMNIDQFNFE+
Subjt:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

XP_022145820.1 uncharacterized protein LOC111015181 [Momordica charantia]3.5e-14955.43Show/hide
Query:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        ++KNFQF+VKKSTLELY+LRCV   CTWRLRATK+++C LFKIKKY A H+ C    +K DHRQAKSWVVGHLVQ KFTDVSRTYRPK+IIQD+R+EYGV
Subjt:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-----------------------------------------------------------------
        N+SYD+A RSSEEALRLI+GDP SSYGLLPA G+A                                                                 
Subjt:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYL
                                                                                 AAKA+RESYFN  W QL A+PGVREYL
Subjt:  -------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYL

Query:  EAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
        + IGKERWA CF T+LRY+QMTTNIAESVN LFRHARKLPVTALLDHIRG LQ WFY+RRTLA+SR +TLSDYAE M AE S++ RRH+V NIDQF+F+V
Subjt:  EAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

Query:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI
        +D NL+G VDL + TC CREFDYFK+PCSHAI AA  R+ N Y+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+PPK V RVGRR+T RI
Subjt:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI

Query:  PSTGEVRPPRKCSRCG
        PSTGEVR  RKC RCG
Subjt:  PSTGEVRPPRKCSRCG

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]2.2e-16751.71Show/hide
Query:  EEGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +       V +   + LTGQ   + LQ +VQS GTNDVKEG+V+D+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVK

Query:  KSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRS
        KST ELY+L CVD +CTWRLRATK+RDCNLFKIKKY ++H+ CN  ++KQDHRQAKSWVVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN+SYDKAWRS
Subjt:  KSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRS

Query:  SEEALRLIKGDPTSSYGLLPACGKA---------------------------------------------------------------------------
        SEEALRLI+GDP SSYGLLP  G+A                                                                           
Subjt:  SEEALRLIKGDPTSSYGLLPACGKA---------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPG
                                                                                      AAKA+RESYFN  W QL A+PG
Subjt:  ------------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPG

Query:  VREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ
        VREYL+ IGKERWA CF T+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG+LQ WFY+RRTLASSR +TLS YAE  +AE SDNARRH+V+NIDQ
Subjt:  VREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAI  A  R+ N YTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

Query:  QTVRIPSTGEVRPPRKCSRCGLSKKRKIIRNRRQGNESIETS
        +TVRIPSTGEVR  RKC RCG S       N +  NE + T+
Subjt:  QTVRIPSTGEVRPPRKCSRCGLSKKRKIIRNRRQGNESIETS

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia]3.8e-12753.29Show/hide
Query:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        +RKNFQF+VKKSTLELY+LRCV   CTWRLRATK+++C LFKI KY A H+ C    +K DHRQ KSWVVGHLVQ KFTDVSRTYRPKDIIQD+R EYGV
Subjt:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA----------AAKAFRESYFNENWVQL---------CAHP--------------GVREYLEAIGK
        N+SYD+AWRSSEEALRLI+GDP SSYGLLPA G+A            K     YF   ++ L         C  P              GV   L A G 
Subjt:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA----------AAKAFRESYFNENWVQL---------CAHP--------------GVREYLEAIGK

Query:  E-------------------RWA--------------------------------------HCFNTK---------------------------LRYSQM
        +                    W                                       HCF T+                            R S  
Subjt:  E-------------------RWA--------------------------------------HCFNTK---------------------------LRYSQM

Query:  TTNIAE------SVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT
         +  A+      S+NALFRH RKLPVTALLDHIRG LQ WFY+RRTLA+SR +TLSDYAE M AE SD+ARRH+V NIDQF+F+VRDGNL+G VDL +  
Subjt:  TTNIAE------SVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT

Query:  CTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC
        C+CREFDYFK+PCSHAI AA  R+ N Y+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+PPK V RVGRR+TVRIPSTGEVR  RKC RC
Subjt:  CTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC

Query:  G
        G
Subjt:  G

TrEMBL top hitse value%identityAlignment
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like3.2e-18865.29Show/hide
Query:  EGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKK
        EGDDE EY NE+ SDRLDVQHEHE+VTIHNTMAEYPV+ VHE ASNR+TGQSE DRLQAMVQS  T+DVKE DV+DSKKELV+KMHLLALRKNFQF+VKK
Subjt:  EGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKK

Query:  STLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSS
        ST +LYL+RC+DPTCTWRLR TKIRDCNLFKIKKYIAVHS CN A+MKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVNMSYDKAWRSS
Subjt:  STLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSS

Query:  EEALRLIKGDPTSSYGLLPACGKA----------------------------------------------------------------------------
        EEALRLI+GDP SSY LLPA G+A                                                                            
Subjt:  EEALRLIKGDPTSSYGLLPACGKA----------------------------------------------------------------------------

Query:  ---------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVRE
                                                                                   AAKAFRESYFNENWVQLCAHPGVRE
Subjt:  ---------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVRE

Query:  YLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF
        YLEAIGKERWA CF TKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAEEMIAEA DNARRHIVMNIDQFNF
Subjt:  YLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTCREFDYFKVPCSHAI AA SRS N YTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNS

A0A6J1CNJ2 uncharacterized protein LOC1110127338.7e-14661.57Show/hide
Query:  MAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFK
        MAEYPV+ VHE A+NR+TGQSE DRLQAMVQS GT+DVKEGDV+DSKKELV+KMH  ALRKNFQFRVKKST ELYLLRC+DPTCTWRLRATKIRDCNLFK
Subjt:  MAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFK

Query:  IKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-------
        IKKYIAVHS CN A+MKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDI+QDIREEYGVNMSYDKAWR SEEALRLI+GDP SSY LLPA G+A       
Subjt:  IKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESV
                                                    AAKAFRESYFNENWVQLCAHP VREYLEAIGKERWA CF TKLRYSQMTTNIAESV
Subjt:  --------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESV

Query:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
        NALFRHARKLPVTALLDHIR VLQRWFYERRTLASSRQSTLSDYAEEMI+EASDNARRHIVMNIDQFNFE+
Subjt:  NALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

A0A6J1CVL4 uncharacterized protein LOC1110151811.7e-14955.43Show/hide
Query:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        ++KNFQF+VKKSTLELY+LRCV   CTWRLRATK+++C LFKIKKY A H+ C    +K DHRQAKSWVVGHLVQ KFTDVSRTYRPK+IIQD+R+EYGV
Subjt:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-----------------------------------------------------------------
        N+SYD+A RSSEEALRLI+GDP SSYGLLPA G+A                                                                 
Subjt:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA-----------------------------------------------------------------

Query:  -------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYL
                                                                                 AAKA+RESYFN  W QL A+PGVREYL
Subjt:  -------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPGVREYL

Query:  EAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV
        + IGKERWA CF T+LRY+QMTTNIAESVN LFRHARKLPVTALLDHIRG LQ WFY+RRTLA+SR +TLSDYAE M AE S++ RRH+V NIDQF+F+V
Subjt:  EAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEV

Query:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI
        +D NL+G VDL + TC CREFDYFK+PCSHAI AA  R+ N Y+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+PPK V RVGRR+T RI
Subjt:  RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI

Query:  PSTGEVRPPRKCSRCG
        PSTGEVR  RKC RCG
Subjt:  PSTGEVRPPRKCSRCG

A0A6J1DJT1 uncharacterized protein LOC1110207151.1e-16751.71Show/hide
Query:  EEGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +       V +   + LTGQ   + LQ +VQS GTNDVKEG+V+D+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREY-NEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQFRVK

Query:  KSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRS
        KST ELY+L CVD +CTWRLRATK+RDCNLFKIKKY ++H+ CN  ++KQDHRQAKSWVVGHLVQ+KFTDVSRTYRPKDIIQD+R+EYGVN+SYDKAWRS
Subjt:  KSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRS

Query:  SEEALRLIKGDPTSSYGLLPACGKA---------------------------------------------------------------------------
        SEEALRLI+GDP SSYGLLP  G+A                                                                           
Subjt:  SEEALRLIKGDPTSSYGLLPACGKA---------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPG
                                                                                      AAKA+RESYFN  W QL A+PG
Subjt:  ------------------------------------------------------------------------------AAKAFRESYFNENWVQLCAHPG

Query:  VREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ
        VREYL+ IGKERWA CF T+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG+LQ WFY+RRTLASSR +TLS YAE  +AE SDNARRH+V+NIDQ
Subjt:  VREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQ

Query:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        F+ +VRDGNL+G VD  S+TC CREFDYFK+PCSHAI  A  R+ N YTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR
Subjt:  FNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

Query:  QTVRIPSTGEVRPPRKCSRCGLSKKRKIIRNRRQGNESIETS
        +TVRIPSTGEVR  RKC RCG S       N +  NE + T+
Subjt:  QTVRIPSTGEVRPPRKCSRCGLSKKRKIIRNRRQGNESIETS

A0A6J1DYC4 uncharacterized protein LOC1110256781.8e-12753.29Show/hide
Query:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV
        +RKNFQF+VKKSTLELY+LRCV   CTWRLRATK+++C LFKI KY A H+ C    +K DHRQ KSWVVGHLVQ KFTDVSRTYRPKDIIQD+R EYGV
Subjt:  LRKNFQFRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGV

Query:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA----------AAKAFRESYFNENWVQL---------CAHP--------------GVREYLEAIGK
        N+SYD+AWRSSEEALRLI+GDP SSYGLLPA G+A            K     YF   ++ L         C  P              GV   L A G 
Subjt:  NMSYDKAWRSSEEALRLIKGDPTSSYGLLPACGKA----------AAKAFRESYFNENWVQL---------CAHP--------------GVREYLEAIGK

Query:  E-------------------RWA--------------------------------------HCFNTK---------------------------LRYSQM
        +                    W                                       HCF T+                            R S  
Subjt:  E-------------------RWA--------------------------------------HCFNTK---------------------------LRYSQM

Query:  TTNIAE------SVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT
         +  A+      S+NALFRH RKLPVTALLDHIRG LQ WFY+RRTLA+SR +TLSDYAE M AE SD+ARRH+V NIDQF+F+VRDGNL+G VDL +  
Subjt:  TTNIAE------SVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT

Query:  CTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC
        C+CREFDYFK+PCSHAI AA  R+ N Y+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+PPK V RVGRR+TVRIPSTGEVR  RKC RC
Subjt:  CTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC

Query:  G
        G
Subjt:  G

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase1.2e-0621.74Show/hide
Query:  LLPACGKAAAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLAS-
        L+   G ++ K   +SY  E   +   +P   ++L+     +WA   +   RY  M  +  E++ A+ +  RK+ +   +  + G L+  F E   L+  
Subjt:  LLPACGKAAAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLAS-

Query:  --SRQSTLSDYAEEMIAEASDNARRHIV----MNIDQFNFEV-----------RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCD
                +++  E + E   ++   ++    +  D +   +            + + +G V L   TCTC EF   K PC HA+        N     D
Subjt:  --SRQSTLSDYAEEMIAEASDNARRHIV----MNIDQFNFEV-----------RDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCD

Query:  EAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN----IDVQPPKKVVRVGRRQ
        + YTV  +   Y+    PV   S W  + G       +   PP KV   G+ +
Subjt:  EAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN----IDVQPPKKVVRVGRRQ

AT1G64255.1 MuDR family transposase1.6e-0625.62Show/hide
Query:  HPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQS-TLSDYAEEMIAEAS
        +P  R++L+   + RWA   +   RY  M  N      ALF          H     V  L D +R    + F      + SR S    D   E + +  
Subjt:  HPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQS-TLSDYAEEMIAEAS

Query:  DNAR------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKS
        +  R       +IV  +D   F+V      G+  V L   +CTC +F  +K PC HA+        N     D+ YT+      YA     V   S W  
Subjt:  DNAR------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKS

Query:  SPG
        + G
Subjt:  SPG

AT1G64260.1 MuDR family transposase3.4e-0926.03Show/hide
Query:  LLPACGKAAAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRT----
        L+   G    K   +SY N+   +   +P   ++L+ I + +WA   ++ LRY      I     ALF   R  P   +   + G +   F E R+    
Subjt:  LLPACGKAAAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRT----

Query:  LASSRQSTLSD---YAE---EMIAEASDNARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVN
          SS  S+L+    Y E   + + E   ++  +++  +++ +F+V + +   +  V L   TCTCR+F  +K PC HA+        N     DE YTV 
Subjt:  LASSRQSTLSD---YAE---EMIAEASDNARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVN

Query:  SWMLAYAEPIFPVGSSSTW
         +   YA    PV   + W
Subjt:  SWMLAYAEPIFPVGSSSTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTAT
GCTGAATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAATG
GAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAA
CCTTACATGCCTTCTTTCCCATATTATTTAGGCCAACACGTGTCCAATGTTCCTATTCCCTCAGCTTGTGCCCCCCCATTTGCAAAACCCTTATTTCCGAGACCC
TCATTTTCAAGTCCGTCAGTTCCGCCCTCGTCGTCGAACCCCTCTTCTTCCCGCCCACCACCCCCCTACTTTGGTCATATTGGTCATGATATAACATCTCTCGCA
CCGTTAGGGTCAAATGTTGTTCCTTGTAATTTGGGTGATGATAGGGCATATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGT
GATGAATCATATCGTCTAATGACCGACACGGAAGAAGGAGACGATGAAAGGGAATATAATGAGCACGTCAGTGATCGACTTGATGTGCAACATGAGCATGAAGAG
GTAACAATTCATAATACAATGGCTGAATATCCTGTAAATGTCGTCCATGAAACGGCAAGCAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATG
GTCCAATCGGACGGGACCAATGATGTTAAGGAGGGTGATGTATACGACTCAAAGAAGGAACTAGTCATTAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAG
TTTCGAGTGAAGAAGTCTACGTTGGAACTATACTTGCTGCGATGCGTCGATCCTACTTGCACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTT
AAGATAAAGAAATATATCGCAGTCCATTCTAATTGCAATGATGCCCTTATGAAACAGGATCATCGTCAGGCGAAGAGTTGGGTGGTCGGTCATCTAGTACAATCA
AAGTTTACTGATGTTTCTCGCACGTACAGGCCGAAGGACATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGCCTGGCGTTCGAGC
GAAGAAGCACTCCGACTTATCAAAGGGGATCCAACTTCATCGTACGGGCTGCTACCCGCTTGTGGGAAAGCCGCTGCGAAGGCATTTCGCGAGTCATATTTCAAT
GAGAACTGGGTCCAACTGTGCGCACACCCAGGAGTGAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCACTGCTTTAACACGAAACTAAGATACTCA
CAAATGACCACCAATATTGCAGAGTCCGTTAATGCCCTTTTCAGGCATGCACGTAAGTTACCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGG
TGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCGGATAATGCACGGAGACACATT
GTTATGAACATCGACCAGTTTAATTTTGAGGTACGCGATGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGAGTTCGATTATTTT
AAAGTCCCGTGCTCCCATGCTATTACTGCAGCCTGTTCTCGTAGCACAAATCTGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCGTAT
GCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGG
CGACAGACGGTGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGGCTTTCGAAGAAGAGGAAAATAATACGTAATCGGCGCCAG
GGAAATGAATCGATTGAGACTAGCCAAATAGCCAAAAATGAGATCAAGAAATACAGTGAACATGAAAGGTGCACAATGGACCATCAGTTGAGGATTGAGGAGAAT
GACCGCTTTCCGGCTCAAGCTACCAGCATGTCTCACTTAAGCAATTTCAACAGGCTTATCAAAGATAAACTCACAGCGGACCAACTAGATATGTTCCGTAGAAGA
ACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGCTGCCCGATAAGGTGGTTCAGAAAAAGATT
GGAAAGAATAGGTTGCAGAGGAAGTACTTCAACAATGAAGCATCCATGCTGCTCAAAGAGTTTGTGGAGGTTTACAAGCAGACTGATTTCGAGGATGACGAGGAC
GCCATTAAAGTGACATTAATTTTGTACACGGAGCTTGTGATGATGGGAAAGAGCAAGAGCAAGTCGAAGGTTGACATCGACTTGTACAACCAAGTCGATGACTTG
GACTACTTCAACCATTTGGACTGGGGTTCTGATGTCTGGAGTAGGATAGTTAACGGTCTGAAGCGTGCGATGAATGGTAAAGTTGCGCTATACAAGAACAAAGTA
AGAACGAACAAAAAGTATCTAGTAAAGTATAGCCTACCGGGATTTCCGCTTGCGTTTCAGTTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTC
AATCGTTTGAGCAAGACCGCCATTCCTCGGATAATTCGGTATTCGTGCAGTAGAGTCGTCGGTACAAAAGATCTGGGGGAGGAGGTCCTTCGTTCAGCGGTGTTG
GTCATATCTTATCCACTCGTGGAGACGGAGCTGGATAAGGACTACCAGCGGTGTCCATTGGACGAAAGAGAGGTGGTGGATTTAACTGCGCCTGGGTGTTCCACC
TCCGACAGTGATGATGGACACAATCCTTCCCCCATCACCGACAATCTTGGCACCGAAGACGATCTCCCACTCGACGATGCGCATTCGTTGGAAACGAATGTACAG
GCAATGTCGGACGAGTCTTCGGATATGCCACGTATAGAGGCCGCATCTGAAGGTAGGCAACGGACACCGGTCGAAGTACTTCGACGAAGTACTTCTATTCGGTCG
AATGTGGGGCAAAGCCCGCGGCAATCACCGCGAGCGACGTCACGCGCTGCTTGCCCCACACAACGGCACGACACCCGTCGATCGAGTGATAGATTCGGGGTTATG
GAGAGAAGGCTAGATCTTTTATCTTCGGACATTGCGGAGGTGAAGACGGATTTGGTGGAGGTGAAGACGGGTTTGGCAGAGGTGAAGACGGGTTTGGCGGAGGTC
AAGTCCGACTTGAGTGAAATGAAACTCCGGCTTCAACGGTTGTGCCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCGGATACAGTCCAAGTGTCA
CATCCATTGGTATCTAATGTTATCCCCGAGCATGATGGGGATGCTGATGACCACCAACCTGGAGGTTCTGAGGGTGGGAAGGAAGATGATGTGGTTCCTGTAGAA
GCGTTGTTGCATGAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTAGCCAACCCCGCATGTATTGTCGATTCGGTG
GAGTTGGACGTTACAGTGGTGACACCCATTGTTTCGACAGAGATGGTGGAACTCGAAATGGCACCGCCAATAGTAGAGGATCCACAATCCCATACGACGACATCC
GATCAACCCTTCGAGCCTCCTGCCTCAACCAACATTGATGGTCTGTGTGGCATGATCCATAGGCCTCGTCAAGCCGAGCATATTGAGTTGGCCCTTACACCAGCT
GATACGAGCCCCACAACTCAGTCGGCCAGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAGGACCGAGCGTAAGCGGACGGAAACGAAACCATTCAGTCCG
GAGGACACGCGTCGGCAGAAGAAGAAGCAGAAGGTGGTGGATGTGGATCCCGTACCTGCCAGCCAGGTTAGGCCATTCCGTCCCAAATACAACCCGTTGCATAAC
TTTCCGGATGCCAAGTTTAGGGAAATGATGCGTTGGGTACGGGACCCTGAGAATGACAAAACAACGCGGCCGTCTACAACTTGGAATGTGGAGAGTGGATATTCC
AGAAGATTCTTCATTAACATCCTCAATCCTGTGGAGAAGGTGGAAGACCCGGAAGCTGCTGGCATTCTATATTTCATTATGAGGAAGCTCGGTAGTCGGCCGCAC
CTGTGCGTTCATAAGTTCTCTGTCCTGGACCCACTACAAATGCAAGTTCTTGCCGCTACAGGTGGTCCCTACGCACGAATCAAGGGGAAGGTCGTCCAGGACACG
ATCAATGCTTGGGACGAGTATAAGGAGTGCATGGATGCCGTGCTGGGTCTTGTGGAAGATTTCATTCCAGCCTGGGTGGACGTTGACGTAGTGTACAGCCCGCTC
TGTATCAAGGATCACTGGGTCCTGGTTGCGATGGATATGACCCAGTCCGAGATTTTTGTATACGACTCATTGTCAGGCCACATTTCCACGTCGAAGTTGCTGACA
GACATGCGGCCGTTGAGTCATACAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGCTGATTGCAAGCTGAAGAGGACTCCTTGGCGTGTATACCGT
CCTACGACCGACACGAGGCAGAAAGAAGCAAACATGTCAACATCTTACGAGTCGGACGATTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCTCTCC
TCCTTCGCGGAGAGCGGAGTGCCAATTAACGGTCCTTTTCAACAAACACGGGAAGAAAACGACCAGCTTAGGAGAGAGCTACGTCGAACGCAACACGAGCTCAAC
AACACGAGGTATAAGTTAGCCCGGGTTGAAGAAATGCGGGACTTGCTGGAGGGACTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAGGACAGGGTGGAT
CAGTTACTGGCTCGTCTACGCCGATACCATGTACGCTCTCGTCAAGCTCATGCACGCTCTAGCTGTACGCTCTCGATCAAGCTCATGCACACTCTCGTTATCGAT
CACGATCAACCTCATTCACGCCCTCGTCAAGCTCATGCACGCTCTCGCTGTACGATCTCGATCAAGCCCATGCACGTTCTAGATGTACGCTCTCGTCAAGCTCAT
GCACGCTATCGCTGTACGCTCTCGATCAAGCTCATGCACGCTCTTGATCGCGATGACGACCAACCTCATTCACGCTCTCGCTGTACGCTCTCGATGAAGCTCATG
CACGTTCTCGATCTCGATCAGATCTCGCTCTCGCTCATGAACGCTCTCGCTCTCGCTAATGAACGATCTCGATCTATAACATACTCCAAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTCTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTAT
GCTGAATTTCTAGGACATGTATGTAGGCTAAGTAGTATAAATCCATTACAGGAAGATATCATAATTAGACGTGTATATAATTTTAAGGCGAAGGTTTGTGTAATG
GAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCCGCTATACATATCTACCGTGCCAAAGAAGGTACATCAGAATGAA
CCTTACATGCCTTCTTTCCCATATTATTTAGGCCAACACGTGTCCAATGTTCCTATTCCCTCAGCTTGTGCCCCCCCATTTGCAAAACCCTTATTTCCGAGACCC
TCATTTTCAAGTCCGTCAGTTCCGCCCTCGTCGTCGAACCCCTCTTCTTCCCGCCCACCACCCCCCTACTTTGGTCATATTGGTCATGATATAACATCTCTCGCA
CCGTTAGGGTCAAATGTTGTTCCTTGTAATTTGGGTGATGATAGGGCATATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGT
GATGAATCATATCGTCTAATGACCGACACGGAAGAAGGAGACGATGAAAGGGAATATAATGAGCACGTCAGTGATCGACTTGATGTGCAACATGAGCATGAAGAG
GTAACAATTCATAATACAATGGCTGAATATCCTGTAAATGTCGTCCATGAAACGGCAAGCAATAGACTCACCGGTCAGTCAGAAGCTGATAGATTGCAAGCCATG
GTCCAATCGGACGGGACCAATGATGTTAAGGAGGGTGATGTATACGACTCAAAGAAGGAACTAGTCATTAAAATGCATTTACTTGCATTGCGGAAGAACTTTCAG
TTTCGAGTGAAGAAGTCTACGTTGGAACTATACTTGCTGCGATGCGTCGATCCTACTTGCACGTGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTT
AAGATAAAGAAATATATCGCAGTCCATTCTAATTGCAATGATGCCCTTATGAAACAGGATCATCGTCAGGCGAAGAGTTGGGTGGTCGGTCATCTAGTACAATCA
AAGTTTACTGATGTTTCTCGCACGTACAGGCCGAAGGACATCATCCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGCCTGGCGTTCGAGC
GAAGAAGCACTCCGACTTATCAAAGGGGATCCAACTTCATCGTACGGGCTGCTACCCGCTTGTGGGAAAGCCGCTGCGAAGGCATTTCGCGAGTCATATTTCAAT
GAGAACTGGGTCCAACTGTGCGCACACCCAGGAGTGAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCACTGCTTTAACACGAAACTAAGATACTCA
CAAATGACCACCAATATTGCAGAGTCCGTTAATGCCCTTTTCAGGCATGCACGTAAGTTACCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGG
TGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCGAAGCTTCGGATAATGCACGGAGACACATT
GTTATGAACATCGACCAGTTTAATTTTGAGGTACGCGATGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGAGTTCGATTATTTT
AAAGTCCCGTGCTCCCATGCTATTACTGCAGCCTGTTCTCGTAGCACAAATCTGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCGTAT
GCAGAACCAATATTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGG
CGACAGACGGTGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGGCTTTCGAAGAAGAGGAAAATAATACGTAATCGGCGCCAG
GGAAATGAATCGATTGAGACTAGCCAAATAGCCAAAAATGAGATCAAGAAATACAGTGAACATGAAAGGTGCACAATGGACCATCAGTTGAGGATTGAGGAGAAT
GACCGCTTTCCGGCTCAAGCTACCAGCATGTCTCACTTAAGCAATTTCAACAGGCTTATCAAAGATAAACTCACAGCGGACCAACTAGATATGTTCCGTAGAAGA
ACAATATTTGGTCGATTTGTCGACTTGGAGATGATGTTCTGCAGTGGTGTAGTTCATCACTTTCTGTCAAGGGAGCTGCCCGATAAGGTGGTTCAGAAAAAGATT
GGAAAGAATAGGTTGCAGAGGAAGTACTTCAACAATGAAGCATCCATGCTGCTCAAAGAGTTTGTGGAGGTTTACAAGCAGACTGATTTCGAGGATGACGAGGAC
GCCATTAAAGTGACATTAATTTTGTACACGGAGCTTGTGATGATGGGAAAGAGCAAGAGCAAGTCGAAGGTTGACATCGACTTGTACAACCAAGTCGATGACTTG
GACTACTTCAACCATTTGGACTGGGGTTCTGATGTCTGGAGTAGGATAGTTAACGGTCTGAAGCGTGCGATGAATGGTAAAGTTGCGCTATACAAGAACAAAGTA
AGAACGAACAAAAAGTATCTAGTAAAGTATAGCCTACCGGGATTTCCGCTTGCGTTTCAGTTGTGGATATACGAGGTTGTCCCATCTCTCATCACTCCCGGTGTC
AATCGTTTGAGCAAGACCGCCATTCCTCGGATAATTCGGTATTCGTGCAGTAGAGTCGTCGGTACAAAAGATCTGGGGGAGGAGGTCCTTCGTTCAGCGGTGTTG
GTCATATCTTATCCACTCGTGGAGACGGAGCTGGATAAGGACTACCAGCGGTGTCCATTGGACGAAAGAGAGGTGGTGGATTTAACTGCGCCTGGGTGTTCCACC
TCCGACAGTGATGATGGACACAATCCTTCCCCCATCACCGACAATCTTGGCACCGAAGACGATCTCCCACTCGACGATGCGCATTCGTTGGAAACGAATGTACAG
GCAATGTCGGACGAGTCTTCGGATATGCCACGTATAGAGGCCGCATCTGAAGGTAGGCAACGGACACCGGTCGAAGTACTTCGACGAAGTACTTCTATTCGGTCG
AATGTGGGGCAAAGCCCGCGGCAATCACCGCGAGCGACGTCACGCGCTGCTTGCCCCACACAACGGCACGACACCCGTCGATCGAGTGATAGATTCGGGGTTATG
GAGAGAAGGCTAGATCTTTTATCTTCGGACATTGCGGAGGTGAAGACGGATTTGGTGGAGGTGAAGACGGGTTTGGCAGAGGTGAAGACGGGTTTGGCGGAGGTC
AAGTCCGACTTGAGTGAAATGAAACTCCGGCTTCAACGGTTGTGCCAGATCGATAGGCGAGAGGTGAATATTGGTGTCTCTCCGTCGGATACAGTCCAAGTGTCA
CATCCATTGGTATCTAATGTTATCCCCGAGCATGATGGGGATGCTGATGACCACCAACCTGGAGGTTCTGAGGGTGGGAAGGAAGATGATGTGGTTCCTGTAGAA
GCGTTGTTGCATGAAAAGGCAACGGATGGAGTAGAGATGACCATACCCCCATCCGATCTTGGAGATGCAGAACTAGCCAACCCCGCATGTATTGTCGATTCGGTG
GAGTTGGACGTTACAGTGGTGACACCCATTGTTTCGACAGAGATGGTGGAACTCGAAATGGCACCGCCAATAGTAGAGGATCCACAATCCCATACGACGACATCC
GATCAACCCTTCGAGCCTCCTGCCTCAACCAACATTGATGGTCTGTGTGGCATGATCCATAGGCCTCGTCAAGCCGAGCATATTGAGTTGGCCCTTACACCAGCT
GATACGAGCCCCACAACTCAGTCGGCCAGGAAAATCGAGGTTTCGTATCCCGACGAAACAAGAAGGACCGAGCGTAAGCGGACGGAAACGAAACCATTCAGTCCG
GAGGACACGCGTCGGCAGAAGAAGAAGCAGAAGGTGGTGGATGTGGATCCCGTACCTGCCAGCCAGGTTAGGCCATTCCGTCCCAAATACAACCCGTTGCATAAC
TTTCCGGATGCCAAGTTTAGGGAAATGATGCGTTGGGTACGGGACCCTGAGAATGACAAAACAACGCGGCCGTCTACAACTTGGAATGTGGAGAGTGGATATTCC
AGAAGATTCTTCATTAACATCCTCAATCCTGTGGAGAAGGTGGAAGACCCGGAAGCTGCTGGCATTCTATATTTCATTATGAGGAAGCTCGGTAGTCGGCCGCAC
CTGTGCGTTCATAAGTTCTCTGTCCTGGACCCACTACAAATGCAAGTTCTTGCCGCTACAGGTGGTCCCTACGCACGAATCAAGGGGAAGGTCGTCCAGGACACG
ATCAATGCTTGGGACGAGTATAAGGAGTGCATGGATGCCGTGCTGGGTCTTGTGGAAGATTTCATTCCAGCCTGGGTGGACGTTGACGTAGTGTACAGCCCGCTC
TGTATCAAGGATCACTGGGTCCTGGTTGCGATGGATATGACCCAGTCCGAGATTTTTGTATACGACTCATTGTCAGGCCACATTTCCACGTCGAAGTTGCTGACA
GACATGCGGCCGTTGAGTCATACAATCCCATCGCTTTTGTACGCATGTGGGCTGATGGATACGGCTGATTGCAAGCTGAAGAGGACTCCTTGGCGTGTATACCGT
CCTACGACCGACACGAGGCAGAAAGAAGCAAACATGTCAACATCTTACGAGTCGGACGATTCCTCTTCCGACGGAGCAACCAGTCCTATCTCAAACCATCTCTCC
TCCTTCGCGGAGAGCGGAGTGCCAATTAACGGTCCTTTTCAACAAACACGGGAAGAAAACGACCAGCTTAGGAGAGAGCTACGTCGAACGCAACACGAGCTCAAC
AACACGAGGTATAAGTTAGCCCGGGTTGAAGAAATGCGGGACTTGCTGGAGGGACTGCTGAAGGAGGAGAAGGAGGAACGACTTCGTCTGGAGGACAGGGTGGAT
CAGTTACTGGCTCGTCTACGCCGATACCATGTACGCTCTCGTCAAGCTCATGCACGCTCTAGCTGTACGCTCTCGATCAAGCTCATGCACACTCTCGTTATCGAT
CACGATCAACCTCATTCACGCCCTCGTCAAGCTCATGCACGCTCTCGCTGTACGATCTCGATCAAGCCCATGCACGTTCTAGATGTACGCTCTCGTCAAGCTCAT
GCACGCTATCGCTGTACGCTCTCGATCAAGCTCATGCACGCTCTTGATCGCGATGACGACCAACCTCATTCACGCTCTCGCTGTACGCTCTCGATGAAGCTCATG
CACGTTCTCGATCTCGATCAGATCTCGCTCTCGCTCATGAACGCTCTCGCTCTCGCTAATGAACGATCTCGATCTATAACATACTCCAAGTAG
Protein sequenceShow/hide protein sequence
MSRVFISFGGEWKDIEKDYVGGRTRGLTVDSKITYAEFLGHVCRLSSINPLQEDIIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYISTVPKKVHQNE
PYMPSFPYYLGQHVSNVPIPSACAPPFAKPLFPRPSFSSPSVPPSSSNPSSSRPPPPYFGHIGHDITSLAPLGSNVVPCNLGDDRAYDWDVPGLWNGSENVDEDS
DESYRLMTDTEEGDDEREYNEHVSDRLDVQHEHEEVTIHNTMAEYPVNVVHETASNRLTGQSEADRLQAMVQSDGTNDVKEGDVYDSKKELVIKMHLLALRKNFQ
FRVKKSTLELYLLRCVDPTCTWRLRATKIRDCNLFKIKKYIAVHSNCNDALMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIIQDIREEYGVNMSYDKAWRSS
EEALRLIKGDPTSSYGLLPACGKAAAKAFRESYFNENWVQLCAHPGVREYLEAIGKERWAHCFNTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQR
WFYERRTLASSRQSTLSDYAEEMIAEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCREFDYFKVPCSHAITAACSRSTNLYTLCDEAYTVNSWMLAY
AEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGLSKKRKIIRNRRQGNESIETSQIAKNEIKKYSEHERCTMDHQLRIEEN
DRFPAQATSMSHLSNFNRLIKDKLTADQLDMFRRRTIFGRFVDLEMMFCSGVVHHFLSRELPDKVVQKKIGKNRLQRKYFNNEASMLLKEFVEVYKQTDFEDDED
AIKVTLILYTELVMMGKSKSKSKVDIDLYNQVDDLDYFNHLDWGSDVWSRIVNGLKRAMNGKVALYKNKVRTNKKYLVKYSLPGFPLAFQLWIYEVVPSLITPGV
NRLSKTAIPRIIRYSCSRVVGTKDLGEEVLRSAVLVISYPLVETELDKDYQRCPLDEREVVDLTAPGCSTSDSDDGHNPSPITDNLGTEDDLPLDDAHSLETNVQ
AMSDESSDMPRIEAASEGRQRTPVEVLRRSTSIRSNVGQSPRQSPRATSRAACPTQRHDTRRSSDRFGVMERRLDLLSSDIAEVKTDLVEVKTGLAEVKTGLAEV
KSDLSEMKLRLQRLCQIDRREVNIGVSPSDTVQVSHPLVSNVIPEHDGDADDHQPGGSEGGKEDDVVPVEALLHEKATDGVEMTIPPSDLGDAELANPACIVDSV
ELDVTVVTPIVSTEMVELEMAPPIVEDPQSHTTTSDQPFEPPASTNIDGLCGMIHRPRQAEHIELALTPADTSPTTQSARKIEVSYPDETRRTERKRTETKPFSP
EDTRRQKKKQKVVDVDPVPASQVRPFRPKYNPLHNFPDAKFREMMRWVRDPENDKTTRPSTTWNVESGYSRRFFINILNPVEKVEDPEAAGILYFIMRKLGSRPH
LCVHKFSVLDPLQMQVLAATGGPYARIKGKVVQDTINAWDEYKECMDAVLGLVEDFIPAWVDVDVVYSPLCIKDHWVLVAMDMTQSEIFVYDSLSGHISTSKLLT
DMRPLSHTIPSLLYACGLMDTADCKLKRTPWRVYRPTTDTRQKEANMSTSYESDDSSSDGATSPISNHLSSFAESGVPINGPFQQTREENDQLRRELRRTQHELN
NTRYKLARVEEMRDLLEGLLKEEKEERLRLEDRVDQLLARLRRYHVRSRQAHARSSCTLSIKLMHTLVIDHDQPHSRPRQAHARSRCTISIKPMHVLDVRSRQAH
ARYRCTLSIKLMHALDRDDDQPHSRSRCTLSMKLMHVLDLDQISLSLMNALALANERSRSITYSK