CuGenDBv2

Moc03g23700 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc03g23700
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr3:16769454..16774202
RNA-Seq Expression: Moc03g23700
Synteny: Moc03g23700
Gene Ontology terms:
GO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR004332 - Transposase, MuDR, plant
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
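
The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, and assuming the coordinates are 1-based and inclusive (the record does not state this), the string can be split and the genomic span computed as follows. The function names `parse_location` and `span_length` are hypothetical helpers introduced only for this illustration.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return chrom, start, end


def span_length(location: str) -> int:
    """Length of the span in bp, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1


chrom, start, end = parse_location("chr3:16769454..16774202")
length = span_length("chr3:16769454..16774202")
print(chrom, start, end, length)
```

Under that assumption the gene spans 4749 bp on chr3, which is longer than the CDS listed in the Sequences section.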


Homology
GenBank top hits (e value, % identity, alignment)
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] (e value 2.4e-194, 67.38% identity)
Query:  MGEGDDEGEYGNEYASDRLDV----------------------HMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRV
        MGEGDDE EYGNEYASDRLDV                       M+  RVTGQSEGDRLQAMVQSA TDDVKE DVFDSKKELVMKMH  ALRKNFQF+V
Subjt:  MGEGDDEGEYGNEYASDRLDV----------------------HMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRV

Query:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
        KKSTP+LYL+RCIDPTCTWRLR TKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
Subjt:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR

Query:  LSEEALRLIRGDPASSYVLLPAYGEA--------------------------------------------------------------------------
         SEEALRLIRGDPASSY LLPAYGEA                                                                          
Subjt:  LSEEALRLIRGDPASSYVLLPAYGEA--------------------------------------------------------------------------

Query:  -----------------------------------------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRV
                                                                   MNLLAKFKT  LE LFFKAAKAFRESYFNENWVQLCAHP V
Subjt:  -----------------------------------------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRV

Query:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQF
        REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIR VLQRWFYE RTLASSRQSTLSDYAEEMI+EA DNARRHIVMNIDQF
Subjt:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQF

Query:  NFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
        NFEV DGNLNGDVDLQSQTCTCR FDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
Subjt:  NFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS

XP_022142677.1 uncharacterized protein LOC111012733 [Momordica charantia] (e value 9.5e-167, 69.36% identity)
Query:  EYASDRLDVHMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKI
        EY  D +   M+  RVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKI
Subjt:  EYASDRLDVHMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKI

Query:  KKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEA--------
        KKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEA        
Subjt:  KKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEA--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVN
                                 MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVN
Subjt:  -------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVN

Query:  ALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV
        ALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFE+
Subjt:  ALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV

XP_022145820.1 uncharacterized protein LOC111015181 [Momordica charantia] (e value 2.0e-156, 56.9% identity)
Query:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV
        ++KNFQF+VKKST ELY+LRC+   CTWRLRATK+++C LFKIKKY A H+ C G  +K DHRQAKSWVVGHLVQ KFTDVSRTYRPK+I+QD+R+EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV

Query:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------
        N+SYD+A R SEEALRLIRGDPASSY LLPAYGEA+                                                                
Subjt:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------

Query:  ------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYL
                                                              NL+AKFK     +  LF KAAKA+RESYFN  W QL A+P VREYL
Subjt:  ------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYL

Query:  EAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV
        + IGKERWARCFQT+LRY+QMTTNIAESVN LFRHARKLPVTALLDHIR  LQ WFY+RRTLA+SR +TLSDYAE M +E S++ RRH+V NIDQF+F+V
Subjt:  EAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV

Query:  RDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI
        +D NL+G VDL + TC CR FDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+PPK V RVGRR+T RI
Subjt:  RDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI

Query:  PSTGEVRPPRKCSRCGTSGHNR
        PSTGEVR  RKC RCG  G ++
Subjt:  PSTGEVRPPRKCSRCGTSGHNR

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] (e value 8.6e-184, 55.84% identity)
Query:  EGDDEGEYGNEYASDRLD---------VHMSMKR------------VTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKS
        EGD E E+ N+   D LD         VH  + R            +TGQ   + LQ +VQS+GT+DVKEG+VFD+KKEL ++MH   +R NFQF+VKKS
Subjt:  EGDDEGEYGNEYASDRLD---------VHMSMKR------------VTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKS

Query:  TPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSE
        TPELY+L C+D +CTWRLRATK+RDCNLFKIKKY ++H+ CNG V+KQDHRQAKSWVVGHLVQ+KFTDVSRTYRPKDI+QD+R+EYGVN+SYDKAWR SE
Subjt:  TPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSE

Query:  EALRLIRGDPASSYVLLPAYGEA-----------------------------------------------------------------------------
        EALRLIRGDPASSY LLP YGEA                                                                             
Subjt:  EALRLIRGDPASSYVLLPAYGEA-----------------------------------------------------------------------------

Query:  --------------------------------------------------------MNLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVR
                                                                MNLLAKFK     LE LF KAAKA+RESYFN  W QL A+P VR
Subjt:  --------------------------------------------------------MNLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVR

Query:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFN
        EYL+ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIR +LQ WFY+RRTLASSR +TLS YAE  ++E SDNARRH+V+NIDQF+
Subjt:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFN

Query:  FEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQT
         +VRDGNL+G VD  S+TC CR FDYFK+PCSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR+T
Subjt:  FEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQT

Query:  VRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT
        VRIPSTGEVR  RKC RCGTSGHN KTC EPLNT
Subjt:  VRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia] (e value 1.8e-141, 51.92% identity)
Query:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV
        +RKNFQF+VKKST ELY+LRC+   CTWRLRATK+++C LFKI KY A H+ C G  +K DHRQ KSWVVGHLVQ KFTDVSRTYRPKDI+QD+R EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV

Query:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------
        N+SYD+AWR SEEALRLIRGDPASSY LLPAYGEA+                                                                
Subjt:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------

Query:  ---------------------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNE
                                                                             NL+AKFK     +E LF KAAKA+RESYFN 
Subjt:  ---------------------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNE

Query:  NWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNA
         W QL A+P                                 S+NALFRH RKLPVTALLDHIR  LQ WFY+RRTLA+SR +TLSDYAE M +E SD+A
Subjt:  NWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNA

Query:  RRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQP
        RRH+V NIDQF+F+VRDGNL+G VDL +  C+CR FDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+P
Subjt:  RRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQP

Query:  PKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        PK V RVGRR+TVRIPSTGEVR  RKC RCG  GHNRKTC EPL T+
Subjt:  PKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like (e value 1.2e-194, 67.38% identity)
Query:  MGEGDDEGEYGNEYASDRLDV----------------------HMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRV
        MGEGDDE EYGNEYASDRLDV                       M+  RVTGQSEGDRLQAMVQSA TDDVKE DVFDSKKELVMKMH  ALRKNFQF+V
Subjt:  MGEGDDEGEYGNEYASDRLDV----------------------HMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRV

Query:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
        KKSTP+LYL+RCIDPTCTWRLR TKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR
Subjt:  KKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWR

Query:  LSEEALRLIRGDPASSYVLLPAYGEA--------------------------------------------------------------------------
         SEEALRLIRGDPASSY LLPAYGEA                                                                          
Subjt:  LSEEALRLIRGDPASSYVLLPAYGEA--------------------------------------------------------------------------

Query:  -----------------------------------------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRV
                                                                   MNLLAKFKT  LE LFFKAAKAFRESYFNENWVQLCAHP V
Subjt:  -----------------------------------------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRV

Query:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQF
        REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIR VLQRWFYE RTLASSRQSTLSDYAEEMI+EA DNARRHIVMNIDQF
Subjt:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQF

Query:  NFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
        NFEV DGNLNGDVDLQSQTCTCR FDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS
Subjt:  NFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNS

A0A6J1CNJ2 uncharacterized protein LOC111012733 (e value 4.6e-167, 69.36% identity)
Query:  EYASDRLDVHMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKI
        EY  D +   M+  RVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKI
Subjt:  EYASDRLDVHMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKI

Query:  KKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEA--------
        KKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEA        
Subjt:  KKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEA--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVN
                                 MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVN
Subjt:  -------------------------MNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVN

Query:  ALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV
        ALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFE+
Subjt:  ALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV

A0A6J1CVL4 uncharacterized protein LOC111015181 (e value 9.6e-157, 56.9% identity)
Query:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV
        ++KNFQF+VKKST ELY+LRC+   CTWRLRATK+++C LFKIKKY A H+ C G  +K DHRQAKSWVVGHLVQ KFTDVSRTYRPK+I+QD+R+EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV

Query:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------
        N+SYD+A R SEEALRLIRGDPASSY LLPAYGEA+                                                                
Subjt:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------

Query:  ------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYL
                                                              NL+AKFK     +  LF KAAKA+RESYFN  W QL A+P VREYL
Subjt:  ------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYL

Query:  EAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV
        + IGKERWARCFQT+LRY+QMTTNIAESVN LFRHARKLPVTALLDHIR  LQ WFY+RRTLA+SR +TLSDYAE M +E S++ RRH+V NIDQF+F+V
Subjt:  EAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEV

Query:  RDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI
        +D NL+G VDL + TC CR FDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+PPK V RVGRR+T RI
Subjt:  RDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRI

Query:  PSTGEVRPPRKCSRCGTSGHNR
        PSTGEVR  RKC RCG  G ++
Subjt:  PSTGEVRPPRKCSRCGTSGHNR

A0A6J1DJT1 uncharacterized protein LOC111020715 (e value 4.1e-184, 55.84% identity)
Query:  EGDDEGEYGNEYASDRLD---------VHMSMKR------------VTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKS
        EGD E E+ N+   D LD         VH  + R            +TGQ   + LQ +VQS+GT+DVKEG+VFD+KKEL ++MH   +R NFQF+VKKS
Subjt:  EGDDEGEYGNEYASDRLD---------VHMSMKR------------VTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKS

Query:  TPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSE
        TPELY+L C+D +CTWRLRATK+RDCNLFKIKKY ++H+ CNG V+KQDHRQAKSWVVGHLVQ+KFTDVSRTYRPKDI+QD+R+EYGVN+SYDKAWR SE
Subjt:  TPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSE

Query:  EALRLIRGDPASSYVLLPAYGEA-----------------------------------------------------------------------------
        EALRLIRGDPASSY LLP YGEA                                                                             
Subjt:  EALRLIRGDPASSYVLLPAYGEA-----------------------------------------------------------------------------

Query:  --------------------------------------------------------MNLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVR
                                                                MNLLAKFK     LE LF KAAKA+RESYFN  W QL A+P VR
Subjt:  --------------------------------------------------------MNLLAKFK--TPELEGLFFKAAKAFRESYFNENWVQLCAHPRVR

Query:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFN
        EYL+ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIR +LQ WFY+RRTLASSR +TLS YAE  ++E SDNARRH+V+NIDQF+
Subjt:  EYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFN

Query:  FEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQT
         +VRDGNL+G VD  S+TC CR FDYFK+PCSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR+T
Subjt:  FEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQT

Query:  VRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT
        VRIPSTGEVR  RKC RCGTSGHN KTC EPLNT
Subjt:  VRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT

A0A6J1DYC4 uncharacterized protein LOC111025678 (e value 8.7e-142, 51.92% identity)
Query:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV
        +RKNFQF+VKKST ELY+LRC+   CTWRLRATK+++C LFKI KY A H+ C G  +K DHRQ KSWVVGHLVQ KFTDVSRTYRPKDI+QD+R EYGV
Subjt:  LRKNFQFRVKKSTPELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGV

Query:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------
        N+SYD+AWR SEEALRLIRGDPASSY LLPAYGEA+                                                                
Subjt:  NMSYDKAWRLSEEALRLIRGDPASSYVLLPAYGEAM----------------------------------------------------------------

Query:  ---------------------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNE
                                                                             NL+AKFK     +E LF KAAKA+RESYFN 
Subjt:  ---------------------------------------------------------------------NLLAKFK--TPELEGLFFKAAKAFRESYFNE

Query:  NWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNA
         W QL A+P                                 S+NALFRH RKLPVTALLDHIR  LQ WFY+RRTLA+SR +TLSDYAE M +E SD+A
Subjt:  NWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNA

Query:  RRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQP
        RRH+V NIDQF+F+VRDGNL+G VDL +  C+CR FDYFK+PCSHAIAAA+ R+INPY+LCDEAYT NSW+LAYAEPIFPVG  STW SSP FVNI V+P
Subjt:  RRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQP

Query:  PKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        PK V RVGRR+TVRIPSTGEVR  RKC RCG  GHNRKTC EPL T+
Subjt:  PKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase (e value 5.6e-08, 33.68% identity)
Query:  NGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN----IDVQPPKKVVRVGRRQ
        +G V L   TCTC  F   K PC HA+A      INP    D+ YTV  +   Y+    PV   S W  + G       +   PP KV   G+ +
Subjt:  NGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN----IDVQPPKKVVRVGRRQ

AT1G64255.1 MuDR family transposase (e value 1.9e-08, 26.6% identity)
Query:  HPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRRVLQRWFYERRTLASSRQS-TLSDYAEEMISEAS
        +P  R++L+   + RWA       RY  M  N      ALF          H     V  L D +R    + F      + SR S    D   E + +  
Subjt:  HPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRRVLQRWFYERRTLASSRQS-TLSDYAEEMISEAS

Query:  DNAR------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKS
        +  R       +IV  +D   F+V      G+  V L   +CTC  F  +K PC HA+A       NP    D+ YT+      YA     V   S W  
Subjt:  DNAR------RHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKS

Query:  SPG
        + G
Subjt:  SPG

AT1G64260.1 MuDR family transposase (e value 5.4e-11, 24.34% identity)
Query:  LAKFKTPELEGLFFKAAKAFRESYFNENWVQLC-AHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLP---------VTALLDHI
        L  F+   LE L  +A    ++  F+     +   +P   ++L+ I + +WA    + LRY      I     ALF   R  P         V  + D +
Subjt:  LAKFKTPELEGLFFKAAKAFRESYFNENWVQLC-AHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLP---------VTALLDHI

Query:  RRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLC
        R    +      + + +R    ++   + + E   ++  +++  +++ +F+V + +   +  V L   TCTCR F  +K PC HA+A      INP    
Subjt:  RRVLQRWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLC

Query:  DEAYTVNSWMLAYAEPIFPVGSSSTW
        DE YTV  +   YA    PV   + W
Subjt:  DEAYTVNSWMLAYAEPIFPVGSSSTW


Sequences
CDS sequence
ATGCTGGAGGTCAAGTCAAGCTTTTTGAAGCAAGCTGCTGCCGCCCGCTTGGCGGGCTCCGCGCTCGTTATCCTTACTGTTCTCTCTGCAAAGGAAAGCGAGATCGTACC
TCTTTTTCCAGGCCTGTTCGGACATACGGTTCCCGCGGAAGATCAAGTTGGTGAGCGGATGCCTCGTGTTTTTATATTATTCGGTGGAGAATGGAAAGATATTGAAAAGG
ATTACGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTATGCTGAATTTCCAGGACATGTATGTAGGCTAAGTAGTACAAATCCATTACAGGAAGAT
ATCATAATTAGACGTGTATATAATTTTAAAGCGAAGGTTTGTGTAATGGAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCC
GCTATACATAGCTACCGTGCCAAAGAAGCTTGTGCCCCCCATTTGCAAAACCCTTATTTTCAAGACCCTCATTTTCAAGTCTGTCAATTCCGTCCTCGTCGTCGAACCCC
TCTTCTTCCCGCCCACCACCCCCTACTTTGGTCATATTGGTCATGATATAGCATCTCTCACACCGTTAGGGTCAGATGTTGTTCCTTGTAATTTGGGAGATGATAGGGCA
TATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATCGTCCAATGACCGACATGGGAGAAGGAGACGACGAAGGGGA
ATATGGAAATGAGTACGCCAGTGATCGACTTGATGTGCACATGAGCATGAAAAGAGTCACCGGTCAGTCAGAAGGTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGA
CCGATGATGTTAAGGAGGGTGACGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTCTTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACG
CCGGAACTATACTTGCTGCGATGCATCGATCCTACTTGCACATGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAGTTCA
CTCTAAGTGCAATGGTGCCGTTATGAAACAGGATCATCGTCAGGCGAAGAGTTGGGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACGTACAGGC
CGAAGGACATCATGCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGCCTGGCGTTTGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCT
TCATCGTACGTGCTACTACCCGCTTATGGGGAAGCCATGAACTTGCTGGCCAAATTTAAAACGCCCGAGTTGGAGGGATTATTTTTTAAGGCTGCGAAGGCATTTCGCGA
GTCATATTTCAATGAGAACTGGGTCCAACTGTGCGCACACCCAAGAGTGAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAA
GATACTCACAAATGACCACCAATATTGCAGAGTCCGTTAATGCCCTTTTCAGGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCATATCAGACGTGTGTTGCAG
AGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTTCCGAAGCTTCGGATAATGCACGGAGACACATTGT
TATGAACATCGACCAGTTTAATTTTGAGGTACGCGACGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGGGTTCGATTATTTTAAAGTCC
CGTGCTCCCATGCTATTGCTGCAGCTAGTTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCGTATGCAGAACCAATA
TTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTGAGGAT
TCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGTACGTCGGGACACAATCGTAAAACTTGTCGCGAACCACTAAATACTGTGTAG
mRNA sequence
ATGCTGGAGGTCAAGTCAAGCTTTTTGAAGCAAGCTGCTGCCGCCCGCTTGGCGGGCTCCGCGCTCGTTATCCTTACTGTTCTCTCTGCAAAGGAAAGCGAGATCGTACC
TCTTTTTCCAGGCCTGTTCGGACATACGGTTCCCGCGGAAGATCAAGTTGGTGAGCGGATGCCTCGTGTTTTTATATTATTCGGTGGAGAATGGAAAGATATTGAAAAGG
ATTACGTGGGTGGTCGTACAAGAGGATTGACTGTGGATAGTAAAATCACCTATGCTGAATTTCCAGGACATGTATGTAGGCTAAGTAGTACAAATCCATTACAGGAAGAT
ATCATAATTAGACGTGTATATAATTTTAAAGCGAAGGTTTGTGTAATGGAAATAACTGACGACGATGACCTGACTTTCTTCTTGACTGGTGAAGATGTCTCTGAATTGCC
GCTATACATAGCTACCGTGCCAAAGAAGCTTGTGCCCCCCATTTGCAAAACCCTTATTTTCAAGACCCTCATTTTCAAGTCTGTCAATTCCGTCCTCGTCGTCGAACCCC
TCTTCTTCCCGCCCACCACCCCCTACTTTGGTCATATTGGTCATGATATAGCATCTCTCACACCGTTAGGGTCAGATGTTGTTCCTTGTAATTTGGGAGATGATAGGGCA
TATGATTGGGATGTGCCTGGCTTGTGGAATGGAAGTGAAAATGTGGATGAAGATAGTGATGAATCATATCGTCCAATGACCGACATGGGAGAAGGAGACGACGAAGGGGA
ATATGGAAATGAGTACGCCAGTGATCGACTTGATGTGCACATGAGCATGAAAAGAGTCACCGGTCAGTCAGAAGGTGATAGATTGCAAGCCATGGTCCAATCGGCTGGGA
CCGATGATGTTAAGGAGGGTGACGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTCTTTGCATTGCGGAAGAACTTTCAGTTTCGAGTGAAGAAGTCTACG
CCGGAACTATACTTGCTGCGATGCATCGATCCTACTTGCACATGGCGACTTAGAGCCACTAAGATTAGAGATTGCAACCTGTTTAAGATAAAGAAATATATCGCAGTTCA
CTCTAAGTGCAATGGTGCCGTTATGAAACAGGATCATCGTCAGGCGAAGAGTTGGGTGGTCGGTCATCTAGTACAATCAAAGTTTACTGATGTTTCTCGCACGTACAGGC
CGAAGGACATCATGCAAGATATTCGTGAGGAGTACGGTGTAAATATGAGTTACGACAAGGCCTGGCGTTTGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCT
TCATCGTACGTGCTACTACCCGCTTATGGGGAAGCCATGAACTTGCTGGCCAAATTTAAAACGCCCGAGTTGGAGGGATTATTTTTTAAGGCTGCGAAGGCATTTCGCGA
GTCATATTTCAATGAGAACTGGGTCCAACTGTGCGCACACCCAAGAGTGAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAA
GATACTCACAAATGACCACCAATATTGCAGAGTCCGTTAATGCCCTTTTCAGGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCATATCAGACGTGTGTTGCAG
AGGTGGTTCTACGAACGTCGGACGCTTGCTTCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTTCCGAAGCTTCGGATAATGCACGGAGACACATTGT
TATGAACATCGACCAGTTTAATTTTGAGGTACGCGACGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCGGGGGTTCGATTATTTTAAAGTCC
CGTGCTCCCATGCTATTGCTGCAGCTAGTTCTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCGTATGCAGAACCAATA
TTTCCAGTGGGTTCATCCTCAACATGGAAGAGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTGAGGAT
TCCTTCCACAGGCGAGGTCCGTCCACCGCGCAAGTGCAGTCGATGTGGTACGTCGGGACACAATCGTAAAACTTGTCGCGAACCACTAAATACTGTGTAG
Protein sequence
MLEVKSSFLKQAAAARLAGSALVILTVLSAKESEIVPLFPGLFGHTVPAEDQVGERMPRVFILFGGEWKDIEKDYVGGRTRGLTVDSKITYAEFPGHVCRLSSTNPLQED
IIIRRVYNFKAKVCVMEITDDDDLTFFLTGEDVSELPLYIATVPKKLVPPICKTLIFKTLIFKSVNSVLVVEPLFFPPTTPYFGHIGHDIASLTPLGSDVVPCNLGDDRA
YDWDVPGLWNGSENVDEDSDESYRPMTDMGEGDDEGEYGNEYASDRLDVHMSMKRVTGQSEGDRLQAMVQSAGTDDVKEGDVFDSKKELVMKMHFFALRKNFQFRVKKST
PELYLLRCIDPTCTWRLRATKIRDCNLFKIKKYIAVHSKCNGAVMKQDHRQAKSWVVGHLVQSKFTDVSRTYRPKDIMQDIREEYGVNMSYDKAWRLSEEALRLIRGDPA
SSYVLLPAYGEAMNLLAKFKTPELEGLFFKAAKAFRESYFNENWVQLCAHPRVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRRVLQ
RWFYERRTLASSRQSTLSDYAEEMISEASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCRGFDYFKVPCSHAIAAASSRSINPYTLCDEAYTVNSWMLAYAEPI
FPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
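
A CDS and its protein translation should agree in length: one codon per residue, plus an optional terminal stop codon. The sketch below is one way to sanity-check a pair of sequences copied from a record like this one. It assumes the standard stop codons (TAA, TAG, TGA) and an ATG start, and it only compares lengths; it does not translate the sequence. The function name `cds_length_matches_protein` is hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # assumption: standard genetic code


def cds_length_matches_protein(cds: str, protein: str) -> bool:
    """Return True if the CDS has one codon per residue (plus optional stop)."""
    # Remove line breaks and other whitespace left over from copy-pasting.
    cds = "".join(cds.split()).upper()
    protein = "".join(protein.split()).upper()

    # A complete CDS must consist of whole codons and begin with ATG.
    if len(cds) % 3 != 0 or not cds.startswith("ATG"):
        return False

    codons = len(cds) // 3
    if cds[-3:] in STOP_CODONS:
        codons -= 1  # the stop codon does not encode a residue
    return codons == len(protein)


# Toy example using the first three codons of the CDS above plus its stop codon.
print(cds_length_matches_protein("ATGCTGGAG\nTAG", "MLE"))
```

Passing the full CDS and protein strings from this record to the same function would perform the check on the real data.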