CuGenDBv2

Moc08g33360 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g33360
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr8:24223002..24231795
RNA-Seq Expression: Moc08g33360
Synteny: Moc08g33360
Gene Ontology terms:
GO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR001878 - Zinc finger, CCHC-type
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type
IPR018289 - MULE transposase domain
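The genome location string in this record can be split into a chromosome name and a coordinate pair. The sketch below is a minimal illustration that assumes the coordinates are 1-based and inclusive (an assumption; the record does not state the convention). The names `parse_location` and `span_length` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    return chrom, start, end

def span_length(start: int, end: int) -> int:
    """Length of the genomic span, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr8:24223002..24231795")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span covers the whole gene locus on the chromosome, so it is not expected to equal the CDS length.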


Homology
GenBank top hits | e-value | % identity | Alignment
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] | e-value: 2.4e-195 | identity: 67.63%
Query:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE+VTIHNTMAEYPVD+VHE ASNR+TGQS+ DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPELYLLRCIDPTCTW----------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS
        STP+LYL+RCIDPTCTW                                                          PKDIMQDIREEYGVNMSYDKAWRSS
Subjt:  STPELYLLRCIDPTCTW----------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS

Query:  EEALRLIRGDPASSYGLLPAYGEA----------------------------------------------------------------------------
        EEALRLIRGDPASSY LLPAYGEA                                                                            
Subjt:  EEALRLIRGDPASSYGLLPAYGEA----------------------------------------------------------------------------

Query:  -----------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVRE
                         SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIH LKMNLLAKFK  ALE LFFKA KAFRESYFNENWVQLCAHPGVRE
Subjt:  -----------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVRE

Query:  YLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNF
        YLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAEEMIA+A DNARRHIVMNIDQFNF
Subjt:  YLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTC+EFDYFKV CSHAIA ASSRSINPYTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNS

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] | e-value: 2.7e-175 | identity: 53.7%
Query:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +      +V +   + LTGQ   + LQ +VQS+GTNDVKEG+VFD+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK

Query:  KSTPELYLLRCIDPTCTW---------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS
        KSTPELY+L C+D +CTW                                                         PKDI+QD+R+EYGVN+SYDKAWRSS
Subjt:  KSTPELYLLRCIDPTCTW---------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS

Query:  EEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------SV
        EEALRLIRGDPASSYGLLP YGEA                                                                          ++
Subjt:  EEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------SV

Query:  VD-------------------TVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKM--HALEELFFKAVKAFRESYFNENWVQLCAHPGV
        VD                    V NLVF+S+RH  ICKAID+VFPTAFHCFCI  +KMNLLAKFK+   ALEELF KA KA+RESYFN  W QL A+PGV
Subjt:  VD-------------------TVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKM--HALEELFFKAVKAFRESYFNENWVQLCAHPGV

Query:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQF
        REYL+ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG+LQ WFY+RRTLASSR +TLS YAE  +A+ SDNARRH+V+NIDQF
Subjt:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQF

Query:  NFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQ
        + +VRDGNL+G VD  S+TC C+EFDYFK+ CSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR+
Subjt:  NFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQ

Query:  TVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT
        TVRIPSTGEVR  RKC RCGTSGHN KTC EPLNT
Subjt:  TVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT

XP_022154964.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] | e-value: 5.4e-147 | identity: 89.26%
Query:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK
        SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIH LKMNLLAKFK   LEELFFKA KA RESYFNENWVQLCAHPGVREY+E IGKERWARCFQTK
Subjt:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK

Query:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTL------------------SDYAEEMIAKASDNARRHIVMNIDQFN
        LRY QMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTL                  SDYAEEMIA+ASDNARRHIVMNIDQFN
Subjt:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTL------------------SDYAEEMIAKASDNARRHIVMNIDQFN

Query:  FEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        FEVRDGNLNGDVDLQSQTCTC+EFDYFKV CSHAIA A SRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
Subjt:  FEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

XP_022156122.1 uncharacterized protein LOC111023087 [Momordica charantia] | e-value: 4.2e-168 | identity: 70.55%
Query:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------
        MQDIREEYGVNMSYDKAWRSSEEALRLIR DPASSYGLLPAYGEA                                                       
Subjt:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------

Query:  --------------------------------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKA
                                              SVVDTVQNLVFIS+RHAAICKAIDEVFPTAFHCFCIH LKM+LLAKFK  ALEELFFKA KA
Subjt:  --------------------------------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKA

Query:  FRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEM
        FRESYFNENWVQLCA+PGVREYLEAI KERWARCFQ KLRYSQMTTNIAESVNALFRHARKLPVTALLDHIR                         EEM
Subjt:  FRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEM

Query:  IAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
        IAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTC+EFDYFKVSCS AIA ASSRSINPYTLCDE YTVNSWMLAYAEPIFPVGSSSTWKSSPG
Subjt:  IAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

Query:  FVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        FVNIDVQPPKKVVRVGRRQTV+IPSTGEVRPPR CSRCGTSGHNRKTCREPLNTV
Subjt:  FVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

XP_022158655.1 uncharacterized protein LOC111025117 [Momordica charantia] | e-value: 5.3e-155 | identity: 87.07%
Query:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK
        SVVDTVQNLVFISDRHA+ICKAIDEVFP AFHCFCIH LKMNLLAKFK  ALE LFFKA KAF E YFNENWVQLCAHPGVREYLEAIGKERWARCFQTK
Subjt:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK

Query:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT
        LRYSQMTTNIAESVNAL                   + RWFYER+TLASSRQSTLSDYAEEMIA+A+DN+RRHIVMNIDQFNFEVRDGNLNGDVDLQSQT
Subjt:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT

Query:  CTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC
        CTC+EFDYFKV CSHAIA A+SRSINPYTLCDEAYTVNSWMLA+AEPIF VGSS+TWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC
Subjt:  CTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC

Query:  GTSGHNRKTCREPLNTV
        GTSGHNRKTCREPLNTV
Subjt:  GTSGHNRKTCREPLNTV
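The middle line of each alignment block above can be tallied to see how many columns are identical. The sketch below assumes the usual BLAST-style convention (a letter marks an identical residue, `+` marks a conservative substitution, a blank marks a mismatch or gap); that convention is an assumption about this page's output, and `tally_midline` is a hypothetical helper name. The example strings are the last segment of the XP_022153146.1 alignment.

```python
def tally_midline(query: str, midline: str) -> dict[str, int]:
    """Count identical and positive columns in one alignment segment."""
    # Pad the midline in case trailing blanks were stripped during extraction.
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)
    positives = identities + midline.count("+")
    return {"columns": len(query), "identities": identities, "positives": positives}

query   = "TVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT"
midline = "TVRIPSTGEVR  RKC RCGTSGHN KTC EPLNT"
print(tally_midline(query, midline))
```

A per-segment count like this will not generally match the %identity column, which is presumably computed over the full alignment, including gapped columns.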

TrEMBL top hits | e-value | % identity | Alignment
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like | e-value: 1.1e-195 | identity: 67.63%
Query:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK
        EGDDE EYGNE+ASDRLDVQHEHE+VTIHNTMAEYPVD+VHE ASNR+TGQS+ DRLQAMVQSA T+DVKE DVFDSKKELVMKMHLLALRKNFQF+VKK
Subjt:  EGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVKK

Query:  STPELYLLRCIDPTCTW----------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS
        STP+LYL+RCIDPTCTW                                                          PKDIMQDIREEYGVNMSYDKAWRSS
Subjt:  STPELYLLRCIDPTCTW----------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS

Query:  EEALRLIRGDPASSYGLLPAYGEA----------------------------------------------------------------------------
        EEALRLIRGDPASSY LLPAYGEA                                                                            
Subjt:  EEALRLIRGDPASSYGLLPAYGEA----------------------------------------------------------------------------

Query:  -----------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVRE
                         SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIH LKMNLLAKFK  ALE LFFKA KAFRESYFNENWVQLCAHPGVRE
Subjt:  -----------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVRE

Query:  YLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNF
        YLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKL +TALLDHIRGVLQRWFYE RTLASSRQSTLSDYAEEMIA+A DNARRHIVMNIDQFNF
Subjt:  YLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNF

Query:  EVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNS
        EV DGNLNGDVDLQSQTCTC+EFDYFKV CSHAIA ASSRSINPYTLCDEAYTVNS
Subjt:  EVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNS

A0A6J1DJT1 uncharacterized protein LOC111020715 | e-value: 1.3e-175 | identity: 53.7%
Query:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK
        EEGD E E+ N+   D LD + E +   +H  +      +V +   + LTGQ   + LQ +VQS+GTNDVKEG+VFD+KKEL ++MHL+ +R NFQF+VK
Subjt:  EEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNFQFRVK

Query:  KSTPELYLLRCIDPTCTW---------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS
        KSTPELY+L C+D +CTW                                                         PKDI+QD+R+EYGVN+SYDKAWRSS
Subjt:  KSTPELYLLRCIDPTCTW---------------------------------------------------------PKDIMQDIREEYGVNMSYDKAWRSS

Query:  EEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------SV
        EEALRLIRGDPASSYGLLP YGEA                                                                          ++
Subjt:  EEALRLIRGDPASSYGLLPAYGEA--------------------------------------------------------------------------SV

Query:  VD-------------------TVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKM--HALEELFFKAVKAFRESYFNENWVQLCAHPGV
        VD                    V NLVF+S+RH  ICKAID+VFPTAFHCFCI  +KMNLLAKFK+   ALEELF KA KA+RESYFN  W QL A+PGV
Subjt:  VD-------------------TVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKM--HALEELFFKAVKAFRESYFNENWVQLCAHPGV

Query:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQF
        REYL+ IGKERWARCFQT+LRY+QMT+N AESVNALFRHARKLPVTALLDHIRG+LQ WFY+RRTLASSR +TLS YAE  +A+ SDNARRH+V+NIDQF
Subjt:  REYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQF

Query:  NFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQ
        + +VRDGNL+G VD  S+TC C+EFDYFK+ CSHAIA A  R+INPYTLCDEAYT NSW++AYAEPIFP+G  STW SSP FV+  V+ P  V RVGRR+
Subjt:  NFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQ

Query:  TVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT
        TVRIPSTGEVR  RKC RCGTSGHN KTC EPLNT
Subjt:  TVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNT

A0A6J1DNT3 protein FAR1-RELATED SEQUENCE 4-like | e-value: 2.6e-147 | identity: 89.26%
Query:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK
        SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIH LKMNLLAKFK   LEELFFKA KA RESYFNENWVQLCAHPGVREY+E IGKERWARCFQTK
Subjt:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK

Query:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTL------------------SDYAEEMIAKASDNARRHIVMNIDQFN
        LRY QMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTL                  SDYAEEMIA+ASDNARRHIVMNIDQFN
Subjt:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTL------------------SDYAEEMIAKASDNARRHIVMNIDQFN

Query:  FEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
        FEVRDGNLNGDVDLQSQTCTC+EFDYFKV CSHAIA A SRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR
Subjt:  FEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRR

A0A6J1DR67 uncharacterized protein LOC111023087 | e-value: 2.0e-168 | identity: 70.55%
Query:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------
        MQDIREEYGVNMSYDKAWRSSEEALRLIR DPASSYGLLPAYGEA                                                       
Subjt:  MQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEA-------------------------------------------------------

Query:  --------------------------------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKA
                                              SVVDTVQNLVFIS+RHAAICKAIDEVFPTAFHCFCIH LKM+LLAKFK  ALEELFFKA KA
Subjt:  --------------------------------------SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKA

Query:  FRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEM
        FRESYFNENWVQLCA+PGVREYLEAI KERWARCFQ KLRYSQMTTNIAESVNALFRHARKLPVTALLDHIR                         EEM
Subjt:  FRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEM

Query:  IAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
        IAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTC+EFDYFKVSCS AIA ASSRSINPYTLCDE YTVNSWMLAYAEPIFPVGSSSTWKSSPG
Subjt:  IAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

Query:  FVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
        FVNIDVQPPKKVVRVGRRQTV+IPSTGEVRPPR CSRCGTSGHNRKTCREPLNTV
Subjt:  FVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV

A0A6J1DWF8 uncharacterized protein LOC111025117 | e-value: 2.6e-155 | identity: 87.07%
Query:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK
        SVVDTVQNLVFISDRHA+ICKAIDEVFP AFHCFCIH LKMNLLAKFK  ALE LFFKA KAF E YFNENWVQLCAHPGVREYLEAIGKERWARCFQTK
Subjt:  SVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTK

Query:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT
        LRYSQMTTNIAESVNAL                   + RWFYER+TLASSRQSTLSDYAEEMIA+A+DN+RRHIVMNIDQFNFEVRDGNLNGDVDLQSQT
Subjt:  LRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQT

Query:  CTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC
        CTC+EFDYFKV CSHAIA A+SRSINPYTLCDEAYTVNSWMLA+AEPIF VGSS+TWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC
Subjt:  CTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRC

Query:  GTSGHNRKTCREPLNTV
        GTSGHNRKTCREPLNTV
Subjt:  GTSGHNRKTCREPLNTV

SwissProt top hits | e-value | % identity | Alignment
No hits found
Arabidopsis top hits | e-value | % identity | Alignment
AT1G49920.1 MuDR family transposase | e-value: 1.0e-10 | identity: 24.11%
Query:  PTAFHCFCIHQLKMNLLA-----KFKMHAL-EELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHA
        P A+H FC++ L   L +      + MH L +E    + K   +SY  E   +   +P   ++L+     +WA       RY  M  +  E++ A+ +  
Subjt:  PTAFHCFCIHQLKMNLLA-----KFKMHAL-EELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHA

Query:  RKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASD-----NARRHIVMNIDQFNFEV-------------RDGNLNGDVDLQSQTCTC
        RK+ +   +  + G L+  F E   L+         Y E ++ K  +     +     +  +++  ++V              + + +G V L   TCTC
Subjt:  RKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASD-----NARRHIVMNIDQFNFEV-------------RDGNLNGDVDLQSQTCTC

Query:  QEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN----IDVQPPKKVVRVGRRQ
         EF   K  C HA+AV     INP    D+ YTV  +   Y+    PV   S W  + G       +   PP KV   G+ +
Subjt:  QEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPGFVN----IDVQPPKKVVRVGRRQ

AT1G64255.1 MuDR family transposase | e-value: 4.3e-09 | identity: 24.2%
Query:  QNLVFISDRHAAICKAIDE-----VFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRE----SYFNENWVQLCAHPGVREYLEAIGKERWARCF
        + L  IS  H  I   ++E       P A+H F ++         F    L     +A    ++    SY N+   +   +P  R++L+   + RWA   
Subjt:  QNLVFISDRHAAICKAIDE-----VFPTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRE----SYFNENWVQLCAHPGVREYLEAIGKERWARCF

Query:  QTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASD-----NARRHIVMNIDQFNF
            RY  M  N      ALF          H     V  L D +R    + F    + + S  +    Y E ++ K  +         +IV  +D   F
Subjt:  QTKLRYSQMTTNIAESVNALFR---------HARKLPVTALLDHIRGVLQRWFYERRTLASSRQSTLSDYAEEMIAKASD-----NARRHIVMNIDQFNF

Query:  EVRDGNLNGD--VDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG
        +V      G+  V L   +CTC +F  +K  C HA+AV      NP    D+ YT+      YA     V   S W  + G
Subjt:  EVRDGNLNGD--VDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWKSSPG

AT1G64260.1 MuDR family transposase | e-value: 8.8e-15 | identity: 27.16%
Query:  PTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLC-AHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPV
        P A H FC++ L+   L  F+ + LE L  +A    ++  F+     +   +P   ++L+ I + +WA    + LRY      I     ALF   R  P 
Subjt:  PTAFHCFCIHQLKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLC-AHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPV

Query:  TALLDHIRGVLQRWFYERRT----LASSRQSTLSD---YAEEMIAKASD---NARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCQEFDYFKVSCS
          +   + G +   F E R+      SS  S+L+    Y E  + K  +   ++  +++  +++ +F+V + +   +  V L   TCTC++F  +K  C 
Subjt:  TALLDHIRGVLQRWFYERRT----LASSRQSTLSD---YAEEMIAKASD---NARRHIVMNIDQFNFEVRDGNLNGD--VDLQSQTCTCQEFDYFKVSCS

Query:  HAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW
        HA+AV     INP    DE YTV  +   YA    PV   + W
Subjt:  HAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTW


Sequences
CDS sequence
ATGGGCCTCTCGGAAGAGAAGGTTGTCGAGAATGGTTGGAGGAATAGAAAGGTCACCGGTTCGAATAAGATTAAGGAAGACACTGACACTAAAGCAAGCATGGTCTCTGT
CGATGAAATGATGTTTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACGTGGGTGGTCCTACAAGAGGATTGACTGTGGATAGTAATATCA
CCTATGCTGAATTTTTAGGACATGTATGGTCAAATGTTGTTCCTTGTAATTTGGGAGATGATAGGGCATATGATTGGGATGTGTCTGGCTTGTGGAATGGAAGTGAAAAT
GTGGATGAAGATAGTGATGAATCATATCGTCTAATGACCGACACGGAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCTAGTGATCGACTTGATGTGCAACA
TGAGCATGAAGAGGTAACAATTCATAATACAATGGCTGAATATCCTGTCGATTCCGTCCATGAAACGGCAAGCAATAGACTCACCGGTCAGTCAAAAGCTGATAGATTGC
AAGCCATGGTCCAATCGGCTGGGACCAATGATGTTAAGGAGGGTGATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTT
CAGTTTCGAGTGAAGAAGTCTACGCCGGAACTATACTTGCTGCGATGCATCGATCCTACTTGCACGTGGCCGAAGGACATCATGCAAGATATTCGTGAGGAGTACGGTGT
AAATATGAGTTACGACAAGGCCTGGCGTTCGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGTACGGGCTACTACCCGCTTATGGGGAAGCCAGTG
TTGTCGACACCGTTCAGAATTTGGTCTTCATTTCTGATCGACATGCCGCCATTTGTAAAGCGATTGATGAGGTATTTCCTACTGCATTTCATTGCTTTTGTATACATCAA
CTAAAGATGAACTTGCTGGCCAAATTTAAAATGCACGCGTTGGAGGAATTATTTTTTAAGGCTGTTAAGGCATTTCGCGAGTCATATTTCAATGAGAACTGGGTCCAACT
GTGCGCACACCCAGGAGTGAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAAGATACTCACAAATGACCACCAATATTGCAG
AGTCCGTTAATGCCCTTTTCAGGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGGTGGTTCTACGAACGTCGGACGCTTGCT
TCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCAAAGCTTCGGATAATGCACGGAGACACATTGTTATGAACATCGACCAGTTTAATTTTGAGGT
ACGCGACGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCAGGAGTTCGATTATTTTAAAGTCTCGTGCTCCCATGCTATTGCTGTAGCCAGTT
CTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCATATGCAGAACCAATATTTCCAGTGGGTTCGTCCTCAACATGGAAG
AGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCG
CAAGTGCAGTCGATGTGGTACGTCGGGGCACAATCGTAAAACTTGTCGCGAACCACTAAATACTGTGTAG
mRNA sequence
ATGGGCCTCTCGGAAGAGAAGGTTGTCGAGAATGGTTGGAGGAATAGAAAGGTCACCGGTTCGAATAAGATTAAGGAAGACACTGACACTAAAGCAAGCATGGTCTCTGT
CGATGAAATGATGTTTCGTGTTTTTATATCATTCGGTGGAGAATGGAAAGATATTGAAAAGGATTACGTGGGTGGTCCTACAAGAGGATTGACTGTGGATAGTAATATCA
CCTATGCTGAATTTTTAGGACATGTATGGTCAAATGTTGTTCCTTGTAATTTGGGAGATGATAGGGCATATGATTGGGATGTGTCTGGCTTGTGGAATGGAAGTGAAAAT
GTGGATGAAGATAGTGATGAATCATATCGTCTAATGACCGACACGGAAGAAGGAGACGACGAAAGGGAATATGGAAATGAGCACGCTAGTGATCGACTTGATGTGCAACA
TGAGCATGAAGAGGTAACAATTCATAATACAATGGCTGAATATCCTGTCGATTCCGTCCATGAAACGGCAAGCAATAGACTCACCGGTCAGTCAAAAGCTGATAGATTGC
AAGCCATGGTCCAATCGGCTGGGACCAATGATGTTAAGGAGGGTGATGTATTCGACTCGAAGAAGGAACTAGTTATGAAAATGCATTTACTTGCATTGCGGAAGAACTTT
CAGTTTCGAGTGAAGAAGTCTACGCCGGAACTATACTTGCTGCGATGCATCGATCCTACTTGCACGTGGCCGAAGGACATCATGCAAGATATTCGTGAGGAGTACGGTGT
AAATATGAGTTACGACAAGGCCTGGCGTTCGAGCGAAGAAGCACTCCGACTTATCAGAGGGGATCCAGCTTCATCGTACGGGCTACTACCCGCTTATGGGGAAGCCAGTG
TTGTCGACACCGTTCAGAATTTGGTCTTCATTTCTGATCGACATGCCGCCATTTGTAAAGCGATTGATGAGGTATTTCCTACTGCATTTCATTGCTTTTGTATACATCAA
CTAAAGATGAACTTGCTGGCCAAATTTAAAATGCACGCGTTGGAGGAATTATTTTTTAAGGCTGTTAAGGCATTTCGCGAGTCATATTTCAATGAGAACTGGGTCCAACT
GTGCGCACACCCAGGAGTGAGGGAATATCTGGAAGCTATAGGAAAGGAACGATGGGCTCGCTGCTTTCAGACGAAACTAAGATACTCACAAATGACCACCAATATTGCAG
AGTCCGTTAATGCCCTTTTCAGGCATGCACGTAAGTTGCCAGTCACCGCATTACTTGATCATATCAGAGGTGTGTTGCAGAGGTGGTTCTACGAACGTCGGACGCTTGCT
TCTTCACGTCAGAGTACGTTGTCTGACTACGCAGAGGAAATGATTGCCAAAGCTTCGGATAATGCACGGAGACACATTGTTATGAACATCGACCAGTTTAATTTTGAGGT
ACGCGACGGGAACCTCAATGGGGACGTTGACTTGCAATCGCAGACGTGTACTTGTCAGGAGTTCGATTATTTTAAAGTCTCGTGCTCCCATGCTATTGCTGTAGCCAGTT
CTCGTAGCATAAATCCGTACACACTATGCGATGAGGCGTACACGGTCAACAGCTGGATGTTGGCATATGCAGAACCAATATTTCCAGTGGGTTCGTCCTCAACATGGAAG
AGTTCTCCGGGGTTTGTGAATATCGATGTTCAACCACCGAAGAAGGTCGTTAGGGTTGGACGGCGACAGACGGTGAGGATTCCTTCCACAGGCGAGGTCCGTCCACCGCG
CAAGTGCAGTCGATGTGGTACGTCGGGGCACAATCGTAAAACTTGTCGCGAACCACTAAATACTGTGTAG
Protein sequence
MGLSEEKVVENGWRNRKVTGSNKIKEDTDTKASMVSVDEMMFRVFISFGGEWKDIEKDYVGGPTRGLTVDSNITYAEFLGHVWSNVVPCNLGDDRAYDWDVSGLWNGSEN
VDEDSDESYRLMTDTEEGDDEREYGNEHASDRLDVQHEHEEVTIHNTMAEYPVDSVHETASNRLTGQSKADRLQAMVQSAGTNDVKEGDVFDSKKELVMKMHLLALRKNF
QFRVKKSTPELYLLRCIDPTCTWPKDIMQDIREEYGVNMSYDKAWRSSEEALRLIRGDPASSYGLLPAYGEASVVDTVQNLVFISDRHAAICKAIDEVFPTAFHCFCIHQ
LKMNLLAKFKMHALEELFFKAVKAFRESYFNENWVQLCAHPGVREYLEAIGKERWARCFQTKLRYSQMTTNIAESVNALFRHARKLPVTALLDHIRGVLQRWFYERRTLA
SSRQSTLSDYAEEMIAKASDNARRHIVMNIDQFNFEVRDGNLNGDVDLQSQTCTCQEFDYFKVSCSHAIAVASSRSINPYTLCDEAYTVNSWMLAYAEPIFPVGSSSTWK
SSPGFVNIDVQPPKKVVRVGRRQTVRIPSTGEVRPPRKCSRCGTSGHNRKTCREPLNTV
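The CDS and protein sequences above can be cross-checked by translating the nucleotides codon by codon. The sketch below assumes the standard nuclear genetic code (an assumption; the record does not name a translation table), and `translate` is a hypothetical helper. It is run here only on the first and last few codons of the CDS, which correspond to the start (`MGLSEEK`) and end (`NTV`) of the protein sequence.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGGGCCTCTCGGAAGAGAAG"))  # first seven codons of the CDS
print(translate("AATACTGTGTAG"))           # last four codons, ending in a stop
```

To check the whole record, the full CDS text could be passed to `translate` and the result compared with the protein sequence after removing line breaks.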