CuGenDBv2

Moc02g19980 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g19980
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr2:14829044..14833639
RNA-Seq Expression: Moc02g19980
Synteny: Moc02g19980
Gene Ontology terms:
GO:0006313 - transposition, DNA-mediated (biological process)
GO:0003677 - DNA binding (molecular function)
GO:0004803 - transposase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
IPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology
GenBank top hits (e value, % identity, alignment)
XP_022131652.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia] (e value: 8.3e-95; identity: 41.55%)
Query:  EGHSQAEYGNEEHDDALDDELEPDVEQVH-TEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK
        EG  + EYGNE   D LD + E +   +H T      D V     N +T  +  ++LQ +VQS+RT+DV E DVFD+KK+L MKMHL+A+RKNFQF+VKK
Subjt:  EGHSQAEYGNEEHDDALDDELEPDVEQVH-TEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK

Query:  STLELDIL-------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSS
        ST +L ++                                           RQAKSWVVGHLVQ KFTDVSRTYRPKDI+QD+R+EYGV +SYD+   SS
Subjt:  STLELDIL-------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSS

Query:  EKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG---------------------
        E+ALRLIRG+PASSY LLPAYGEA+KIMNP                          GF+ CI+PVLV+DGAH+K K+ G                     
Subjt:  EKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHF
                              MTTNIAESVNALF HARKL +TALLDHIRG LQ  FY+ RTLA+SR +TLSDYAE M AE  D+ARRH+V NIDQF+F
Subjt:  ----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHF

Query:  QVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNS
        +V DGNL+  VDL + TC CREFDYFK+PCSHAIA A+ R+INPY+LCDEAYT NS
Subjt:  QVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNS

XP_022145820.1 uncharacterized protein LOC111015181 [Momordica charantia] (e value: 8.0e-114; identity: 52.84%)
Query:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK
        M+KNFQFKVKKSTLEL IL                                          RQAKSWVVGHLVQEKFTDVSRTYRPK+IIQDMRKEYGV 
Subjt:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK

Query:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG-----------
        LSYDR   SSE+ALRLIRG+PASSYGLLPAYGEALKIMNP                          GFLGCI+PVLVVDGAH+K KFRG           
Subjt:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVR
                           MTTNIAESVN LF HARKLPVTALLDHIRG LQ  FYDRRTLAASRSTTLSDYAENMYAEYS+S RRHVVDNIDQFHFQV+
Subjt:  -------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVR

Query:  DGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP
        D NLD IVDLNAMTC CREFDYFKIPCSHAIA ATMRNINPYSLCDEAYTTNSWILAY EPIFPVGHVSTWN+SP
Subjt:  DGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] (e value: 2.6e-133; identity: 50.78%)
Query:  EEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK
        EEG  +AE+ N+++DDALD+E EPDVEQVH EI RDE AV+  GC+GLT   N E LQLIVQSS TNDV EG+VFD KK+LS++MHLV MR NFQFKVKK
Subjt:  EEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK

Query:  STLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSSE
        ST EL IL                                          RQAKSWVVGHLVQ KFTDVSRTYRPKDIIQDMRKEYGV LSYD+   SSE
Subjt:  STLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSSE

Query:  KALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG----------------------
        +ALRLIRG+PASSYGLLP YGEALKIMNP                          GFL CI+PVLVVDGAH+K KF G                      
Subjt:  KALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFH
                               MT+N AESVNALF HARKLPVTALLDHIRG LQT FYDRRTLA+SRSTTLS YAEN  AEYSD+ARRHVV NIDQFH
Subjt:  -----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFH

Query:  FQVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP
         QVRDGNLD IVD N+ TC CREFDYFKIPCSHAIA A MRNINPY+LCDEAYTTNSW++AY EPIFP+GHVSTWN+SP
Subjt:  FQVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia] (e value: 7.3e-99; identity: 68.54%)
Query:  DVLGVWNDNKDESGESYDPLAESEEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDN
        DV GVWNDN+DESGESYDPLA SEEGHSQAEYGNEEHDDALDDELE DVEQVHTEIRRDE+AVR PGCNGLT   NDEKLQLIVQSS TNDVNEGDVFDN
Subjt:  DVLGVWNDNKDESGESYDPLAESEEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDN

Query:  KKKLSMKMHLVAMRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKD
        KK+LS+KMHLVAMRKNFQFKVKKST +L IL                                          RQAKSWVVGHLVQEKFTDVSRTYRPKD
Subjt:  KKKLSMKMHLVAMRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKD

Query:  IIQDMRKEYGVKLSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFR
        IIQDMRKEYGV LSYDR   SSE+ALRLIRG+PASSYGLLPAYG+ALKIMNP                          GFL CI+PVLVVDGAH+K KFR
Subjt:  IIQDMRKEYGVKLSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFR

Query:  GM
        G+
Subjt:  GM

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia] (e value: 9.8e-112; identity: 53.49%)
Query:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK
        MRKNFQFKVKKSTLEL IL                                          RQ KSWVVGHLVQEKFTDVSRTYRPKDIIQDMR EYGV 
Subjt:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK

Query:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRGMTTNIA-----
        LSYDR   SSE+ALRLIRG+PASSYGLLPAYGEALKIMNP                          GFLGC +PVLVVDGAH+K KFRG+  + +     
Subjt:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRGMTTNIA-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------ESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVRDGNLDEIVDLNAMTCCC
                 S+NALF H RKLPVTALLDHIRG LQT FYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVRDGNLD IVDLNAM C C
Subjt:  --------ESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVRDGNLDEIVDLNAMTCCC

Query:  REFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP
        REFDYFKIPCSHAIA ATMRNINPYSLCDEAYTTNSWILAY EPIFPVGH+STWN+SP
Subjt:  REFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BRM2 protein FAR1-RELATED SEQUENCE 4-like (e value: 4.0e-95; identity: 41.55%)
Query:  EGHSQAEYGNEEHDDALDDELEPDVEQVH-TEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK
        EG  + EYGNE   D LD + E +   +H T      D V     N +T  +  ++LQ +VQS+RT+DV E DVFD+KK+L MKMHL+A+RKNFQF+VKK
Subjt:  EGHSQAEYGNEEHDDALDDELEPDVEQVH-TEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK

Query:  STLELDIL-------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSS
        ST +L ++                                           RQAKSWVVGHLVQ KFTDVSRTYRPKDI+QD+R+EYGV +SYD+   SS
Subjt:  STLELDIL-------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSS

Query:  EKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG---------------------
        E+ALRLIRG+PASSY LLPAYGEA+KIMNP                          GF+ CI+PVLV+DGAH+K K+ G                     
Subjt:  EKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG---------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHF
                              MTTNIAESVNALF HARKL +TALLDHIRG LQ  FY+ RTLA+SR +TLSDYAE M AE  D+ARRH+V NIDQF+F
Subjt:  ----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHF

Query:  QVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNS
        +V DGNL+  VDL + TC CREFDYFK+PCSHAIA A+ R+INPY+LCDEAYT NS
Subjt:  QVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNS

A0A6J1CVL4 uncharacterized protein LOC111015181 (e value: 3.9e-114; identity: 52.84%)
Query:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK
        M+KNFQFKVKKSTLEL IL                                          RQAKSWVVGHLVQEKFTDVSRTYRPK+IIQDMRKEYGV 
Subjt:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK

Query:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG-----------
        LSYDR   SSE+ALRLIRG+PASSYGLLPAYGEALKIMNP                          GFLGCI+PVLVVDGAH+K KFRG           
Subjt:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG-----------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVR
                           MTTNIAESVN LF HARKLPVTALLDHIRG LQ  FYDRRTLAASRSTTLSDYAENMYAEYS+S RRHVVDNIDQFHFQV+
Subjt:  -------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVR

Query:  DGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP
        D NLD IVDLNAMTC CREFDYFKIPCSHAIA ATMRNINPYSLCDEAYTTNSWILAY EPIFPVGHVSTWN+SP
Subjt:  DGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP

A0A6J1DJT1 uncharacterized protein LOC111020715 (e value: 1.3e-133; identity: 50.78%)
Query:  EEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK
        EEG  +AE+ N+++DDALD+E EPDVEQVH EI RDE AV+  GC+GLT   N E LQLIVQSS TNDV EG+VFD KK+LS++MHLV MR NFQFKVKK
Subjt:  EEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKK

Query:  STLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSSE
        ST EL IL                                          RQAKSWVVGHLVQ KFTDVSRTYRPKDIIQDMRKEYGV LSYD+   SSE
Subjt:  STLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSSE

Query:  KALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG----------------------
        +ALRLIRG+PASSYGLLP YGEALKIMNP                          GFL CI+PVLVVDGAH+K KF G                      
Subjt:  KALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRG----------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFH
                               MT+N AESVNALF HARKLPVTALLDHIRG LQT FYDRRTLA+SRSTTLS YAEN  AEYSD+ARRHVV NIDQFH
Subjt:  -----------------------MTTNIAESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFH

Query:  FQVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP
         QVRDGNLD IVD N+ TC CREFDYFKIPCSHAIA A MRNINPY+LCDEAYTTNSW++AY EPIFP+GHVSTWN+SP
Subjt:  FQVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP

A0A6J1DTG5 uncharacterized protein LOC111023843 (e value: 3.5e-99; identity: 68.54%)
Query:  DVLGVWNDNKDESGESYDPLAESEEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDN
        DV GVWNDN+DESGESYDPLA SEEGHSQAEYGNEEHDDALDDELE DVEQVHTEIRRDE+AVR PGCNGLT   NDEKLQLIVQSS TNDVNEGDVFDN
Subjt:  DVLGVWNDNKDESGESYDPLAESEEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDN

Query:  KKKLSMKMHLVAMRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKD
        KK+LS+KMHLVAMRKNFQFKVKKST +L IL                                          RQAKSWVVGHLVQEKFTDVSRTYRPKD
Subjt:  KKKLSMKMHLVAMRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKD

Query:  IIQDMRKEYGVKLSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFR
        IIQDMRKEYGV LSYDR   SSE+ALRLIRG+PASSYGLLPAYG+ALKIMNP                          GFL CI+PVLVVDGAH+K KFR
Subjt:  IIQDMRKEYGVKLSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFR

Query:  GM
        G+
Subjt:  GM

A0A6J1DYC4 uncharacterized protein LOC111025678 (e value: 4.7e-112; identity: 53.49%)
Query:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK
        MRKNFQFKVKKSTLEL IL                                          RQ KSWVVGHLVQEKFTDVSRTYRPKDIIQDMR EYGV 
Subjt:  MRKNFQFKVKKSTLELDIL------------------------------------------RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVK

Query:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRGMTTNIA-----
        LSYDR   SSE+ALRLIRG+PASSYGLLPAYGEALKIMNP                          GFLGC +PVLVVDGAH+K KFRG+  + +     
Subjt:  LSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNP--------------------------GFLGCIKPVLVVDGAHIKEKFRGMTTNIA-----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  --------ESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVRDGNLDEIVDLNAMTCCC
                 S+NALF H RKLPVTALLDHIRG LQT FYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVRDGNLD IVDLNAM C C
Subjt:  --------ESVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVRDGNLDEIVDLNAMTCCC

Query:  REFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP
        REFDYFKIPCSHAIA ATMRNINPYSLCDEAYTTNSWILAY EPIFPVGH+STWN+SP
Subjt:  REFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G49920.1 MuDR family transposase (e value: 6.3e-08; identity: 41.54%)
Query:  IVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTW
        IV LN  TC C EF   K PC HA+AV     INP    D+ YT   +   Y     PV  +S W
Subjt:  IVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTW

AT1G64255.1 MuDR family transposase (e value: 7.7e-06; identity: 32.95%)
Query:  HVVDNIDQFHFQVRDGNLDE---IVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTW
        ++V  +D   FQV    LD+   IV L+  +C C +F  +K PC HA+AV      NP    D+ YT       Y      V  +S W
Subjt:  HVVDNIDQFHFQVRDGNLDE---IVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTW

AT1G64260.1 MuDR family transposase (e value: 4.7e-11; identity: 34.74%)
Query:  EYSDSARRHVVDNIDQFHFQVRDGNLDE--IVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTW
        E+   +  +V+  +++  F+V + +  E  IV LN  TC CR+F  +K PC HA+AV     INP    DE YT   +   Y     PV  V+ W
Subjt:  EYSDSARRHVVDNIDQFHFQVRDGNLDE--IVDLNAMTCCCREFDYFKIPCSHAIAVATMRNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTW


Sequences
CDS sequence
ATGGGCACCAAGGTTCAACCTGACTGGGGTCCGACCTGCTCGGAACCCGACAGGTCCACTCTAGTGTTCAGGTCGGAATCGGAGACCGGGACCATCGCGCCTTTGAGGAT
GCCTCATGTTTTCATAACATTCAGTGGAGAATGGAATGATAGTGAAAAAGATTATGTCGGCGGTGGTATGAAGAGACCCACATATCCTATACCTTCTTTTCCTTCCTCAT
CATCGAACCCCTCTTCTTCCCGACAGCCACACCCCTCCTACGGGCATATAGATGTGCTGGGAGTATGGAATGATAACAAAGATGAAAGTGGTGAATCATATGACCCGTTG
GCAGAGTCTGAAGAAGGACACTCTCAAGCAGAATATGGGAATGAAGAGCATGACGATGCGCTTGATGATGAGCTTGAGCCTGATGTGGAACAGGTGCACACTGAGATTCG
CAGGGATGAAGATGCGGTCCGGCCACCGGGATGTAATGGTCTCACCGAACACGCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTAGGACAAATGATGTTAATG
AGGGCGATGTATTTGATAATAAGAAGAAGTTGAGTATGAAAATGCATTTAGTTGCAATGCGGAAGAATTTTCAGTTTAAAGTAAAAAAGTCGACGCTGGAGCTAGATATA
CTGCGGCAAGCAAAAAGTTGGGTGGTAGGACATCTTGTGCAGGAGAAGTTCACAGACGTCTCCCGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAGGAGTA
TGGTGTCAAATTAAGTTATGATAGAGAATCGCATTCTAGTGAAAAAGCACTCCGGCTTATTAGAGGTAATCCAGCATCGTCATATGGTCTACTTCCAGCTTATGGTGAAG
CTTTGAAAATCATGAACCCAGGTTTTCTGGGTTGTATTAAACCAGTGTTGGTTGTTGACGGGGCCCACATAAAGGAGAAGTTCAGAGGGATGACTACAAATATTGCAGAG
TCTGTAAATGCCCTCTTCACGCACGCCCGTAAGTTGCCGGTTACCGCCTTACTTGACCACATTAGAGGTAACTTACAGACTCGGTTCTATGATCGACGGACGCTTGCAGC
TTCCCGATCAACCACATTGTCCGACTACGCAGAAAACATGTATGCCGAATATTCAGATAGTGCGCGGAGACACGTTGTAGACAATATTGACCAGTTCCATTTCCAGGTAC
GGGATGGCAACCTTGACGAGATTGTTGATTTGAACGCTATGACGTGTTGTTGTCGGGAGTTTGATTACTTTAAGATTCCATGCTCTCATGCTATTGCGGTGGCGACGATG
CGAAATATAAATCCATACAGTCTGTGCGACGAGGCATATACGACGAACTCCTGGATATTGGCTTATGTAGAACCCATATTTCCAGTCGGACACGTCTCGACATGGAACAA
TTCCCCATAG
mRNA sequence
ATGGGCACCAAGGTTCAACCTGACTGGGGTCCGACCTGCTCGGAACCCGACAGGTCCACTCTAGTGTTCAGGTCGGAATCGGAGACCGGGACCATCGCGCCTTTGAGGAT
GCCTCATGTTTTCATAACATTCAGTGGAGAATGGAATGATAGTGAAAAAGATTATGTCGGCGGTGGTATGAAGAGACCCACATATCCTATACCTTCTTTTCCTTCCTCAT
CATCGAACCCCTCTTCTTCCCGACAGCCACACCCCTCCTACGGGCATATAGATGTGCTGGGAGTATGGAATGATAACAAAGATGAAAGTGGTGAATCATATGACCCGTTG
GCAGAGTCTGAAGAAGGACACTCTCAAGCAGAATATGGGAATGAAGAGCATGACGATGCGCTTGATGATGAGCTTGAGCCTGATGTGGAACAGGTGCACACTGAGATTCG
CAGGGATGAAGATGCGGTCCGGCCACCGGGATGTAATGGTCTCACCGAACACGCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTAGGACAAATGATGTTAATG
AGGGCGATGTATTTGATAATAAGAAGAAGTTGAGTATGAAAATGCATTTAGTTGCAATGCGGAAGAATTTTCAGTTTAAAGTAAAAAAGTCGACGCTGGAGCTAGATATA
CTGCGGCAAGCAAAAAGTTGGGTGGTAGGACATCTTGTGCAGGAGAAGTTCACAGACGTCTCCCGCACGTATAGACCGAAGGACATTATACAAGACATGAGGAAGGAGTA
TGGTGTCAAATTAAGTTATGATAGAGAATCGCATTCTAGTGAAAAAGCACTCCGGCTTATTAGAGGTAATCCAGCATCGTCATATGGTCTACTTCCAGCTTATGGTGAAG
CTTTGAAAATCATGAACCCAGGTTTTCTGGGTTGTATTAAACCAGTGTTGGTTGTTGACGGGGCCCACATAAAGGAGAAGTTCAGAGGGATGACTACAAATATTGCAGAG
TCTGTAAATGCCCTCTTCACGCACGCCCGTAAGTTGCCGGTTACCGCCTTACTTGACCACATTAGAGGTAACTTACAGACTCGGTTCTATGATCGACGGACGCTTGCAGC
TTCCCGATCAACCACATTGTCCGACTACGCAGAAAACATGTATGCCGAATATTCAGATAGTGCGCGGAGACACGTTGTAGACAATATTGACCAGTTCCATTTCCAGGTAC
GGGATGGCAACCTTGACGAGATTGTTGATTTGAACGCTATGACGTGTTGTTGTCGGGAGTTTGATTACTTTAAGATTCCATGCTCTCATGCTATTGCGGTGGCGACGATG
CGAAATATAAATCCATACAGTCTGTGCGACGAGGCATATACGACGAACTCCTGGATATTGGCTTATGTAGAACCCATATTTCCAGTCGGACACGTCTCGACATGGAACAA
TTCCCCATAG
Protein sequence
MGTKVQPDWGPTCSEPDRSTLVFRSESETGTIAPLRMPHVFITFSGEWNDSEKDYVGGGMKRPTYPIPSFPSSSSNPSSSRQPHPSYGHIDVLGVWNDNKDESGESYDPL
AESEEGHSQAEYGNEEHDDALDDELEPDVEQVHTEIRRDEDAVRPPGCNGLTEHANDEKLQLIVQSSRTNDVNEGDVFDNKKKLSMKMHLVAMRKNFQFKVKKSTLELDI
LRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVKLSYDRESHSSEKALRLIRGNPASSYGLLPAYGEALKIMNPGFLGCIKPVLVVDGAHIKEKFRGMTTNIAE
SVNALFTHARKLPVTALLDHIRGNLQTRFYDRRTLAASRSTTLSDYAENMYAEYSDSARRHVVDNIDQFHFQVRDGNLDEIVDLNAMTCCCREFDYFKIPCSHAIAVATM
RNINPYSLCDEAYTTNSWILAYVEPIFPVGHVSTWNNSP