CuGenDBv2

Moc02g19210 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc02g19210
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: protein FAR1-RELATED SEQUENCE 4-like
Genome location: chr2:14233111..14241037
RNA-Seq Expression: Moc02g19210
Synteny: Moc02g19210
Gene Ontology terms:
    GO:0006313 - transposition, DNA-mediated (biological process)
    GO:0003677 - DNA binding (molecular function)
    GO:0004803 - transposase activity (molecular function)
    GO:0008270 - zinc ion binding (molecular function)
InterPro domains:
    IPR006564 - Zinc finger, PMZ-type
    IPR007527 - Zinc finger, SWIM-type


Homology
GenBank top hits | e value | % identity | Alignment
XP_022145820.1 uncharacterized protein LOC111015181 [Momordica charantia] | 5.3e-167 | 66.12
Query:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD
        CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPK+IIQDMRKEYGVNLSYDRA RSSEE LRLIRGD
Subjt:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD

Query:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------
        PASSYGLLPAYGEALKIMNP                                                                                
Subjt:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------

Query:  --------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKM
                                        VNLIAK KNDAKAV ELFLKAAKAYRESYFN IWAQLRAYPGVREYLDDIGKERWARCFQTQLRYT+M
Subjt:  --------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKM

Query:  TTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREF
        TTNIAESVN LFRHARKLPVTALLDHIR                                 +S RRHVV+NIDQFHFQV D NLDGIVDLNAMTC+CREF
Subjt:  TTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREF

Query:  DYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE
        DYFKIPCSHAIAAATMRNINPYSLCDEAYT NSWILAYAE IFPV +VSTWNSSPEFVNIPVEPPKTVPRVGRRKT RIPSTGE
Subjt:  DYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia] | 3.8e-165 | 54.44
Query:  EEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD---------------------------
        EEG  +AE+ N+++D ALD+E E DVE VH EI RDE AV   GC+ LTG  N E LQLIVQSSGTNDV EG+                           
Subjt:  EEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD---------------------------

Query:  ---------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE
                 CV   CTWRLRATKL++C LFKIKKY + HTC GG LK DHRQAKSWVVGHLVQ KFTDVSRTYRPKDIIQDMRKEYGVNLSYD+AWRSSE
Subjt:  ---------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE

Query:  EELRLIRGDPASSYGLLPAYGEALKIMNP-----------------------------------------------------------------------
        E LRLIRGDPASSYGLLP YGEALKIMNP                                                                       
Subjt:  EELRLIRGDPASSYGLLPAYGEALKIMNP-----------------------------------------------------------------------

Query:  --------------------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVR
                                                                +NL+AK K DAKA++ELFLKAAKAYRESYFN IWAQL AYPGVR
Subjt:  --------------------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVR

Query:  EYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFH
        EYLDDIGKERWARCFQT+LRYT+MT+N AESVNALFRHARKLPVTALLDHIR                                 D+ARRHVV NIDQFH
Subjt:  EYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFH

Query:  FQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKT
         QV DGNLDGIVD N+ TC+CREFDYFKIPCSHAIA A MRNINPY+LCDEAYT NSW++AYAE IFP+ +VSTWNSSP+FV+ PVE P  VPRVGRR+T
Subjt:  FQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKT

Query:  ARIPSTGE
         RIPSTGE
Subjt:  ARIPSTGE

XP_022155970.1 uncharacterized protein LOC111022954 [Momordica charantia] | 1.2e-134 | 45.8
Query:  ITFSKSAQSINVFDLSLSPHYPKSLSPQFHLRSIPGLELNVNNPGASHIGTVSNISRTCLGHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDES
        +T      S +   ++  P + +   P      IP    + +NP +S      ++    LGHD+ GL PL SDVVPCNLGDDRVC W++PG+WNDN+DES
Subjt:  ITFSKSAQSINVFDLSLSPHYPKSLSPQFHLRSIPGLELNVNNPGASHIGTVSNISRTCLGHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDES

Query:  GESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD-----------------
         ESYD L +SEEG  +AE+ N+++D A D++ E DVE V  EIRRDE  VL  GC+ L G PNDEKLQLIVQSSGTNDV EG                  
Subjt:  GESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD-----------------

Query:  -------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNL
                           CV + CTWRLRA KL +C LFKIKKY + HTC G  LK DHRQAK+WVV HLVQ KFTDVS TYRPKDIIQDMRKEYGVNL
Subjt:  -------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNL

Query:  SYDRAWRSSEEELRLIRGDPASSYGLLPAYG---------------------------------------------EALKIMN-----------------
        SYD+AW+S+EE LRLIRGDP +SYGLLPAYG                                              AL ++N                 
Subjt:  SYDRAWRSSEEELRLIRGDPASSYGLLPAYG---------------------------------------------EALKIMN-----------------

Query:  ---------------PVNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHAR
                         NL+AK K DAKA++ELFLKAAKAY+ESYFN IWAQL AYPG+REYLDDIGKERW RCFQT+LRYT+MT+N AESVNALFRHA 
Subjt:  ---------------PVNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHAR

Query:  KLPVTALLDHIRDSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVS
         LPVTALLDHIR                                                                              E IFP+ +VS
Subjt:  KLPVTALLDHIRDSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVS

Query:  TWNSSPEFVNIPVEPPKTVPRVGRRKTARIP
        TW SSP+FV+IP E P  VPRVG+R++ RIP
Subjt:  TWNSSPEFVNIPVEPPKTVPRVGRRKTARIP

XP_022157017.1 uncharacterized protein LOC111023843 [Momordica charantia] | 8.6e-125 | 76.07
Query:  GHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTG
        GHDVEGL PLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYD LA SEEGHSQAEYGNEEHD ALDDELE DVE VHTEIRRDEEAV   GCN LTG
Subjt:  GHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTG

Query:  HPNDEKLQLIVQSSGTNDVNEGD------------------------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDH
         PNDEKLQLIVQSSGTNDVNEGD                                    CVHADCTWRLRATKLKECTLFKIKKYCA HTCYGGALKHDH
Subjt:  HPNDEKLQLIVQSSGTNDVNEGD------------------------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDH

Query:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGDPASSYGLLPAYGEALKIMNPVNLIAKCKNDAKAVDELFLKA
        RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEE LRLIRGDPASSYGLLPAYG+ALKIMNP  +        K    +F+  
Subjt:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGDPASSYGLLPAYGEALKIMNPVNLIAKCKNDAKAVDELFLKA

Query:  AKAYR
         ++ R
Subjt:  AKAYR

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia] | 9.4e-140 | 57.52
Query:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD
        CVHADCTWRLRATKLKECTLFKI KYCAAHTCYGGALKHDHRQ KSWVVGHLVQEKFTDVSRTYRPKDIIQDMR EYGVNLSYDRAWRSSEE LRLIRGD
Subjt:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD

Query:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------
        PASSYGLLPAYGEALKIMNP                                                                                
Subjt:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------

Query:  -----------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKE
                                                       VNLIAK K+DAKAV+ELFLKAAKAYRESYFN IWAQLRAYP            
Subjt:  -----------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKE

Query:  RWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLD
                             S+NALFRH RKLPVTALLDHIR                                 DSARRHVV+NIDQFHFQV DGNLD
Subjt:  RWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLD

Query:  GIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE
        GIVDLNAM CSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYT NSWILAYAE IFPV ++STWNSSPEFVNIPVEPPKTVPRVGRRKT RIPSTGE
Subjt:  GIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE

TrEMBL top hits | e value | % identity | Alignment
A0A6J1CVL4 uncharacterized protein LOC111015181 | 2.6e-167 | 66.12
Query:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD
        CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPK+IIQDMRKEYGVNLSYDRA RSSEE LRLIRGD
Subjt:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD

Query:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------
        PASSYGLLPAYGEALKIMNP                                                                                
Subjt:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------

Query:  --------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKM
                                        VNLIAK KNDAKAV ELFLKAAKAYRESYFN IWAQLRAYPGVREYLDDIGKERWARCFQTQLRYT+M
Subjt:  --------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKM

Query:  TTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREF
        TTNIAESVN LFRHARKLPVTALLDHIR                                 +S RRHVV+NIDQFHFQV D NLDGIVDLNAMTC+CREF
Subjt:  TTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREF

Query:  DYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE
        DYFKIPCSHAIAAATMRNINPYSLCDEAYT NSWILAYAE IFPV +VSTWNSSPEFVNIPVEPPKTVPRVGRRKT RIPSTGE
Subjt:  DYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE

A0A6J1DJT1 uncharacterized protein LOC111020715 | 1.8e-165 | 54.44
Query:  EEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD---------------------------
        EEG  +AE+ N+++D ALD+E E DVE VH EI RDE AV   GC+ LTG  N E LQLIVQSSGTNDV EG+                           
Subjt:  EEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD---------------------------

Query:  ---------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE
                 CV   CTWRLRATKL++C LFKIKKY + HTC GG LK DHRQAKSWVVGHLVQ KFTDVSRTYRPKDIIQDMRKEYGVNLSYD+AWRSSE
Subjt:  ---------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSE

Query:  EELRLIRGDPASSYGLLPAYGEALKIMNP-----------------------------------------------------------------------
        E LRLIRGDPASSYGLLP YGEALKIMNP                                                                       
Subjt:  EELRLIRGDPASSYGLLPAYGEALKIMNP-----------------------------------------------------------------------

Query:  --------------------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVR
                                                                +NL+AK K DAKA++ELFLKAAKAYRESYFN IWAQL AYPGVR
Subjt:  --------------------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVR

Query:  EYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFH
        EYLDDIGKERWARCFQT+LRYT+MT+N AESVNALFRHARKLPVTALLDHIR                                 D+ARRHVV NIDQFH
Subjt:  EYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFH

Query:  FQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKT
         QV DGNLDGIVD N+ TC+CREFDYFKIPCSHAIA A MRNINPY+LCDEAYT NSW++AYAE IFP+ +VSTWNSSP+FV+ PVE P  VPRVGRR+T
Subjt:  FQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKT

Query:  ARIPSTGE
         RIPSTGE
Subjt:  ARIPSTGE

A0A6J1DP00 uncharacterized protein LOC111022954 | 5.8e-135 | 45.8
Query:  ITFSKSAQSINVFDLSLSPHYPKSLSPQFHLRSIPGLELNVNNPGASHIGTVSNISRTCLGHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDES
        +T      S +   ++  P + +   P      IP    + +NP +S      ++    LGHD+ GL PL SDVVPCNLGDDRVC W++PG+WNDN+DES
Subjt:  ITFSKSAQSINVFDLSLSPHYPKSLSPQFHLRSIPGLELNVNNPGASHIGTVSNISRTCLGHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDES

Query:  GESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD-----------------
         ESYD L +SEEG  +AE+ N+++D A D++ E DVE V  EIRRDE  VL  GC+ L G PNDEKLQLIVQSSGTNDV EG                  
Subjt:  GESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGD-----------------

Query:  -------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNL
                           CV + CTWRLRA KL +C LFKIKKY + HTC G  LK DHRQAK+WVV HLVQ KFTDVS TYRPKDIIQDMRKEYGVNL
Subjt:  -------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNL

Query:  SYDRAWRSSEEELRLIRGDPASSYGLLPAYG---------------------------------------------EALKIMN-----------------
        SYD+AW+S+EE LRLIRGDP +SYGLLPAYG                                              AL ++N                 
Subjt:  SYDRAWRSSEEELRLIRGDPASSYGLLPAYG---------------------------------------------EALKIMN-----------------

Query:  ---------------PVNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHAR
                         NL+AK K DAKA++ELFLKAAKAY+ESYFN IWAQL AYPG+REYLDDIGKERW RCFQT+LRYT+MT+N AESVNALFRHA 
Subjt:  ---------------PVNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHAR

Query:  KLPVTALLDHIRDSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVS
         LPVTALLDHIR                                                                              E IFP+ +VS
Subjt:  KLPVTALLDHIRDSARRHVVNNIDQFHFQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVS

Query:  TWNSSPEFVNIPVEPPKTVPRVGRRKTARIP
        TW SSP+FV+IP E P  VPRVG+R++ RIP
Subjt:  TWNSSPEFVNIPVEPPKTVPRVGRRKTARIP

A0A6J1DTG5 uncharacterized protein LOC111023843 | 4.2e-125 | 76.07
Query:  GHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTG
        GHDVEGL PLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYD LA SEEGHSQAEYGNEEHD ALDDELE DVE VHTEIRRDEEAV   GCN LTG
Subjt:  GHDVEGLAPLGSDVVPCNLGDDRVCDWDVPGVWNDNEDESGESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTG

Query:  HPNDEKLQLIVQSSGTNDVNEGD------------------------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDH
         PNDEKLQLIVQSSGTNDVNEGD                                    CVHADCTWRLRATKLKECTLFKIKKYCA HTCYGGALKHDH
Subjt:  HPNDEKLQLIVQSSGTNDVNEGD------------------------------------CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDH

Query:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGDPASSYGLLPAYGEALKIMNPVNLIAKCKNDAKAVDELFLKA
        RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEE LRLIRGDPASSYGLLPAYG+ALKIMNP  +        K    +F+  
Subjt:  RQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGDPASSYGLLPAYGEALKIMNPVNLIAKCKNDAKAVDELFLKA

Query:  AKAYR
         ++ R
Subjt:  AKAYR

A0A6J1DYC4 uncharacterized protein LOC111025678 | 4.6e-140 | 57.52
Query:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD
        CVHADCTWRLRATKLKECTLFKI KYCAAHTCYGGALKHDHRQ KSWVVGHLVQEKFTDVSRTYRPKDIIQDMR EYGVNLSYDRAWRSSEE LRLIRGD
Subjt:  CVHADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGD

Query:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------
        PASSYGLLPAYGEALKIMNP                                                                                
Subjt:  PASSYGLLPAYGEALKIMNP--------------------------------------------------------------------------------

Query:  -----------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKE
                                                       VNLIAK K+DAKAV+ELFLKAAKAYRESYFN IWAQLRAYP            
Subjt:  -----------------------------------------------VNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKE

Query:  RWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLD
                             S+NALFRH RKLPVTALLDHIR                                 DSARRHVV+NIDQFHFQV DGNLD
Subjt:  RWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIR---------------------------------DSARRHVVNNIDQFHFQVWDGNLD

Query:  GIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE
        GIVDLNAM CSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYT NSWILAYAE IFPV ++STWNSSPEFVNIPVEPPKTVPRVGRRKT RIPSTGE
Subjt:  GIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTARIPSTGE

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT1G49920.1 MuDR family transposase | 3.7e-09 | 38.37
Query:  GIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIP------VEPP
        GIV LN  TC+C EF   K PC HA+A      INP    D+ YT+  +   Y+    PV  +S W   PE   +P      +EPP
Subjt:  GIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIP------VEPP

AT1G64255.1 MuDR family transposase | 9.2e-08 | 30.33
Query:  PVTALLDHIRDS--ARRHVVNNIDQFHFQVWDGNLDG--IVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRY
        PV   L+  R +     ++V  +D   FQV      G  IV L+  +C+C +F  +K PC HA+A       NP    D+ YT+      YA +   V  
Subjt:  PVTALLDHIRDS--ARRHVVNNIDQFHFQVWDGNLDG--IVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRY

Query:  VSTWNSSPEFVNIPVEPPKTVP
        +S W   PE   +P   P  +P
Subjt:  VSTWNSSPEFVNIPVEPPKTVP

AT1G64260.1 MuDR family transposase | 6.8e-11 | 32.14
Query:  PVTALLDHIRDSARRHVVNNIDQFHFQVWDGN--LDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVS
        P    L+     +  +V+  +++  F+V + +   + IV LN  TC+CR+F  +K PC HA+A      INP    DE YT+  +   YA    PV  V+
Subjt:  PVTALLDHIRDSARRHVVNNIDQFHFQVWDGN--LDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVS

Query:  TWNSSPEFVNIP
         W   PE   +P
Subjt:  TWNSSPEFVNIP


Sequences
CDS sequence
ATGATCGATCAAAGGTGCTATTACCAAGGTGTCATTATGAGGTTGGCGAACGCCTTCAAGATCGAGGTTGGAGAATCAGAAGGGACATTTCGACGACTTGGTGTGCAAGG
TGTATACCTCGTGCCTGCCTTCCTTGACCAATGTCTTTCTCTTCTTGTTGTGCAATTGGCCCCGCTAGACTCTCCGAAAATGGTATTTATCATGGCGGGTCGTTCTTGTC
GACAAGGGGCCAATTCCTTGATCGGAATGTCGTTCCTTGAATTGACCTTGAGCCGAGCGAATTGTCGGCGTCTACGTACTTTTGAGCTTGGCGCGTTAGACCACATTGAA
GGTCGCTATGAGCTTCTTGCGAGGGATAATATGAGGTTCTTATCCTTCAGGCCAACAATGAACCCTGACTTAGGCGTATCATCAGAGCAGTGGGCAACCATGATCCGGTT
GGCCTTGATCTTTCTATCTCAACTCTGGTTGGCCTTGATCTTTCTACCTCAGCGACTCCTTCTCTCTTTGCCAAATGAATTTGAGCTTCCAAGACCTCTTAGGGCCGCCA
AGGTAAAGCTTATCATCGATAGGTCGGTTACAGGCCCTTTGCTTCAAGTCGAGTCTTATAATAAGGTACGCTCTACCTCTTTTGCTTGTCTTATTGCTTGCTTGAACGTA
CGTGCCATTGATTCCATTTGGGCGTTTATTACTGCCATCTTAGCATGTAGAGTAGCTACGGTCACTTGTCGAGCTATCATAGACTGTGTTCCTTGTGGGCAGATCACGTT
TTCAAAATCCGCTCAAAGCATCAACGTGTTTGATCTTTCTTTGTCCCCACACTATCCAAAATCATTGTCCCCACAATTTCACTTGAGATCTATACCTGGTCTAGAATTGA
ATGTAAACAATCCCGGTGCATCACATATCGGGACTGTCTCAAATATATCTCGGACCTGTCTTGGTCATGATGTAGAGGGTTTAGCACCATTAGGGTCAGATGTTGTTCCA
TGTAATCTGGGAGATGATAGGGTGTGTGATTGGGATGTGCCGGGAGTATGGAATGATAACGAAGATGAAAGTGGTGAATCATATGACCTATTGGCAGAGTCTGAAGAAGG
ACACTCTCAAGCAGAATATGGGAACGAAGAGCATGACGGTGCGCTTGATGATGAGCTTGAGCATGATGTGGAACATGTGCACACTGAGATTCGCAGGGATGAAGAAGCGG
TCCTGCCATCGGGATGTAATAGTCTCACCGGACACCCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATTGCGTTCAT
GCTGATTGCACGTGGAGACTTCGAGCTACCAAGCTAAAGGAATGCACTTTGTTCAAGATAAAAAAATATTGTGCTGCCCATACGTGCTATGGTGGGGCTTTAAAACATGA
TCATAGGCAAGCCAAAAGTTGGGTGGTAGGACATCTTGTGCAGGAGAAGTTCACAGACGTCTCCCGAACGTATAGACCGAAGGACATTATACAAGACATGAGGAAAGAGT
ATGGTGTCAATTTAAGTTATGATAGAGCATGGCGTTCTAGTGAAGAAGAACTCCGGCTTATTAGAGGTGATCCAGCATCGTCATATGGTCTACTTCCAGCTTATGGTGAA
GCTTTGAAAATCATGAACCCGGTTAACTTGATAGCAAAATGTAAAAACGATGCGAAGGCAGTCGATGAACTATTTTTAAAGGCTGCAAAGGCGTATCGCGAGTCATATTT
CAACTTGATCTGGGCCCAACTTCGTGCATACCCCGGTGTACGGGAATATCTAGACGATATTGGGAAGGAGCGTTGGGCTCGTTGTTTCCAAACTCAATTGAGGTACACAA
AGATGACTACAAATATCGCAGAGTCTGTAAATGCCCTCTTCAGGCACGCCCGTAAGTTGCCGGTTACCGCTTTACTTGACCACATTAGAGATAGCGCGCGGAGACACGTT
GTAAACAATATTGACCAGTTCCATTTCCAGGTATGGGATGGCAACCTTGACGGGATTGTTGATTTGAACGCTATGACGTGTAGTTGTCGGGAGTTTGATTACTTTAAGAT
TCCGTGCTCTCATGCTATTGCGGCGGCGACGATGCGAAATATAAATCCATACAGTCTGTGCGACGAGGCATATACGATGAACTCCTGGATATTGGCTTATGCAGAGCTCA
TATTTCCAGTCAGATACGTCTCGACATGGAACAGTTCGCCAGAGTTTGTCAACATACCGGTGGAACCACCGAAGACTGTTCCAAGAGTTGGGAGGAGGAAGACGGCTAGG
ATTCCTTCCACGGGCGAGGGGTATTCGATTCTCGTGGACAGGACCCCTGGTTTCACGGCCGTGATTACGCTTGTTTACATGCGTAAAAAGAAGGACCATTTGGGAGTGCA
AATTAATCAAAAGAAGCAAAAAGACGAAAAAAGCATCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGC
GTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGATTTAGAGAAGGCCACTGTAAGTCTTAAC
AAGTCAATACTGAAAAGGTTGCTTATTTGCATTGTCGATTGGTTAAACCGAGGAGAGGAGCGCCTTAGACATAAGGGTTCCGCTTAG
mRNA sequence
ATGATCGATCAAAGGTGCTATTACCAAGGTGTCATTATGAGGTTGGCGAACGCCTTCAAGATCGAGGTTGGAGAATCAGAAGGGACATTTCGACGACTTGGTGTGCAAGG
TGTATACCTCGTGCCTGCCTTCCTTGACCAATGTCTTTCTCTTCTTGTTGTGCAATTGGCCCCGCTAGACTCTCCGAAAATGGTATTTATCATGGCGGGTCGTTCTTGTC
GACAAGGGGCCAATTCCTTGATCGGAATGTCGTTCCTTGAATTGACCTTGAGCCGAGCGAATTGTCGGCGTCTACGTACTTTTGAGCTTGGCGCGTTAGACCACATTGAA
GGTCGCTATGAGCTTCTTGCGAGGGATAATATGAGGTTCTTATCCTTCAGGCCAACAATGAACCCTGACTTAGGCGTATCATCAGAGCAGTGGGCAACCATGATCCGGTT
GGCCTTGATCTTTCTATCTCAACTCTGGTTGGCCTTGATCTTTCTACCTCAGCGACTCCTTCTCTCTTTGCCAAATGAATTTGAGCTTCCAAGACCTCTTAGGGCCGCCA
AGGTAAAGCTTATCATCGATAGGTCGGTTACAGGCCCTTTGCTTCAAGTCGAGTCTTATAATAAGGTACGCTCTACCTCTTTTGCTTGTCTTATTGCTTGCTTGAACGTA
CGTGCCATTGATTCCATTTGGGCGTTTATTACTGCCATCTTAGCATGTAGAGTAGCTACGGTCACTTGTCGAGCTATCATAGACTGTGTTCCTTGTGGGCAGATCACGTT
TTCAAAATCCGCTCAAAGCATCAACGTGTTTGATCTTTCTTTGTCCCCACACTATCCAAAATCATTGTCCCCACAATTTCACTTGAGATCTATACCTGGTCTAGAATTGA
ATGTAAACAATCCCGGTGCATCACATATCGGGACTGTCTCAAATATATCTCGGACCTGTCTTGGTCATGATGTAGAGGGTTTAGCACCATTAGGGTCAGATGTTGTTCCA
TGTAATCTGGGAGATGATAGGGTGTGTGATTGGGATGTGCCGGGAGTATGGAATGATAACGAAGATGAAAGTGGTGAATCATATGACCTATTGGCAGAGTCTGAAGAAGG
ACACTCTCAAGCAGAATATGGGAACGAAGAGCATGACGGTGCGCTTGATGATGAGCTTGAGCATGATGTGGAACATGTGCACACTGAGATTCGCAGGGATGAAGAAGCGG
TCCTGCCATCGGGATGTAATAGTCTCACCGGACACCCTAATGATGAGAAATTGCAACTCATAGTACAGTCTTCTGGGACAAATGATGTTAATGAGGGCGATTGCGTTCAT
GCTGATTGCACGTGGAGACTTCGAGCTACCAAGCTAAAGGAATGCACTTTGTTCAAGATAAAAAAATATTGTGCTGCCCATACGTGCTATGGTGGGGCTTTAAAACATGA
TCATAGGCAAGCCAAAAGTTGGGTGGTAGGACATCTTGTGCAGGAGAAGTTCACAGACGTCTCCCGAACGTATAGACCGAAGGACATTATACAAGACATGAGGAAAGAGT
ATGGTGTCAATTTAAGTTATGATAGAGCATGGCGTTCTAGTGAAGAAGAACTCCGGCTTATTAGAGGTGATCCAGCATCGTCATATGGTCTACTTCCAGCTTATGGTGAA
GCTTTGAAAATCATGAACCCGGTTAACTTGATAGCAAAATGTAAAAACGATGCGAAGGCAGTCGATGAACTATTTTTAAAGGCTGCAAAGGCGTATCGCGAGTCATATTT
CAACTTGATCTGGGCCCAACTTCGTGCATACCCCGGTGTACGGGAATATCTAGACGATATTGGGAAGGAGCGTTGGGCTCGTTGTTTCCAAACTCAATTGAGGTACACAA
AGATGACTACAAATATCGCAGAGTCTGTAAATGCCCTCTTCAGGCACGCCCGTAAGTTGCCGGTTACCGCTTTACTTGACCACATTAGAGATAGCGCGCGGAGACACGTT
GTAAACAATATTGACCAGTTCCATTTCCAGGTATGGGATGGCAACCTTGACGGGATTGTTGATTTGAACGCTATGACGTGTAGTTGTCGGGAGTTTGATTACTTTAAGAT
TCCGTGCTCTCATGCTATTGCGGCGGCGACGATGCGAAATATAAATCCATACAGTCTGTGCGACGAGGCATATACGATGAACTCCTGGATATTGGCTTATGCAGAGCTCA
TATTTCCAGTCAGATACGTCTCGACATGGAACAGTTCGCCAGAGTTTGTCAACATACCGGTGGAACCACCGAAGACTGTTCCAAGAGTTGGGAGGAGGAAGACGGCTAGG
ATTCCTTCCACGGGCGAGGGGTATTCGATTCTCGTGGACAGGACCCCTGGTTTCACGGCCGTGATTACGCTTGTTTACATGCGTAAAAAGAAGGACCATTTGGGAGTGCA
AATTAATCAAAAGAAGCAAAAAGACGAAAAAAGCATCATGGGAGGCGCCAGGCGCCTGGGAAGCCTGCAGAAAAACTGGTTTTCTTCCAACTTTGCCCTTAATGAAACGC
GTCTTCCAATGCGTTTTGGTGGTTCCAACCGATGCATACGTGTAGAAGAAGTGTTCCACTATCAGTTTGAGCACGATTTGGATTTAGAGAAGGCCACTGTAAGTCTTAAC
AAGTCAATACTGAAAAGGTTGCTTATTTGCATTGTCGATTGGTTAAACCGAGGAGAGGAGCGCCTTAGACATAAGGGTTCCGCTTAG
Protein sequence
MIDQRCYYQGVIMRLANAFKIEVGESEGTFRRLGVQGVYLVPAFLDQCLSLLVVQLAPLDSPKMVFIMAGRSCRQGANSLIGMSFLELTLSRANCRRLRTFELGALDHIE
GRYELLARDNMRFLSFRPTMNPDLGVSSEQWATMIRLALIFLSQLWLALIFLPQRLLLSLPNEFELPRPLRAAKVKLIIDRSVTGPLLQVESYNKVRSTSFACLIACLNV
RAIDSIWAFITAILACRVATVTCRAIIDCVPCGQITFSKSAQSINVFDLSLSPHYPKSLSPQFHLRSIPGLELNVNNPGASHIGTVSNISRTCLGHDVEGLAPLGSDVVP
CNLGDDRVCDWDVPGVWNDNEDESGESYDLLAESEEGHSQAEYGNEEHDGALDDELEHDVEHVHTEIRRDEEAVLPSGCNSLTGHPNDEKLQLIVQSSGTNDVNEGDCVH
ADCTWRLRATKLKECTLFKIKKYCAAHTCYGGALKHDHRQAKSWVVGHLVQEKFTDVSRTYRPKDIIQDMRKEYGVNLSYDRAWRSSEEELRLIRGDPASSYGLLPAYGE
ALKIMNPVNLIAKCKNDAKAVDELFLKAAKAYRESYFNLIWAQLRAYPGVREYLDDIGKERWARCFQTQLRYTKMTTNIAESVNALFRHARKLPVTALLDHIRDSARRHV
VNNIDQFHFQVWDGNLDGIVDLNAMTCSCREFDYFKIPCSHAIAAATMRNINPYSLCDEAYTMNSWILAYAELIFPVRYVSTWNSSPEFVNIPVEPPKTVPRVGRRKTAR
IPSTGEGYSILVDRTPGFTAVITLVYMRKKKDHLGVQINQKKQKDEKSIMGGARRLGSLQKNWFSSNFALNETRLPMRFGGSNRCIRVEEVFHYQFEHDLDLEKATVSLN
KSILKRLLICIVDWLNRGEERLRHKGSA