; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1973 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1973
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionUPF0400 protein C337.03
Genome locationMC04:26498821..26507366
RNA-Seq ExpressionMC04g1973
SyntenyMC04g1973
Gene Ontology termsGO:0031124 - mRNA 3'-end processing (biological process)
GO:0016591 - RNA polymerase II, holoenzyme (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
InterPro domainsIPR006569 - CID domain
IPR008942 - ENTH/VHS


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138638.1 UPF0400 protein C337.03 [Cucumis sativus]0.089.2Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+F
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQS S SLDKIV+GYQVVYG EIDED VLSKCRNSISYLEKLDKEIG DVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
        GQY GSS+++DL+ HH+ILR CIEQLT IE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM EE SKDAQTS+APHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LV REREQSAPVMYAAS+PFP+KPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP K+   DYPSEKRPKLENDQ PY LPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT
        PPVSSFPHPESLQHN+SSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+P  YSYS+TQ L PLAMPGYPN G PVTGMSPFTIPT
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT

Query:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ
        NSYQNFQA DG+FY+QSSSMPMAP+SRQ
Subjt:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ

XP_008441251.1 PREDICTED: UPF0400 protein C337.03 isoform X1 [Cucumis melo]0.089.96Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+F
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQS S SLDKIV+GYQVVYG EIDED VLSKCRNSISYLEKLDKEIGADVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
        GQY GSS+++DL+ HHTILR CIEQLT IE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM EE SKDAQTS+APHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LVPREREQSAPVMYAAS+PFP+KPGP+EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP K+   DYPSEKRPKLENDQ PY LPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT
        PPVSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+P  YSYS+TQ L PLAMPGYPN G PVTGMSPFTIPT
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT

Query:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ
        NSYQNFQA DGNFYNQSSSMPMAP+SRQ
Subjt:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ

XP_008441252.1 PREDICTED: UPF0400 protein C337.03 isoform X2 [Cucumis melo]0.088.83Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+F
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQS S SLDKIV+GYQVVYG EIDED VLSKCRNSISYLEKLDKEIGADVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
                 +DL+ HHTILR CIEQLT IE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM EE SKDAQTS+APHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LVPREREQSAPVMYAAS+PFP+KPGP+EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP K+   DYPSEKRPKLENDQ PY LPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT
        PPVSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+P  YSYS+TQ L PLAMPGYPN G PVTGMSPFTIPT
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT

Query:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ
        NSYQNFQA DGNFYNQSSSMPMAP+SRQ
Subjt:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ

XP_022152479.1 UPF0400 protein C337.03 [Momordica charantia]0.0100Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
        GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS
        PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS

Query:  YQNFQASDGNFYNQSSSMPMAPMSRQ
        YQNFQASDGNFYNQSSSMPMAPMSRQ
Subjt:  YQNFQASDGNFYNQSSSMPMAPMSRQ

XP_038884747.1 UPF0400 protein C337.03 [Benincasa hispida]0.089.92Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+F
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETG+RNGK FS KLKQS S SLDKIV+GYQVVYG EIDED VLSKCRNSISYLEKLDKEIG DVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
        GQY GSS+++DL+ HHTILR CIEQLT+IE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM EEASKDAQTSIAPHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LVPR+REQSAPVMYA SLPFP KPGP+EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE   DYPSEKRPKLENDQ PYTLPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS
        PPVSSFPHPESLQ N SSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFP+PQFTQN GSVSS+PYSYS+TQ L PLAMPGYPNVG PVTG+SPFTIPTNS
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS

Query:  YQNFQASDGNFYNQSSSMPMAPMSRQ
        YQ+FQA DGNFYNQSSSMPMAP+SRQ
Subjt:  YQNFQASDGNFYNQSSSMPMAPMSRQ

TrEMBL top hitse value%identityAlignment
A0A0A0LMU3 CID domain-containing protein0.089.2Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+F
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQS S SLDKIV+GYQVVYG EIDED VLSKCRNSISYLEKLDKEIG DVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
        GQY GSS+++DL+ HH+ILR CIEQLT IE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM EE SKDAQTS+APHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LV REREQSAPVMYAAS+PFP+KPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP K+   DYPSEKRPKLENDQ PY LPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT
        PPVSSFPHPESLQHN+SSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+P  YSYS+TQ L PLAMPGYPN G PVTGMSPFTIPT
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT

Query:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ
        NSYQNFQA DG+FY+QSSSMPMAP+SRQ
Subjt:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ

A0A1S3B3N2 UPF0400 protein C337.03 isoform X10.089.96Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+F
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQS S SLDKIV+GYQVVYG EIDED VLSKCRNSISYLEKLDKEIGADVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
        GQY GSS+++DL+ HHTILR CIEQLT IE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM EE SKDAQTS+APHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LVPREREQSAPVMYAAS+PFP+KPGP+EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP K+   DYPSEKRPKLENDQ PY LPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT
        PPVSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+P  YSYS+TQ L PLAMPGYPN G PVTGMSPFTIPT
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT

Query:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ
        NSYQNFQA DGNFYNQSSSMPMAP+SRQ
Subjt:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ

A0A1S3B3S0 UPF0400 protein C337.03 isoform X20.088.83Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGD+F
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQS S SLDKIV+GYQVVYG EIDED VLSKCRNSISYLEKLDKEIGADVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
                 +DL+ HHTILR CIEQLT IE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM EE SKDAQTS+APHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LVPREREQSAPVMYAAS+PFP+KPGP+EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP K+   DYPSEKRPKLENDQ PY LPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT
        PPVSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+P  YSYS+TQ L PLAMPGYPN G PVTGMSPFTIPT
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVP--YSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT

Query:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ
        NSYQNFQA DGNFYNQSSSMPMAP+SRQ
Subjt:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ

A0A6J1DG44 UPF0400 protein C337.030.0100Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
        GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHS

Query:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
        LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR
Subjt:  LVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQR

Query:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS
        PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS
Subjt:  PPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNS

Query:  YQNFQASDGNFYNQSSSMPMAPMSRQ
        YQNFQASDGNFYNQSSSMPMAPMSRQ
Subjt:  YQNFQASDGNFYNQSSSMPMAPMSRQ

A0A6J1FFD5 UPF0400 protein C337.03-like0.087.88Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIE+GDDF
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGK +ETGNRNGK FS KLKQS S SLDKIV GYQVVY +E+DED VLSKCRNSISYLEKLDKEIGADVNS
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNS

Query:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMA-EEASKDAQTSIAPH
        GQY G+S +EDL+ HH ILR CIEQLT IE+SRA+LVSHLREALQEQEFKL++VRNQLQ SHSQSEQTQNL RQFLNGENV+ M  EEASKDAQTSIAPH
Subjt:  GQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMA-EEASKDAQTSIAPH

Query:  SLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQ
        +LVPRER+QSAPVMYA SLPFPAKPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE   DYPSEKR KLENDQ PYTLPPNPQ
Subjt:  SLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQ

Query:  RPPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNA-GSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT
        RPPV  FPHPESLQHNASSTSQQYTP+D PPPPSSSPPP+PPLPPV Q PLPQFTQNA GSVSS+ YSYS+TQ LQPLA PGYPN+G PVTGMSP TIPT
Subjt:  RPPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNA-GSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPT

Query:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ
        NSYQ+FQ SDGNFYN SSSMPMAP+SRQ
Subjt:  NSYQNFQASDGNFYNQSSSMPMAPMSRQ

SwissProt top hitse value%identityAlignment
Q0P5J9 Regulation of nuclear pre-mRNA domain-containing protein 1A1.5e-2029.09Show/hide
Query:  FNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNA
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   A   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    D+  +  
Subjt:  FNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNA

Query:  ALRLIGIWEERKVFGSRG-QSLKEEIMGKHVETGNRNGKQFSVKLKQSTST---------SLDKIVAGYQVVYGTEIDEDVVLSKCRN------SISYLE
          R++ IWEER V+ +   + LK+ + G   +   R  +Q  V   ++ S+         +LD +V   Q +      +  V  +  +       +S L+
Subjt:  ALRLIGIWEERKVFGSRG-QSLKEEIMGKHVETGNRNGKQFSVKLKQSTST---------SLDKIVAGYQVVYGTEIDEDVVLSKCRN------SISYLE

Query:  KL-DKEIGADVNSGQYHGSSVSED----LQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQL
        K+ DKE G  +       S + ED    L  ++  L   I+    +    A+ +   +EAL E+E KL+E + +L
Subjt:  KL-DKEIGADVNSGQYHGSSVSED----LQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQL

Q8VDS4 Regulation of nuclear pre-mRNA domain-containing protein 1A9.1e-2129.45Show/hide
Query:  FNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNA
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   A   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    D+  +  
Subjt:  FNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNA

Query:  ALRLIGIWEERKVFGSRG-QSLKEEIMGKHVETGNRNGKQFSVKLKQSTST---------SLDKIVAGYQVVYGTEIDEDVVLSKCRN------SISYLE
          R++ IWEER V+ +   + LK  + G   +   R  +Q  V   ++ S+         +LD +V   Q +      +  V  +  +       +S LE
Subjt:  ALRLIGIWEERKVFGSRG-QSLKEEIMGKHVETGNRNGKQFSVKLKQSTST---------SLDKIVAGYQVVYGTEIDEDVVLSKCRN------SISYLE

Query:  KL-DKEIGADVNSGQYHGSSVSED----LQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQL
        K+ DKE G  +       S + ED    L  ++  L   I+    +    A+ +   +EAL E+E KL+E + +L
Subjt:  KL-DKEIGADVNSGQYHGSSVSED----LQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQL

Q96P16 Regulation of nuclear pre-mRNA domain-containing protein 1A1.5e-2029.09Show/hide
Query:  FNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNA
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   A   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    D+  +  
Subjt:  FNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNA

Query:  ALRLIGIWEERKVFGSRG-QSLKEEIMGKHVETGNRNGKQFSVKLKQSTST---------SLDKIVAGYQVVYGTEIDEDVVLSKCRN------SISYLE
          R++ IWEER V+ +   + LK+ + G   +   R  +Q  V   ++ S+         +LD +V   Q +      +  V  +  +       +S L+
Subjt:  ALRLIGIWEERKVFGSRG-QSLKEEIMGKHVETGNRNGKQFSVKLKQSTST---------SLDKIVAGYQVVYGTEIDEDVVLSKCRN------SISYLE

Query:  KL-DKEIGADVNSGQYHGSSVSED----LQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQL
        K+ DKE G  +       S + ED    L  ++  L   I+    +    A+ +   +EAL E+E KL+E + +L
Subjt:  KL-DKEIGADVNSGQYHGSSVSED----LQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQL

Q9CSU0 Regulation of nuclear pre-mRNA domain-containing protein 1B3.7e-2243.1Show/hide
Query:  TFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRN
        +F+   L  KL+ L+NSQ S++TLS W I H   A  +V  W ++   A   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    D+  + 
Subjt:  TFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRN

Query:  AALRLIGIWEERKVFG
           RL+ IW+ER V+G
Subjt:  AALRLIGIWEERKVFG

Q9NQG5 Regulation of nuclear pre-mRNA domain-containing protein 1B3.7e-2243.1Show/hide
Query:  TFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRN
        +F+   L  KL+ L+NSQ S++TLS W I H   A  +V  W ++   A   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    D+  + 
Subjt:  TFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRN

Query:  AALRLIGIWEERKVFG
           RL+ IW+ER V+G
Subjt:  AALRLIGIWEERKVFG

Arabidopsis top hitse value%identityAlignment
AT3G26990.1 ENTH/VHS family protein3.9e-13654Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        MG +FNA ILV+KLA+LNNSQASIETLSHWCIFHMNKAK VVETW +QFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRD+IENGDDF
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGK----QFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGA
        GR +A RL+ IWEERKVFGSRGQ LKEE++G+  E G RNG     + SV  +Q   ++L+K+V+  +V++G +IDED ++ K  N+  YLEK  +E+  
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVETGNRNGK----QFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGA

Query:  DVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFL-NGENVQPMA--EEASKD--
        D++SG   G +V ++LQ  H ILR CIEQL A+E+SR +L+SHLREALQEQE KL++VRN LQ +  QS++T +L RQ L +G + QP A  EE SK+  
Subjt:  DVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFL-NGENVQPMA--EEASKD--

Query:  --AQTSIAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGN----PIKE--SSSDYPSEKRP
          + T+ AP S    + EQSAPVM+A++      P  + EDPRK+AAAAV AKLTASTSS +MLSYVLSSLASEG+IGN     + E  SS D+P EKRP
Subjt:  --AQTSIAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGN----PIKE--SSSDYPSEKRP

Query:  KLENDQPPYTLPPNPQRPPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSL------TQPLQPLA
        KL+N    Y  P                  H  ++T+   TP  P PP    PPP    P  +Q PL    Q  G V+  P++Y++      TQ  Q   
Subjt:  KLENDQPPYTLPPNPQRPPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVVQFPLPQFTQNAGSVSSVPYSYSL------TQPLQPLA

Query:  MPGYPNVGTPVTGMSPFTIPT-NSYQNFQASDGNFYNQSSSMPMAPMSRQ
         P  P     +T +S  + P+ NSYQ FQ  DG FY  +SS+P+ P++RQ
Subjt:  MPGYPNVGTPVTGMSPFTIPT-NSYQNFQASDGNFYNQSSSMPMAPMSRQ

AT5G10060.1 ENTH/VHS family protein2.1e-6533.59Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        M   F+  IL+DKLA+LN+SQ SIETLSHWCIF+ +KA+ +V TW+KQFH    +Q++  LYLANDILQNS+R+G+EFV EFW VLP AL+D++  GDD 
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHV--------------ETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISY
        G++A  R+I IWEER+VFGSR +SLK+ ++G+ V              ++  R  K    KL  S   + +KI + Y +V     +E+  ++KC++++  
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHV--------------ETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISY

Query:  LEKLDKEIGADVNSGQYH--GSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPM
        + K++K++    ++ + +    S++++L+    +LR CIE+L +++ SR++LV+ L++AL+EQE +LD ++ Q+Q +  Q+E+ QN+ ++         +
Subjt:  LEKLDKEIGADVNSGQYH--GSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPM

Query:  AEEASKDAQTSIAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPK
         +E     QT+ A       +  +S                       K   A++AA LTASTSS  ++  VLSS A+E    + + +S S         
Subjt:  AEEASKDAQTSIAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPK

Query:  LENDQPPYTLPPNPQRPPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPP----------MPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQ
                T+P +      +SFP   + Q+   +T  QY     PPPP    PP          +P +PP +  P P           +P S S  Q  Q
Subjt:  LENDQPPYTLPPNPQRPPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPP----------MPPLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQ

Query:  --PLAMPGYPNVGTP
              PG    G P
Subjt:  --PLAMPGYPNVGTP

AT5G65180.1 ENTH/VHS family protein1.7e-6234.65Show/hide
Query:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF
        M   F+  IL+D LA+LN++Q SI+TLS WCI H ++A+ VV TW+KQFH     Q++  LYLANDILQNS+R+G+EFV EFWKVLP AL+D++  GDD+
Subjt:  MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVE----------TGNRNGKQ--FSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLE
        G+    RL+ IWEER+VFGSR +SLK+ ++ +              G+++ K+   S K K S+    +KIV+ + +V     +E+  ++KC++++  + 
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHVE----------TGNRNGKQ--FSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLE

Query:  KLDKEI-GADVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEE
        K++K++  A   +      S++++L+    ILR  +E+L ++E SR +LV+HLREAL+EQE +L+ +++Q+Q +  Q+E+ QN+ ++ LN E   P+   
Subjt:  KLDKEI-GADVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEE

Query:  ASKDAQTS-IAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLE
             Q++ I P S+                                   AA+A  LT+ST+S  ++  VLSS A+E    + + +S++           
Subjt:  ASKDAQTS-IAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLE

Query:  NDQPPYTLPPNPQRPPVSSFPHPESLQ---HNASSTSQQYTPTDPPPPPSSSPPPM
        +D   + +PPNPQ+  +   P+P + Q   +   +         PPPPP + PP M
Subjt:  NDQPPYTLPPNPQRPPVSSFPHPESLQ---HNASSTSQQYTPTDPPPPPSSSPPPM

AT5G65180.2 ENTH/VHS family protein5.3e-1627.33Show/hide
Query:  SVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEI-GADVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREA
        S K K S+    +KIV+ + +V     +E+  ++KC++++  + K++K++  A   +      S++++L+    ILR  +E+L ++E SR +LV+HLREA
Subjt:  SVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEI-GADVNSGQYHGSSVSEDLQRHHTILRGCIEQLTAIESSRANLVSHLREA

Query:  LQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTS-IAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAK
        L+EQE +L+ +++Q+Q +  Q+E+ QN+ ++ LN E   P+        Q++ I P S+                                   AA+A  
Subjt:  LQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTS-IAPHSLVPREREQSAPVMYAASLPFPAKPGPNEEDPRKSAAAAVAAK

Query:  LTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQRPPVSSFPHPESLQ---HNASSTSQQYTPTDPPPPPSSSPPPM
        LT+ST+S  ++  VLSS A+E    + + +S++           +D   + +PPNPQ+  +   P+P + Q   +   +         PPPPP + PP M
Subjt:  LTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQRPPVSSFPHPESLQ---HNASSTSQQYTPTDPPPPPSSSPPPM


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTACATTCAATGCACACATTTTGGTGGACAAGCTAGCCAGGCTTAATAACTCACAGGCGAGCATAGAGACTTTATCCCATTGGTGTATATTTCACATGAACAA
AGCCAAGCAAGTTGTAGAAACATGGGATAAGCAGTTTCATTGTGCTCCACGTGAACAAAGATTAGCCTATCTATATCTTGCAAATGACATTTTGCAAAACAGTAGGCGAA
AAGGCTCAGAGTTTGTTGGTGAGTTTTGGAAAGTCCTTCCAGATGCACTTCGTGATGTAATTGAGAATGGGGATGATTTTGGACGAAATGCGGCCCTACGACTGATTGGA
ATTTGGGAAGAGAGGAAAGTTTTTGGATCTCGTGGGCAAAGTCTTAAGGAAGAGATAATGGGAAAGCACGTGGAAACTGGTAATCGGAATGGGAAGCAATTCAGCGTTAA
ACTGAAACAATCTACCAGCACATCATTGGATAAAATAGTTGCTGGTTACCAAGTTGTTTATGGAACTGAGATTGATGAAGATGTAGTATTGAGCAAATGCAGGAATTCTA
TTAGCTATCTTGAGAAACTGGACAAAGAAATTGGTGCTGATGTTAATTCAGGGCAATACCATGGATCTTCAGTGTCAGAGGATCTGCAGAGACATCATACCATTTTGAGG
GGCTGCATCGAACAATTAACAGCAATTGAATCATCAAGGGCAAATCTCGTGTCTCATCTGAGAGAGGCTCTTCAAGAACAGGAATTCAAATTGGATGAAGTCCGAAACCA
ACTTCAGGCTTCCCATTCCCAGTCGGAACAAACTCAGAATCTCAGCCGTCAGTTCTTAAATGGTGAAAATGTGCAACCCATGGCTGAGGAAGCCTCAAAAGATGCTCAGA
CCTCAATAGCGCCACATAGCCTTGTACCAAGGGAGAGAGAACAATCAGCACCAGTTATGTATGCAGCCTCCTTACCTTTTCCTGCAAAACCTGGACCTAATGAGGAAGAT
CCTCGCAAGTCTGCTGCTGCTGCAGTGGCAGCGAAGCTAACGGCATCAACGTCCTCGGTTCAGATGCTCTCTTATGTCCTCTCTTCCCTCGCATCAGAGGGTGTGATTGG
AAATCCAATTAAAGAGTCATCCAGTGATTATCCCTCTGAGAAGAGGCCCAAACTTGAAAATGACCAGCCACCCTATACATTGCCTCCGAATCCTCAGCGACCTCCGGTTT
CTTCCTTCCCACACCCTGAGTCCCTCCAACATAATGCCTCGTCCACCAGTCAACAATACACTCCGACTGACCCTCCACCTCCCCCATCATCATCTCCACCGCCCATGCCT
CCATTACCTCCTGTAGTGCAGTTCCCTCTGCCCCAGTTCACACAGAATGCAGGATCAGTAAGTAGCGTACCTTACAGTTACAGTCTGACACAACCACTGCAACCATTAGC
GATGCCGGGCTATCCAAATGTAGGTACCCCAGTGACCGGGATGTCTCCTTTTACGATACCAACAAATTCTTACCAGAATTTTCAGGCTTCAGATGGTAATTTCTATAATC
AGTCATCATCCATGCCGATGGCACCCATGTCTAGGCAATAG
mRNA sequenceShow/hide mRNA sequence
GGTGATTATAAAGGAGGCGCGAGAGGGAAGCTGGCGCGGCCAATTAGCGGCGCCAAATCCAGAGCTCGCGCGATTGCAATCGCGAAGCCAATTGAGTTGCAGTTCCACAG
CCACAGCCAGCTGCTGGTGAGTGGTGCGGCCCAACACAGGGAAACCGCAGCGACACAAAATATTCTTCTAATTCCAAGCACACCGCTATTCAATTCTTATCTTGCGACGC
GACCCCCTCGGGCCTCGACCAAAATCCCTGCTTCCTGGGTTTTCCTTTTCCGCTCCCTTCTTCTCTATTTGGGTCTTCTCCATACTCTCACACTCTTCCGCCATCTCCCT
CTATTCGCCCTTCGCTCTCTGTTTTTGCGAAACCCTTTTCCGGACCCTGTAAATTGTAATTGCCCCCCACTCTTCCATGGAGGAAGCCCTCCAACGGCCTAATTTCGCCC
GCTTCTAGTTGCCTTCTCTTTCACCCACATTCCCTTTTGCAGTTATCAAATTCAATGGGTGGTACATTCAATGCACACATTTTGGTGGACAAGCTAGCCAGGCTTAATAA
CTCACAGGCGAGCATAGAGACTTTATCCCATTGGTGTATATTTCACATGAACAAAGCCAAGCAAGTTGTAGAAACATGGGATAAGCAGTTTCATTGTGCTCCACGTGAAC
AAAGATTAGCCTATCTATATCTTGCAAATGACATTTTGCAAAACAGTAGGCGAAAAGGCTCAGAGTTTGTTGGTGAGTTTTGGAAAGTCCTTCCAGATGCACTTCGTGAT
GTAATTGAGAATGGGGATGATTTTGGACGAAATGCGGCCCTACGACTGATTGGAATTTGGGAAGAGAGGAAAGTTTTTGGATCTCGTGGGCAAAGTCTTAAGGAAGAGAT
AATGGGAAAGCACGTGGAAACTGGTAATCGGAATGGGAAGCAATTCAGCGTTAAACTGAAACAATCTACCAGCACATCATTGGATAAAATAGTTGCTGGTTACCAAGTTG
TTTATGGAACTGAGATTGATGAAGATGTAGTATTGAGCAAATGCAGGAATTCTATTAGCTATCTTGAGAAACTGGACAAAGAAATTGGTGCTGATGTTAATTCAGGGCAA
TACCATGGATCTTCAGTGTCAGAGGATCTGCAGAGACATCATACCATTTTGAGGGGCTGCATCGAACAATTAACAGCAATTGAATCATCAAGGGCAAATCTCGTGTCTCA
TCTGAGAGAGGCTCTTCAAGAACAGGAATTCAAATTGGATGAAGTCCGAAACCAACTTCAGGCTTCCCATTCCCAGTCGGAACAAACTCAGAATCTCAGCCGTCAGTTCT
TAAATGGTGAAAATGTGCAACCCATGGCTGAGGAAGCCTCAAAAGATGCTCAGACCTCAATAGCGCCACATAGCCTTGTACCAAGGGAGAGAGAACAATCAGCACCAGTT
ATGTATGCAGCCTCCTTACCTTTTCCTGCAAAACCTGGACCTAATGAGGAAGATCCTCGCAAGTCTGCTGCTGCTGCAGTGGCAGCGAAGCTAACGGCATCAACGTCCTC
GGTTCAGATGCTCTCTTATGTCCTCTCTTCCCTCGCATCAGAGGGTGTGATTGGAAATCCAATTAAAGAGTCATCCAGTGATTATCCCTCTGAGAAGAGGCCCAAACTTG
AAAATGACCAGCCACCCTATACATTGCCTCCGAATCCTCAGCGACCTCCGGTTTCTTCCTTCCCACACCCTGAGTCCCTCCAACATAATGCCTCGTCCACCAGTCAACAA
TACACTCCGACTGACCCTCCACCTCCCCCATCATCATCTCCACCGCCCATGCCTCCATTACCTCCTGTAGTGCAGTTCCCTCTGCCCCAGTTCACACAGAATGCAGGATC
AGTAAGTAGCGTACCTTACAGTTACAGTCTGACACAACCACTGCAACCATTAGCGATGCCGGGCTATCCAAATGTAGGTACCCCAGTGACCGGGATGTCTCCTTTTACGA
TACCAACAAATTCTTACCAGAATTTTCAGGCTTCAGATGGTAATTTCTATAATCAGTCATCATCCATGCCGATGGCACCCATGTCTAGGCAATAGAGGTATGTAATACTA
GTGCTACTTGAGCCGTGTTGTGCTAACAAAATCTTTCAGAAAGTGGTGCATCCCATTAAGGCGCTACGAGGAAATCTTGCATCCTTGTATACCCCTTAAGCTCTGACGAC
TTAACTTAATGGATTTTGTACACTTTAGTAATAGTTTGTAATTTATCTCTGAACGATATTTCATCTAGTTGAATGGAGCAGTATGATGATTATGTTGACTTGATAGAATT
TTGCACATAGTTAGAGTTTGTAATTTATCTTCATTTTTCAATGTAATGCTTATACGCTGGAACAGAACTTCATCTGGTTGAATGATGTATGATTGTTTATATTGGTTTTG
GTTGGTTTTGTTGTAGCTAACAATTGAGTGTTAGAAATAAGAGATCCTATGTTAGACTTGAG
Protein sequenceShow/hide protein sequence
MGGTFNAHILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCAPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDDFGRNAALRLIG
IWEERKVFGSRGQSLKEEIMGKHVETGNRNGKQFSVKLKQSTSTSLDKIVAGYQVVYGTEIDEDVVLSKCRNSISYLEKLDKEIGADVNSGQYHGSSVSEDLQRHHTILR
GCIEQLTAIESSRANLVSHLREALQEQEFKLDEVRNQLQASHSQSEQTQNLSRQFLNGENVQPMAEEASKDAQTSIAPHSLVPREREQSAPVMYAASLPFPAKPGPNEED
PRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPIKESSSDYPSEKRPKLENDQPPYTLPPNPQRPPVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMP
PLPPVVQFPLPQFTQNAGSVSSVPYSYSLTQPLQPLAMPGYPNVGTPVTGMSPFTIPTNSYQNFQASDGNFYNQSSSMPMAPMSRQ