CuGenDBv2

Tan0005226 (gene) of Snake gourd v1 genome

Gene ID: Tan0005226
Organism: Trichosanthes anguina (Snake gourd v1)
Description: regulation of nuclear pre-mRNA domain-containing protein 1B-like
Genome location: LG11:1005179..1017527
RNA-Seq Expression: Tan0005226
Synteny: Tan0005226
Gene Ontology terms:
    GO:0031124 - mRNA 3'-end processing (biological process)
    GO:0016591 - RNA polymerase II, holoenzyme (cellular component)
    GO:0000993 - RNA polymerase II complex binding (molecular function)
InterPro domains:
    IPR006569 - CID domain
    IPR008942 - ENTH/VHS
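
The genome location in this record follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the string can be split into its parts and the gene span computed. The names `Location` and `parse_location` are illustrative and not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval; coordinates are assumed 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG11:1005179..1017527")
print(loc.seqid, loc.start, loc.end, loc.length)  # LG11 1005179 1017527 12349
```

Under that assumption the gene spans 12,349 bp on LG11.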


Homology
GenBank top hits (e value, % identity, alignment)
XP_004138638.1 UPF0400 protein C337.03 [Cucumis sativus] (e value: 6.9e-268; % identity: 91.3)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F++KL KQS SVSLDKIVSGYQVV G E+DED VLSKCR+SISYLEKLDKEIG D N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        SGQY GSSIA+DLRGHH+ILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM+EE SKDAQTS+APH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LV REREQSAPVMYA+S+PFP+KPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNK+L GDYPSEKR KLENDQ PY LPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP
        RP VSSFPHPESLQHN+SSTSQQYTP+DPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSY+MTQSLPPLAMPGYPN GAPVTGMSPFTIP
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP

Query:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ
        TNSYQ+FQA DG+FY+QSSSMPM PISRQ
Subjt:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ

XP_008441251.1 PREDICTED: UPF0400 protein C337.03 isoform X1 [Cucumis melo] (e value: 3.0e-271; % identity: 92.25)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F++KL KQS SVSLDKIVSGYQVV G E+DED VLSKCR+SISYLEKLDKEIGAD N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        SGQY GSSIA+DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM+EE SKDAQTS+APH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LVPREREQSAPVMYA+S+PFP+KPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNK+L GDYPSEKR KLENDQ PY LPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP
        RP VSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSY+MTQSLPPLAMPGYPN GAPVTGMSPFTIP
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP

Query:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ
        TNSYQ+FQA DGNFYNQSSSMPM PISRQ
Subjt:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ

XP_008441252.1 PREDICTED: UPF0400 protein C337.03 isoform X2 [Cucumis melo] (e value: 2.7e-264; % identity: 90.74)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F++KL KQS SVSLDKIVSGYQVV G E+DED VLSKCR+SISYLEKLDKEIGAD N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        S         +DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM+EE SKDAQTS+APH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LVPREREQSAPVMYA+S+PFP+KPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNK+L GDYPSEKR KLENDQ PY LPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP
        RP VSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSY+MTQSLPPLAMPGYPN GAPVTGMSPFTIP
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP

Query:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ
        TNSYQ+FQA DGNFYNQSSSMPM PISRQ
Subjt:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ

XP_022152479.1 UPF0400 protein C337.03 [Momordica charantia] (e value: 6.9e-268; % identity: 91.46)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIE+GDDF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGKQFS KL KQS S SLDKIV+GYQVV G+E+DEDVVLSKCR+SISYLEKLDKEIGAD N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        SGQYHGSS++EDL+ HHTILR CIEQLTAIE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM+EEASKDAQTSIAPH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LVPREREQSAPVMYA+SLPFPAKPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE S DYPSEKR KLENDQ PYTLPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIPTN
        RP VSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+PYSY++TQ L PLAMPGYPNVG PVTGMSPFTIPTN
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIPTN

Query:  SYQSFQASDGNFYNQSSSMPMTPISRQ
        SYQ+FQASDGNFYNQSSSMPM P+SRQ
Subjt:  SYQSFQASDGNFYNQSSSMPMTPISRQ

XP_038884747.1 UPF0400 protein C337.03 [Benincasa hispida] (e value: 6.0e-272; % identity: 92.79)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETG+RNGK FS+KL KQS SVSLDKIVSGYQVV G+E+DED VLSKCR+SISYLEKLDKEIG D N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        SGQY GSSIA+DLRGHHTILRDCIEQLT+IETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM+EEASKDAQTSIAPH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LVPR+REQSAPVMYA SLPFP KPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKEL GDYPSEKR KLENDQ PYTLPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIPTN
        RP VSSFPHPESLQ N SSTSQQYTP+DPPPPPSSSPPPMPPLPPVAQFP+PQFTQN GSVSSIPYSY+MTQSLPPLAMPGYPNVGAPVTG+SPFTIPTN
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIPTN

Query:  SYQSFQASDGNFYNQSSSMPMTPISRQ
        SYQSFQA DGNFYNQSSSMPM PISRQ
Subjt:  SYQSFQASDGNFYNQSSSMPMTPISRQ
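
In the alignment blocks above, the unlabeled middle row follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts identities for a single block from its Query row and middle row. The function name `identity_stats` is illustrative. The % identity values reported on this page are presumably computed over the whole alignment, so a per-block figure will not necessarily match them.

```python
def identity_stats(query: str, midline: str) -> tuple[int, int, float]:
    """Return (identities, columns, percent identity) for one alignment block."""
    # Scraped pages often lose trailing spaces on the middle row; pad them back.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("middle row is longer than the query row")
    # A column is an identity when the middle row repeats the query residue.
    identities = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    columns = len(query)
    return identities, columns, 100.0 * identities / columns


# Final block of the XP_004138638.1 alignment shown above.
ident, cols, pct = identity_stats(
    "TNSYQSFQASDGNFYNQSSSMPMTPISRQ",
    "TNSYQ+FQA DG+FY+QSSSMPM PISRQ",
)
print(f"{ident} of {cols} columns identical ({pct:.1f}%)")
# 24 of 29 columns identical (82.8%)
```

Summing identities and columns over every block of a hit, rather than averaging per-block percentages, gives the alignment-wide figure.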

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LMU3 CID domain-containing protein (e value: 3.3e-268; % identity: 91.3)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F++KL KQS SVSLDKIVSGYQVV G E+DED VLSKCR+SISYLEKLDKEIG D N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        SGQY GSSIA+DLRGHH+ILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM+EE SKDAQTS+APH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LV REREQSAPVMYA+S+PFP+KPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNK+L GDYPSEKR KLENDQ PY LPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP
        RP VSSFPHPESLQHN+SSTSQQYTP+DPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSY+MTQSLPPLAMPGYPN GAPVTGMSPFTIP
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP

Query:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ
        TNSYQ+FQA DG+FY+QSSSMPM PISRQ
Subjt:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ

A0A1S3B3N2 UPF0400 protein C337.03 isoform X1 (e value: 1.4e-271; % identity: 92.25)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F++KL KQS SVSLDKIVSGYQVV G E+DED VLSKCR+SISYLEKLDKEIGAD N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        SGQY GSSIA+DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM+EE SKDAQTS+APH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LVPREREQSAPVMYA+S+PFP+KPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNK+L GDYPSEKR KLENDQ PY LPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP
        RP VSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSY+MTQSLPPLAMPGYPN GAPVTGMSPFTIP
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP

Query:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ
        TNSYQ+FQA DGNFYNQSSSMPM PISRQ
Subjt:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ

A0A1S3B3S0 UPF0400 protein C337.03 isoform X2 (e value: 1.3e-264; % identity: 90.74)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI +GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F++KL KQS SVSLDKIVSGYQVV G E+DED VLSKCR+SISYLEKLDKEIGAD N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        S         +DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM+EE SKDAQTS+APH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LVPREREQSAPVMYA+S+PFP+KPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNK+L GDYPSEKR KLENDQ PY LPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP
        RP VSSFPHPESLQHN SSTSQQYTP+DPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPYSY+MTQSLPPLAMPGYPN GAPVTGMSPFTIP
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP

Query:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ
        TNSYQ+FQA DGNFYNQSSSMPM PISRQ
Subjt:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ

A0A6J1DG44 UPF0400 protein C337.03 (e value: 3.3e-268; % identity: 91.46)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIE+GDDF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGKQFS KL KQS S SLDKIV+GYQVV G+E+DEDVVLSKCR+SISYLEKLDKEIGAD N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH
        SGQYHGSS++EDL+ HHTILR CIEQLTAIE+SRA+LVSHLREALQEQEFKL++VRNQLQASHSQSEQTQNL RQFLNGENVQPM+EEASKDAQTSIAPH
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPH

Query:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ
         LVPREREQSAPVMYA+SLPFPAKPGP EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE S DYPSEKR KLENDQ PYTLPPNPQ
Subjt:  GLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQ

Query:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIPTN
        RP VSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPV QFPLPQFTQNAGSVSS+PYSY++TQ L PLAMPGYPNVG PVTGMSPFTIPTN
Subjt:  RPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIPTN

Query:  SYQSFQASDGNFYNQSSSMPMTPISRQ
        SYQ+FQASDGNFYNQSSSMPM P+SRQ
Subjt:  SYQSFQASDGNFYNQSSSMPMTPISRQ

A0A6J1FFD5 UPF0400 protein C337.03-like (e value: 1.0e-261; % identity: 90.93)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIE GDDF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN
        GRNAALRLIGIWEERKVFGSRGQSLKEEIMGK +ETGNRNGK FS+KL KQS S+SLDKIV GYQVV  SEVDED VLSKCR+SISYLEKLDKEIGAD N
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGN

Query:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMS-EEASKDAQTSIAP
        SGQY G+S AEDLRGHH ILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQ SHSQSEQTQNLCRQFLNGENV+ M+ EEASKDAQTSIAP
Subjt:  SGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMS-EEASKDAQTSIAP

Query:  HGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNP
        H LVPRER+QSAPVMYA SLPFPAKPGP+EEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKEL GDYPSEKR KLENDQSPYTLPPNP
Subjt:  HGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNP

Query:  QRPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQN-AGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP
        QRP V  FPHPESLQHNASSTSQQYTP+D PPPPSSSPPP+PPLPPV Q PLPQFTQN AGSVSSI YSY+MTQSL PLA PGYPN+GAPVTGMSP TIP
Subjt:  QRPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQN-AGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIP

Query:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ
        TNSYQSFQ SDGNFYN SSSMPM PISRQ
Subjt:  TNSYQSFQASDGNFYNQSSSMPMTPISRQ

SwissProt top hits (e value, % identity, alignment)
Q0P5J9 Regulation of nuclear pre-mRNA domain-containing protein 1A (e value: 3.5e-20; % identity: 28.62)
Query:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRNA
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    D+  +  
Subjt:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRNA

Query:  ALRLIGIWEERKVFGSRG-QSLKEEIMG------KHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRS------SISYLEKL
          R++ IWEER V+ +   + LK+ + G      +  E    +  +  + L   S       +V   Q +  +   +  V  +  S       +S L+K+
Subjt:  ALRLIGIWEERKVFGSRG-QSLKEEIMG------KHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRS------SISYLEKL

Query:  -DKEIGADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL
         DKE G   +        +  D  G      D  +QLT +    A  +   +EAL E+E KLE+ + +L
Subjt:  -DKEIGADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL

Q8VDS4 Regulation of nuclear pre-mRNA domain-containing protein 1A (e value: 2.6e-20; % identity: 29)
Query:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRNA
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    D+  +  
Subjt:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRNA

Query:  ALRLIGIWEERKVFGSRG-QSLKEEIMG------KHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRS------SISYLEKL
          R++ IWEER V+ +   + LK  + G      +  E    +  +  + L   S       +V   Q +  +   +  V  +  S       +S LEK+
Subjt:  ALRLIGIWEERKVFGSRG-QSLKEEIMG------KHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRS------SISYLEKL

Query:  -DKEIGADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL
         DKE G   +        +  D  G      D  +QLT +    A  +   +EAL E+E KLE+ + +L
Subjt:  -DKEIGADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL

Q96P16 Regulation of nuclear pre-mRNA domain-containing protein 1A (e value: 3.5e-20; % identity: 28.62)
Query:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRNA
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    D+  +  
Subjt:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRNA

Query:  ALRLIGIWEERKVFGSRG-QSLKEEIMG------KHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRS------SISYLEKL
          R++ IWEER V+ +   + LK+ + G      +  E    +  +  + L   S       +V   Q +  +   +  V  +  S       +S L+K+
Subjt:  ALRLIGIWEERKVFGSRG-QSLKEEIMG------KHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRS------SISYLEKL

Query:  -DKEIGADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL
         DKE G   +        +  D  G      D  +QLT +    A  +   +EAL E+E KLE+ + +L
Subjt:  -DKEIGADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL

Q9CSU0 Regulation of nuclear pre-mRNA domain-containing protein 1B (e value: 6.3e-22; % identity: 42.24)
Query:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRN
        +F+   L  KL+ L+NSQ S++TLS W I H   A  +V  W ++   +   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    D+  + 
Subjt:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRN

Query:  AALRLIGIWEERKVFG
           RL+ IW+ER V+G
Subjt:  AALRLIGIWEERKVFG

Q9NQG5 Regulation of nuclear pre-mRNA domain-containing protein 1B (e value: 6.3e-22; % identity: 42.24)
Query:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRN
        +F+   L  KL+ L+NSQ S++TLS W I H   A  +V  W ++   +   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    D+  + 
Subjt:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRN

Query:  AALRLIGIWEERKVFG
           RL+ IW+ER V+G
Subjt:  AALRLIGIWEERKVFG

Arabidopsis top hits (e value, % identity, alignment)
AT3G26990.1 ENTH/VHS family protein (e value: 6.0e-137; % identity: 53.82)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        MG +FN QILV+KLA+LNNSQASIETLSHWCIFHMNKAK VVETW +QFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRD+IE+GDDF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKL---QKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGA
        GR +A RL+ IWEERKVFGSRGQ LKEE++G+  E G RNG     KL   Q+Q    +L+K+VS  +V+ G ++DED ++ K  ++  YLEK  +E+  
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKL---QKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGA

Query:  DGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFL-NGENVQP------MSEEAS
        D +SG   G ++ ++L+G H ILRDCIEQL A+ETSR SL+SHLREALQEQE KLEQVRN LQ +  QS++T +LCRQ L +G + QP       S+E  
Subjt:  DGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFL-NGENVQP------MSEEAS

Query:  KDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKEL------SGDYPSEKRA
        K + T+ AP      + EQSAPVM+AS+      P    EDPRK+AAAAV AKLTASTSS +MLSYVLSSLASEG+IGN N         S D+P EKR 
Subjt:  KDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKEL------SGDYPSEKRA

Query:  KLENDQSPYTLPPNPQRPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTM------TQSLPPLA
        KL+N    Y  P                  H  ++T+   TP  P PPP       PP     QF  P   Q  G V+  P++YT+      TQ      
Subjt:  KLENDQSPYTLPPNPQRPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYSYTM------TQSLPPLA

Query:  MPGYPNVGAPVTGMSPFTIPT-NSYQSFQASDGNFYNQSSSMPMTPISRQ
         P  P     +T +S  + P+ NSYQ FQ  DG FY  +SS+P+TP++RQ
Subjt:  MPGYPNVGAPVTGMSPFTIPT-NSYQSFQASDGNFYNQSSSMPMTPISRQ

AT5G10060.1 ENTH/VHS family protein (e value: 1.2e-65; % identity: 34.63)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        M   F+ QIL+DKLA+LN+SQ SIETLSHWCIF+ +KA+ +V TW+KQFH +  +Q++  LYLANDILQNS+R+G+EFV EFW VLP AL+D++  GDD 
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMG----------KHMETGNRNGKQFSNKLQKQSVSVS--LDKIVSGYQVVSGSEVDEDVVLSKCRSSISYL
        G++A  R+I IWEER+VFGSR +SLK+ ++G          K    G+++ K+ S   + +  S     +KI S Y +V     +E+  ++KC+S++  +
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMG----------KHMETGNRNGKQFSNKLQKQSVSVS--LDKIVSGYQVVSGSEVDEDVVLSKCRSSISYL

Query:  EKLDKEIGADGNSGQYH--GSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMS
         K++K++    ++ + +    S+A++L     +LR CIE+L +++ SR+SLV+ L++AL+EQE +L+ ++ Q+Q +  Q+E+ QN+ ++         ++
Subjt:  EKLDKEIGADGNSGQYH--GSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMS

Query:  EEASKDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKL
        +E     QT+ A       +  +S                       K   A++AA LTASTSS  ++  VLSS A+E               + K + L
Subjt:  EEASKDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKL

Query:  ENDQSPYTLPPNPQRPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPP----------MPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSL--
           +S       P   + +SFP   + Q+   +T  QY     PPPP    PP          +P +PP    P P           IP S +  QS   
Subjt:  ENDQSPYTLPPNPQRPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPP----------MPPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSL--

Query:  PPLAMPGYPNVGAP
        P    PG    GAP
Subjt:  PPLAMPGYPNVGAP

AT5G65180.1 ENTH/VHS family protein (e value: 8.9e-64; % identity: 36.03)
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF
        M   F+ +IL+D LA+LN++Q SI+TLS WCI H ++A+ VV TW+KQFH +   Q++  LYLANDILQNS+R+G+EFV EFWKVLP AL+D++  GDD+
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDF

Query:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH--------------METGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSIS
        G+    RL+ IWEER+VFGSR +SLK+ ++ +                ++  R+ K    KL    VS   +KIVS + +V     +E+  ++KC+S++ 
Subjt:  GRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH--------------METGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSIS

Query:  YLEKLDKEI-GADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM
         + K++K++  A   +      S+A++L     ILR  +E+L ++E SR SLV+HLREAL+EQE +LE +++Q+Q +  Q+E+ QN+ ++ LN E   P+
Subjt:  YLEKLDKEI-GADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPM

Query:  SEEASKDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAK
        +       Q+                           AK  P       ++ AA+A  LT+ST+S  ++  VLSS A+E        + SG   S     
Subjt:  SEEASKDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAK

Query:  LENDQSPYTLPPNPQRPSVSSFPHPESLQ---HNASSTSQQYTPTDPPPPPSSSPPPM
          +D + + +PPNPQ+  +   P+P + Q   +   +         PPPPP + PP M
Subjt:  LENDQSPYTLPPNPQRPSVSSFPHPESLQ---HNASSTSQQYTPTDPPPPPSSSPPPM

AT5G65180.2 ENTH/VHS family protein (e value: 6.3e-17; % identity: 29.35)
Query:  ETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEI-GADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETS
        ++  R+ K    KL    VS   +KIVS + +V     +E+  ++KC+S++  + K++K++  A   +      S+A++L     ILR  +E+L ++E S
Subjt:  ETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEI-GADGNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETS

Query:  RASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPR
        R SLV+HLREAL+EQE +LE +++Q+Q +  Q+E+ QN+ ++ LN E   P++       Q+                           AK  P      
Subjt:  RASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEEDPR

Query:  KSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQRPSVSSFPHPESLQ---HNASSTSQQYTPTDPP
         ++ AA+A  LT+ST+S  ++  VLSS A+E        + SG   S       +D + + +PPNPQ+  +   P+P + Q   +   +         PP
Subjt:  KSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQRPSVSSFPHPESLQ---HNASSTSQQYTPTDPP

Query:  PPPSSSPPPM
        PPP + PP M
Subjt:  PPPSSSPPPM


Sequences
CDS sequence
ATGGGTGGTACATTCAATCCACAAATTTTGGTGGACAAGCTAGCCAGGCTCAACAATTCACAGGCGAGCATTGAGACTTTATCCCATTGGTGTATATTTCACATGAACAA
AGCCAAGCAAGTTGTAGAAACATGGGACAAGCAATTTCATTGTTCTCCACGCGAGCAGAGATTGGCCTATCTATATCTTGCAAATGACATTTTGCAGAACAGTAGGCGAA
AAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTACTTCCAGATGCACTTCGTGATGTAATTGAGAGTGGGGACGATTTTGGAAGAAATGCTGCCCTACGACTGATTGGC
ATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGGCAGAGTCTTAAGGAAGAGATAATGGGAAAGCATATGGAAACTGGTAATCGAAATGGGAAGCAATTCAGCAATAA
ACTGCAGAAACAATCCGTCAGCGTATCATTGGATAAAATAGTTTCTGGTTACCAAGTTGTTTCTGGAAGTGAGGTAGATGAAGATGTAGTATTAAGCAAATGCAGGAGTT
CTATTAGCTATCTTGAGAAACTGGACAAGGAAATTGGTGCTGATGGAAATTCAGGGCAATACCATGGATCTTCAATTGCAGAGGATCTGCGGGGACATCATACCATTTTG
AGGGACTGCATCGAACAATTAACAGCGATTGAAACATCAAGGGCAAGTCTCGTGTCTCATCTGAGGGAGGCTCTTCAAGAACAGGAATTCAAATTGGAGCAAGTCAGAAA
TCAACTTCAGGCTTCCCATTCCCAGTCGGAACAAACTCAGAATCTCTGCCGTCAGTTTCTAAATGGTGAAAATGTGCAACCTATGAGTGAGGAGGCCTCAAAAGATGCTC
AGACCTCGATAGCACCACACGGCCTTGTACCGAGGGAGAGAGAACAATCAGCACCAGTAATGTATGCAAGCTCATTACCTTTTCCTGCAAAACCTGGACCTATTGAGGAA
GATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCAAAGCTAACTGCATCGACATCCTCGGTTCAGATGCTCTCTTACGTCCTATCTTCCCTGGCATCGGAGGGTGTAAT
TGGAAATCCAAATAAAGAGTTATCTGGTGATTATCCATCTGAGAAGAGGGCGAAACTTGAAAATGACCAGTCACCCTACACATTGCCTCCGAATCCACAGCGACCGTCCG
TCTCTTCCTTCCCACACCCGGAGTCTCTCCAACATAATGCCTCGTCCACCAGTCAACAATACACTCCTACTGACCCTCCACCTCCCCCGTCATCTTCTCCACCACCGATG
CCTCCGTTACCTCCTGTAGCGCAGTTTCCTCTGCCCCAGTTCACACAGAATGCTGGGTCAGTGAGTAGCATACCTTACAGTTACACCATGACACAATCACTGCCACCATT
AGCGATGCCTGGCTATCCAAATGTAGGTGCCCCGGTGACTGGGATGTCTCCTTTTACAATACCAACAAATTCTTACCAGAGTTTTCAGGCTTCAGATGGTAATTTCTATA
ATCAGTCTTCATCCATGCCGATGACACCAATTTCTAGGCAATAG
mRNA sequence
GCCCAAGGAGCGGCGCCAAATCCAGGGCTCATTGACAATCGCAAAGCCGAAAGAAATCAGTTCCACTACCACTTTCAGTTCCTTCCTTCACCCGTGCGGCCCAAACTCTA
GCGACATCGTGTCGAGCCAAATACGCTTCACATTCACAATCACAATCACAATCACAATCCCAACCAAAACCACACCGCTATTCTAAAATCTAAACCCATCTTATCTGGCG
AAGCCACCCCCTGATTCCAAAATCCCCGCTTCCCCTTCCTGGGTTTTCTTCTTCGCCCTCTATCTGGGTCTTTCTCTTCTTCACACACTCCGTCTTCCCTTACCTTCAAC
TCTCCCTCTATTCGCCCTTTGCACTCTGTTTTCACTAACCCTTTTGTAATTCCCCTCCTCTCTTCCATGGAGGAAGGCCTCGAACAGCCTAATTTTGCCCCCTTTTAGTT
GCCTTCTCTCTAACCCACATTCCCTTTTCCAGATTGGCCTGAATTTTCTACGTTATTTCTGCAAAAAAGTTATCCAAATCAATGGGTGGTACATTCAATCCACAAATTTT
GGTGGACAAGCTAGCCAGGCTCAACAATTCACAGGCGAGCATTGAGACTTTATCCCATTGGTGTATATTTCACATGAACAAAGCCAAGCAAGTTGTAGAAACATGGGACA
AGCAATTTCATTGTTCTCCACGCGAGCAGAGATTGGCCTATCTATATCTTGCAAATGACATTTTGCAGAACAGTAGGCGAAAAGGCTCAGAGTTTGTTGGTGAATTTTGG
AAAGTACTTCCAGATGCACTTCGTGATGTAATTGAGAGTGGGGACGATTTTGGAAGAAATGCTGCCCTACGACTGATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATC
TCGAGGGCAGAGTCTTAAGGAAGAGATAATGGGAAAGCATATGGAAACTGGTAATCGAAATGGGAAGCAATTCAGCAATAAACTGCAGAAACAATCCGTCAGCGTATCAT
TGGATAAAATAGTTTCTGGTTACCAAGTTGTTTCTGGAAGTGAGGTAGATGAAGATGTAGTATTAAGCAAATGCAGGAGTTCTATTAGCTATCTTGAGAAACTGGACAAG
GAAATTGGTGCTGATGGAAATTCAGGGCAATACCATGGATCTTCAATTGCAGAGGATCTGCGGGGACATCATACCATTTTGAGGGACTGCATCGAACAATTAACAGCGAT
TGAAACATCAAGGGCAAGTCTCGTGTCTCATCTGAGGGAGGCTCTTCAAGAACAGGAATTCAAATTGGAGCAAGTCAGAAATCAACTTCAGGCTTCCCATTCCCAGTCGG
AACAAACTCAGAATCTCTGCCGTCAGTTTCTAAATGGTGAAAATGTGCAACCTATGAGTGAGGAGGCCTCAAAAGATGCTCAGACCTCGATAGCACCACACGGCCTTGTA
CCGAGGGAGAGAGAACAATCAGCACCAGTAATGTATGCAAGCTCATTACCTTTTCCTGCAAAACCTGGACCTATTGAGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGT
GGCAGCAAAGCTAACTGCATCGACATCCTCGGTTCAGATGCTCTCTTACGTCCTATCTTCCCTGGCATCGGAGGGTGTAATTGGAAATCCAAATAAAGAGTTATCTGGTG
ATTATCCATCTGAGAAGAGGGCGAAACTTGAAAATGACCAGTCACCCTACACATTGCCTCCGAATCCACAGCGACCGTCCGTCTCTTCCTTCCCACACCCGGAGTCTCTC
CAACATAATGCCTCGTCCACCAGTCAACAATACACTCCTACTGACCCTCCACCTCCCCCGTCATCTTCTCCACCACCGATGCCTCCGTTACCTCCTGTAGCGCAGTTTCC
TCTGCCCCAGTTCACACAGAATGCTGGGTCAGTGAGTAGCATACCTTACAGTTACACCATGACACAATCACTGCCACCATTAGCGATGCCTGGCTATCCAAATGTAGGTG
CCCCGGTGACTGGGATGTCTCCTTTTACAATACCAACAAATTCTTACCAGAGTTTTCAGGCTTCAGATGGTAATTTCTATAATCAGTCTTCATCCATGCCGATGACACCA
ATTTCTAGGCAATAGAGCTATGTAATACTAGTGCTACTTGATCTGTTGTGCTAACAAATCTTTCAGGAAGTGGTGCTTCCCATTAAGGCCTACGAGGAAATCTTGTATCC
TTGTATGCTCCGTAAGTTCTGACGACTTGACTAAATGGATTTTGCACATAGTAATAGTTTGTAATTTATCTTCACTTTTCAATGTATTGCTTGGAGTCTCGAACCATATT
TCGTCTAGTTGAATGGAGTAGTATGATAATTATGTTGACTTCATAGATTTTTGCACATAGTAAGAATTTGTAATTTAACTTCATTTATTAATGTAATGCTTGGAGTCTGG
AACAGAACTTCATC
Protein sequence
MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIESGDDFGRNAALRLIG
IWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSNKLQKQSVSVSLDKIVSGYQVVSGSEVDEDVVLSKCRSSISYLEKLDKEIGADGNSGQYHGSSIAEDLRGHHTIL
RDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHSQSEQTQNLCRQFLNGENVQPMSEEASKDAQTSIAPHGLVPREREQSAPVMYASSLPFPAKPGPIEE
DPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPNKELSGDYPSEKRAKLENDQSPYTLPPNPQRPSVSSFPHPESLQHNASSTSQQYTPTDPPPPPSSSPPPM
PPLPPVAQFPLPQFTQNAGSVSSIPYSYTMTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMTPISRQ
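
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1), which the page does not state explicitly. The helper name `translate` is illustrative, and unknown codons are rendered as `X` by choice. Only short excerpts of the CDS are used here; pasting the full CDS lines (line breaks included) works the same way.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1; '*' marks a stop codon, 'X' an unknown codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3:
        raise ValueError("CDS length is not a multiple of three")
    return "".join(
        CODON_TABLE.get(cds[i:i + 3], "X") for i in range(0, len(cds), 3)
    )


# First 30 nt and last 21 nt of the CDS listed above.
print(translate("ATGGGTGGTACATTCAATCCACAAATTTTG"))  # MGGTFNPQIL
print(translate("ACACCAATTTCTAGGCAATAG"))           # TPISRQ*
```

Both excerpts agree with the start (`MGGTFNPQIL`) and end (`TPISRQ`) of the protein sequence, with the final `TAG` codon read as the stop.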