CuGenDBv2

Lag0030404 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030404
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: UPF0400 protein C337.03 isoform X1
Genome location: chr8:46984495..46997658
RNA-Seq Expression: Lag0030404
Synteny: Lag0030404
Gene Ontology terms:
GO:0031124 - mRNA 3'-end processing (biological process)
GO:0016591 - RNA polymerase II, holoenzyme (cellular component)
GO:0000993 - RNA polymerase II complex binding (molecular function)
InterPro domains:
IPR006569 - CID domain
IPR008942 - ENTH/VHS
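
The genome location field uses a `chromosome:start..end` layout. As a minimal sketch of how such a string could be split into its parts, the helper below (the name `parse_location` is hypothetical and not part of CuGenDB) assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split 'chrom:start..end' into (chrom, start, end, span_in_bp)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span
    # counts both endpoints.
    return chrom, start, end, end - start + 1


chrom, start, end, span = parse_location("chr8:46984495..46997658")
print(chrom, start, end, span)
```

Under that assumption the gene spans 13164 bp on chr8; with 0-based or half-open coordinates the figure would differ by one.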


Homology
GenBank top hits | e value | % identity
TYK06664.1 UPF0400 protein isoform X1 [Cucumis melo var. makuwa] | 7.8e-232 | 92.09
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED VLSKCRNSISYLEKLDKE+GADVNSGQY GSSIA+DLRGHHTILRDCIEQLT IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA+S+PFP+KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFPLPQFTQ+AGSVSS  IPY+YS+TQSLPPLAMPGYPN GAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP

XP_004138638.1 UPF0400 protein C337.03 [Cucumis sativus] | 1.3e-229 | 91.43
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED VLSKCRNSISYLEKLDKE+G DVNSGQY GSSIA+DLRGHH+ILRDCIEQLT IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ SKDAQTS+APHSLV REREQSAPVMYA+S+PFP+KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN+SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFPLPQFTQ+AGSVSS  IPY+YS+TQSLPPLAMPGYPN GAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP

XP_008441251.1 PREDICTED: UPF0400 protein C337.03 isoform X1 [Cucumis melo] | 7.8e-232 | 92.09
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED VLSKCRNSISYLEKLDKE+GADVNSGQY GSSIA+DLRGHHTILRDCIEQLT IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA+S+PFP+KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFPLPQFTQ+AGSVSS  IPY+YS+TQSLPPLAMPGYPN GAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP

XP_022152479.1 UPF0400 protein C337.03 [Momordica charantia] | 2.3e-231 | 91.83
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGD+FGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGKQFS+KLKQS S SLDKIV+GYQVVYG+E+DEDVVLSKCRNSISYLEKLDKE+GADVNSGQYHGSS++EDL+ HHTILR CIEQLTAIE+SRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        +LVSHLREALQEQEFKL++VRNQLQASH+QSEQTQNL RQFLNGENVQPM E+ASKDAQTSIAPHSLVPREREQSAPVMYA+SLPFPAKPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE SSDYPSEKRPKLENDQ PYTLPPN QRPPVSSFPHPESLQHNASS+SQQYTPTDPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPV QFPLPQFTQ+AGSVSS+PY+YSLTQ L PLAMPGYPNVG P
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAP

XP_038884747.1 UPF0400 protein C337.03 [Benincasa hispida] | 1.6e-232 | 92.72
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETG+RNGK FS KLKQSASVSLDKIVSGYQVVYG+E+DED VLSKCRNSISYLEKLDKE+G DVNSGQY GSSIA+DLRGHHTILRDCIEQLT+IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ASKDAQTSIAPHSLVPR+REQSAPVMYA SLPFP KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+KEL  DYPSEKRPKLENDQ PYTLPPN QRPPVSSFPHPESLQ N SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFP+PQFTQ+ GSVSSIPY+YS+TQSLPPLAMPGYPNVGAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAP

TrEMBL top hits | e value | % identity
A0A0A0LMU3 CID domain-containing protein | 6.1e-230 | 91.43
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED VLSKCRNSISYLEKLDKE+G DVNSGQY GSSIA+DLRGHH+ILRDCIEQLT IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ SKDAQTS+APHSLV REREQSAPVMYA+S+PFP+KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN+SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFPLPQFTQ+AGSVSS  IPY+YS+TQSLPPLAMPGYPN GAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP

A0A1S3B3N2 UPF0400 protein C337.03 isoform X1 | 3.8e-232 | 92.09
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED VLSKCRNSISYLEKLDKE+GADVNSGQY GSSIA+DLRGHHTILRDCIEQLT IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA+S+PFP+KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFPLPQFTQ+AGSVSS  IPY+YS+TQSLPPLAMPGYPN GAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP

A0A1S3B3S0 UPF0400 protein C337.03 isoform X2 | 3.4e-225 | 90.33
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED VLSKCRNSISYLEKLDKE+GADVNS         +DLRGHHTILRDCIEQLT IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA+S+PFP+KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFPLPQFTQ+AGSVSS  IPY+YS+TQSLPPLAMPGYPN GAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP

A0A5D3C5Q5 UPF0400 protein isoform X1 | 3.8e-232 | 92.09
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED VLSKCRNSISYLEKLDKE+GADVNSGQY GSSIA+DLRGHHTILRDCIEQLT IETSRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        SLVSHLREALQEQEFKLEQVRNQLQASH+QSEQTQNLCRQFLNGENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA+S+PFP+KPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SS+SQQYTP+DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPVAQFPLPQFTQ+AGSVSS  IPY+YS+TQSLPPLAMPGYPN GAP
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAP

A0A6J1DG44 UPF0400 protein C337.03 | 1.1e-231 | 91.83
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGD+FGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKH+
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA
        ETGNRNGKQFS+KLKQS S SLDKIV+GYQVVYG+E+DEDVVLSKCRNSISYLEKLDKE+GADVNSGQYHGSS++EDL+ HHTILR CIEQLTAIE+SRA
Subjt:  ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRA

Query:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS
        +LVSHLREALQEQEFKL++VRNQLQASH+QSEQTQNL RQFLNGENVQPM E+ASKDAQTSIAPHSLVPREREQSAPVMYA+SLPFPAKPGP EEDPRKS
Subjt:  SLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE SSDYPSEKRPKLENDQ PYTLPPN QRPPVSSFPHPESLQHNASS+SQQYTPTDPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAP
        SPPPMPPLPPV QFPLPQFTQ+AGSVSS+PY+YSLTQ L PLAMPGYPNVG P
Subjt:  SPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAP

SwissProt top hits | e value | % identity
Q0P5J9 Regulation of nuclear pre-mRNA domain-containing protein 1A | 4.0e-13 | 26.56
Query:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRG-QSLKEEIMGKHMET
        ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    DE  +    R++ IWEER V+ +   + LK+ + G   + 
Subjt:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRG-QSLKEEIMGKHMET

Query:  GNRNGKQFSIKLKQSAS------------------VSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKL-DKEVGADVNSGQYHGSSIAEDLRGHHT
          R  +Q  +   ++ S                    L+   SG   V+       V + +    +S L+K+ DKE G  ++        +  D  G   
Subjt:  GNRNGKQFSIKLKQSAS------------------VSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKL-DKEVGADVNSGQYHGSSIAEDLRGHHT

Query:  ILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL
           D  +QLT +    A  +   +EAL E+E KLE+ + +L
Subjt:  ILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL

Q8VDS4 Regulation of nuclear pre-mRNA domain-containing protein 1A | 2.4e-13 | 26.97
Query:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRG-QSLKEEIMGKHMET
        ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    DE  +    R++ IWEER V+ +   + LK  + G   + 
Subjt:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRG-QSLKEEIMGKHMET

Query:  GNRNGKQFSIKLKQSAS------------------VSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKL-DKEVGADVNSGQYHGSSIAEDLRGHHT
          R  +Q  +   ++ S                    L+   SG   V+       V + +    +S LEK+ DKE G  ++        +  D  G   
Subjt:  GNRNGKQFSIKLKQSAS------------------VSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKL-DKEVGADVNSGQYHGSSIAEDLRGHHT

Query:  ILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL
           D  +QLT +    A  +   +EAL E+E KLE+ + +L
Subjt:  ILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL

Q96P16 Regulation of nuclear pre-mRNA domain-containing protein 1A | 4.0e-13 | 26.56
Query:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRG-QSLKEEIMGKHMET
        ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    DE  +    R++ IWEER V+ +   + LK+ + G   + 
Subjt:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRG-QSLKEEIMGKHMET

Query:  GNRNGKQFSIKLKQSAS------------------VSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKL-DKEVGADVNSGQYHGSSIAEDLRGHHT
          R  +Q  +   ++ S                    L+   SG   V+       V + +    +S L+K+ DKE G  ++        +  D  G   
Subjt:  GNRNGKQFSIKLKQSAS------------------VSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKL-DKEVGADVNSGQYHGSSIAEDLRGHHT

Query:  ILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL
           D  +QLT +    A  +   +EAL E+E KLE+ + +L
Subjt:  ILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL

Q9CSU0 Regulation of nuclear pre-mRNA domain-containing protein 1B | 2.4e-13 | 42.68
Query:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFG
        A  +V  W ++   +   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    DE  +    RL+ IW+ER V+G
Subjt:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFG

Q9NQG5 Regulation of nuclear pre-mRNA domain-containing protein 1B | 2.4e-13 | 42.68
Query:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFG
        A  +V  W ++   +   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    DE  +    RL+ IW+ER V+G
Subjt:  AKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFG

Arabidopsis top hits | e value | % identity
AT3G26990.1 ENTH/VHS family protein | 4.1e-114 | 53.85
Query:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM
        MNKAK VVETW +QFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRD+IENGD+FGR +A RL+ IWEERKVFGSRGQ LKEE++G+  
Subjt:  MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM

Query:  ETGNRNGKQFSIKL----KQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIE
        E G RNG    +KL    +Q    +L+K+VS  +V++G ++DED ++ K  N+  YLEK  +EV  D++SG   G ++ ++L+G H ILRDCIEQL A+E
Subjt:  ETGNRNGKQFSIKL----KQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIE

Query:  TSRASLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLN--GENVQPMTEDAS-----KDAQTSIAPHSLVPREREQSAPVMYASSLPFPAK
        TSR SL+SHLREALQEQE KLEQVRN LQ +  QS++T +LCRQ L+  G +  P TE+       K + T+ AP S    + EQSAPVM+AS+      
Subjt:  TSRASLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLN--GENVQPMTEDAS-----KDAQTSIAPHSLVPREREQSAPVMYASSLPFPAK

Query:  PGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGN-----PSKELSS-DYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNAS
        P    EDPRK+AAAAV AKLTASTSS +MLSYVLSSLASEG+IGN      ++ LSS D+P EKRPKL+N    Y  P                  H  +
Subjt:  PGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGN-----PSKELSS-DYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNAS

Query:  SSSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQS
        +++   TP  P PPP       PP     QF  P   Q  G V+  P+ Y++  S
Subjt:  SSSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQS

AT5G10060.1 ENTH/VHS family protein | 5.5e-50 | 31.87
Query:  NKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM-
        +KA+ +V TW+KQFH +  +Q++  LYLANDILQNS+R+G+EFV EFW VLP AL+D++  GD+ G++A  R+I IWEER+VFGSR +SLK+ ++G+ + 
Subjt:  NKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHM-

Query:  -------------ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYH--GSSIAEDLRGHHTIL
                     ++  R  K    KL  S  V+ +KI S Y +V     +E+  ++KC++++  + K++K+V    ++ + +    S+A++L     +L
Subjt:  -------------ETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYH--GSSIAEDLRGHHTIL

Query:  RDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLP
        R CIE+L +++ SR+SLV+ L++AL+EQE +L+ ++ Q+Q +  Q+E+ QN+ ++  + +     T  A+   +T+    S                   
Subjt:  RDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLP

Query:  FPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSS
                     K   A++AA LTASTSS  ++  VLSS A+E               + K   L   +S  T+P +      +SFP   + Q+   ++
Subjt:  FPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSS

Query:  SQQYTPTDPPPPPSSSPPP----------MPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSL--PPLAMPGYPNVGAP
          QY     PPPP    PP          +P +PP    P P           IP + S  QS   P    PG    GAP
Subjt:  SQQYTPTDPPPPPSSSPPP----------MPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSL--PPLAMPGYPNVGAP

AT5G65180.1 ENTH/VHS family protein | 6.7e-48 | 34.43
Query:  NKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIM-----
        ++A+ VV TW+KQFH +   Q++  LYLANDILQNS+R+G+EFV EFWKVLP AL+D++  GD++G+    RL+ IWEER+VFGSR +SLK+ ++     
Subjt:  NKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIM-----

Query:  -----GKHMETGNRNGKQ--FSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEV-GADVNSGQYHGSSIAEDLRGHHTILRDC
              K    G+++ K+   S K K S+    +KIVS + +V     +E+  ++KC++++  + K++K+V  A   +      S+A++L     ILR  
Subjt:  -----GKHMETGNRNGKQ--FSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEV-GADVNSGQYHGSSIAEDLRGHHTILRDC

Query:  IEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTS-IAPHSLVPREREQSAPVMYASSLPFP
        +E+L ++E SR SLV+HLREAL+EQE +LE +++Q+Q +  Q+E+ QN+ ++ LN E   P+  +     Q++ I P S+                    
Subjt:  IEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTS-IAPHSLVPREREQSAPVMYASSLPFP

Query:  AKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQ
                       AA+A  LT+ST+S  ++  VLSS A+E           +   S       +D + + +PPN Q+  +   P+P        ++SQ
Subjt:  AKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQ

Query:  QYT---PTDPPPPPSSSPPPMP-PLPP
        Q+       P  PP + PPP P  LPP
Subjt:  QYT---PTDPPPPPSSSPPPMP-PLPP

AT5G65180.2 ENTH/VHS family protein | 2.2e-14 | 29.41
Query:  SIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEV-GADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREA
        S K K S+    +KIVS + +V     +E+  ++KC++++  + K++K+V  A   +      S+A++L     ILR  +E+L ++E SR SLV+HLREA
Subjt:  SIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEV-GADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREA

Query:  LQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTS-IAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAK
        L+EQE +LE +++Q+Q +  Q+E+ QN+ ++ LN E   P+  +     Q++ I P S+                                   AA+A  
Subjt:  LQEQEFKLEQVRNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTS-IAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAK

Query:  LTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYT---PTDPPPPPSSSPPPM
        LT+ST+S  ++  VLSS A+E           +   S       +D + + +PPN Q+  +   P+P        ++SQQ+       P  PP + PPP 
Subjt:  LTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYT---PTDPPPPPSSSPPPM

Query:  P-PLPP
        P  LPP
Subjt:  P-PLPP
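
In the alignment blocks above, the unlabelled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for a mismatch or gap. The sketch below shows one way a percent identity could be estimated for a single block from that middle line. The function name `percent_identity` is hypothetical, and the denominator (all columns of the block, gaps included) is an assumption; the % identity values listed on this page were computed by the database and may use a different convention.

```python
def percent_identity(query_line, midline):
    """Percent of alignment columns whose midline character is a letter."""
    columns = len(query_line)
    if columns == 0:
        raise ValueError("empty alignment block")
    # Text copies often lose trailing spaces, so pad the midline back
    # out to the width of the query line before counting.
    padded = midline.ljust(columns)[:columns]
    identical = sum(1 for ch in padded if ch.isalpha())
    return 100.0 * identical / columns


# First 16 columns of the second block of the TYK06664.1 alignment.
query = "ETGNRNGKQFSIKLKQ"
mid = "ETGNRNGK F+ KLKQ"
print(round(percent_identity(query, mid), 2))
```

For a whole hit, the identical-column counts and block widths would need to be summed across every block before dividing.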


Sequences
CDS sequence
ATGAACAAAGCCAAGCAAGTGGTAGAAACATGGGATAAGCAGTTTCATTGTTCTCCACGCGAGCAGAGATTGGCCTATCTGTATCTTGCAAATGACATTTTGCAGAACAG
TAGGCGAAAAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCAGATGCACTTCGTGATGTAATTGAGAATGGGGATGAGTTTGGAAGAAATGCTGCCCTACGAC
TGATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGGCAGAGTCTTAAGGAAGAGATAATGGGAAAGCATATGGAAACTGGTAATCGGAATGGGAAGCAATTC
AGCATTAAACTGAAACAATCTGCCAGCGTATCGTTAGATAAAATAGTTTCTGGTTACCAAGTTGTTTATGGAAGTGAGGTAGATGAAGATGTGGTACTGAGCAAATGCAG
GAATTCTATTAGCTATCTTGAGAAACTGGACAAAGAAGTTGGTGCTGATGTCAATTCAGGGCAATACCATGGATCTTCAATTGCAGAGGATCTGAGGGGACATCATACCA
TTTTGAGGGACTGCATCGAACAATTAACAGCAATTGAAACATCAAGGGCAAGTCTCGTATCTCATCTGAGAGAGGCTCTTCAAGAACAGGAATTCAAATTGGAGCAAGTC
CGAAATCAACTTCAGGCTTCCCATACCCAGTCGGAACAAACTCAGAATCTCTGCCGTCAGTTTCTAAATGGTGAAAATGTGCAACCTATGACTGAGGATGCCTCAAAAGA
TGCTCAAACCTCGATAGCACCACACAGTCTTGTACCAAGGGAGAGAGAACAGTCAGCGCCAGTAATGTATGCAAGCTCATTACCTTTTCCTGCAAAACCTGGACCTATCG
AGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCTAAGCTAACTGCATCGACATCCTCAGTTCAGATGCTCTCATACGTCCTATCTTCCCTGGCGTCAGAGGGT
GTAATTGGAAATCCAAGTAAAGAGTTATCCAGTGATTATCCATCTGAGAAGAGGCCTAAACTTGAAAATGACCAGTCACCCTACACGTTGCCTCCAAATCTGCAGCGACC
ACCAGTCTCTTCCTTCCCACACCCGGAGTCACTCCAACATAATGCCTCATCCAGCAGTCAACAATACACTCCTACTGACCCTCCTCCTCCCCCGTCATCATCTCCACCGC
CGATGCCTCCGTTACCTCCTGTAGCACAGTTCCCTCTGCCCCAGTTCACACAGCATGCTGGGTCAGTAAGTAGCATACCTTACACTTACAGTTTGACACAGTCGCTGCCA
CCATTAGCGATGCCTGGCTATCCAAATGTAGGTGCCCCGGAAGTGGTGCATCTCATTGAGGCCCTACGAGGAAATCTTGGTGTCATGGTGAATGGTTGCATTCGGCACGA
GAACGTTGACAAGTCTTGTGCTATTTCAGGTGTTTGTAAAGGCTACGACCCACTTGGGTTGAGTGACTCTAGTTTCATTGGGTTGAGCTATGAGCTATTTGGTCAAGTGG
CTTATCACTTGGTTTGA
mRNA sequence
ATGAACAAAGCCAAGCAAGTGGTAGAAACATGGGATAAGCAGTTTCATTGTTCTCCACGCGAGCAGAGATTGGCCTATCTGTATCTTGCAAATGACATTTTGCAGAACAG
TAGGCGAAAAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCAGATGCACTTCGTGATGTAATTGAGAATGGGGATGAGTTTGGAAGAAATGCTGCCCTACGAC
TGATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGGCAGAGTCTTAAGGAAGAGATAATGGGAAAGCATATGGAAACTGGTAATCGGAATGGGAAGCAATTC
AGCATTAAACTGAAACAATCTGCCAGCGTATCGTTAGATAAAATAGTTTCTGGTTACCAAGTTGTTTATGGAAGTGAGGTAGATGAAGATGTGGTACTGAGCAAATGCAG
GAATTCTATTAGCTATCTTGAGAAACTGGACAAAGAAGTTGGTGCTGATGTCAATTCAGGGCAATACCATGGATCTTCAATTGCAGAGGATCTGAGGGGACATCATACCA
TTTTGAGGGACTGCATCGAACAATTAACAGCAATTGAAACATCAAGGGCAAGTCTCGTATCTCATCTGAGAGAGGCTCTTCAAGAACAGGAATTCAAATTGGAGCAAGTC
CGAAATCAACTTCAGGCTTCCCATACCCAGTCGGAACAAACTCAGAATCTCTGCCGTCAGTTTCTAAATGGTGAAAATGTGCAACCTATGACTGAGGATGCCTCAAAAGA
TGCTCAAACCTCGATAGCACCACACAGTCTTGTACCAAGGGAGAGAGAACAGTCAGCGCCAGTAATGTATGCAAGCTCATTACCTTTTCCTGCAAAACCTGGACCTATCG
AGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCTAAGCTAACTGCATCGACATCCTCAGTTCAGATGCTCTCATACGTCCTATCTTCCCTGGCGTCAGAGGGT
GTAATTGGAAATCCAAGTAAAGAGTTATCCAGTGATTATCCATCTGAGAAGAGGCCTAAACTTGAAAATGACCAGTCACCCTACACGTTGCCTCCAAATCTGCAGCGACC
ACCAGTCTCTTCCTTCCCACACCCGGAGTCACTCCAACATAATGCCTCATCCAGCAGTCAACAATACACTCCTACTGACCCTCCTCCTCCCCCGTCATCATCTCCACCGC
CGATGCCTCCGTTACCTCCTGTAGCACAGTTCCCTCTGCCCCAGTTCACACAGCATGCTGGGTCAGTAAGTAGCATACCTTACACTTACAGTTTGACACAGTCGCTGCCA
CCATTAGCGATGCCTGGCTATCCAAATGTAGGTGCCCCGGAAGTGGTGCATCTCATTGAGGCCCTACGAGGAAATCTTGGTGTCATGGTGAATGGTTGCATTCGGCACGA
GAACGTTGACAAGTCTTGTGCTATTTCAGGTGTTTGTAAAGGCTACGACCCACTTGGGTTGAGTGACTCTAGTTTCATTGGGTTGAGCTATGAGCTATTTGGTCAAGTGG
CTTATCACTTGGTTTGA
Protein sequence
MNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQF
SIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEVGADVNSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQV
RNQLQASHTQSEQTQNLCRQFLNGENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYASSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEG
VIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSSSQQYTPTDPPPPPSSSPPPMPPLPPVAQFPLPQFTQHAGSVSSIPYTYSLTQSLP
PLAMPGYPNVGAPEVVHLIEALRGNLGVMVNGCIRHENVDKSCAISGVCKGYDPLGLSDSSFIGLSYELFGQVAYHLV
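
The protein sequence should correspond to the CDS read codon by codon. A minimal sketch of that check follows, assuming the standard nuclear genetic code; the names `CODON_TABLE` and `translate` are illustrative and do not come from CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS to protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 39 nucleotides of the CDS listed above.
print(translate("ATGAACAAAGCCAAGCAAGTGGTAGAAACATGGGATAAG"))
```

Passing the full CDS text (line breaks are ignored) and comparing the result with the protein sequence above is a quick way to confirm that the two records agree.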