; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg039269 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg039269
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptionregulation of nuclear pre-mRNA domain-containing protein 1B-like
Genome locationscaffold10:46872275..46887697
RNA-Seq ExpressionSpg039269
SyntenySpg039269
Gene Ontology termsGO:0006397 - mRNA processing (biological process)
InterPro domainsIPR006569 - CID domain
IPR008942 - ENTH/VHS


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004138638.1 UPF0400 protein C337.03 [Cucumis sativus]1.3e-23861.21Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIG DV+SGQY GSSIA+DLRGHH+ILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNLCRQFLN ENVQPMTE+ SKDAQTS+APHSLV REREQSAPVMYA S+PFP+KPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN+SSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPY+YS+TQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQA DG+FY+QSSSMPMAPISRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

XP_008441251.1 PREDICTED: UPF0400 protein C337.03 isoform X1 [Cucumis melo]5.7e-24261.84Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIGADV+SGQY GSSIA+DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNLCRQFLN ENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA S+PFP+KPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPY+YS+TQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQA DGNFYNQSSSMPMAPISRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

XP_008441252.1 PREDICTED: UPF0400 protein C337.03 isoform X2 [Cucumis melo]6.8e-23560.83Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIGADV+S         +DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNLCRQFLN ENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA S+PFP+KPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPY+YS+TQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQA DGNFYNQSSSMPMAPISRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

XP_022152479.1 UPF0400 protein C337.03 [Momordica charantia]7.0e-24061.24Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGKQFS+KLKQS S SLDKIV+GYQVVYG+E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
        VVLSKCRNSISYLEKLDKEIGADV+SGQYHGSS++EDL+ HHTILR CIEQLTAIE+SRA+LVSHLREALQEQEFKL++VRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNL RQFLN ENVQPM E+ASKDAQTSIAPHSLVPREREQSAPVMYA SLPFPAKPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE SSDYPSEKRPKLENDQ PYTLPPN QRPPVSSFPHPESLQHNASSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPV QFPLPQFTQNAGSVSS+PY+YSLTQ L PLAMPGYPNVG PVTGMSPFTIPTNSYQ+FQASDGNFYNQSSSMPMAP+SRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

XP_038884747.1 UPF0400 protein C337.03 [Benincasa hispida]2.0e-24262.12Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETG+RNGK FS KLKQSASVSLDKIVSGYQVVYG+E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIG DV+SGQY GSSIA+DLRGHHTILRDCIEQLT+IETSRASLVSHLREALQEQEFKLEQVRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNLCRQFLN ENVQPMTE+ASKDAQTSIAPHSLVPR+REQSAPVMYA SLPFP KPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+KEL  DYPSEKRPKLENDQ PYTLPPN QRPPVSSFPHPESLQ N SSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPVAQFP+PQFTQN GSVSSIPY+YS+TQSLPPLAMPGYPNVGAPVTG+SPFTIPTNSYQSFQA DGNFYNQSSSMPMAPISRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

TrEMBL top hitse value%identityAlignment
A0A0A0LMU3 CID domain-containing protein6.4e-23961.21Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIG DV+SGQY GSSIA+DLRGHH+ILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNLCRQFLN ENVQPMTE+ SKDAQTS+APHSLV REREQSAPVMYA S+PFP+KPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN+SSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPY+YS+TQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQA DG+FY+QSSSMPMAPISRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

A0A1S3B3N2 UPF0400 protein C337.03 isoform X12.8e-24261.84Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIGADV+SGQY GSSIA+DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNLCRQFLN ENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA S+PFP+KPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPY+YS+TQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQA DGNFYNQSSSMPMAPISRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

A0A1S3B3S0 UPF0400 protein C337.03 isoform X23.3e-23560.83Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVI NGDEF
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGK F+ KLKQSASVSLDKIVSGYQVVYG E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIGADV+S         +DLRGHHTILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNLCRQFLN ENVQPMTE+ SKDAQTS+APHSLVPREREQSAPVMYA S+PFP+KPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+K+L  DYPSEKRPKLENDQ PY LPPN QRPPVSSFPHPESLQHN SSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPVAQFPLPQFTQNAGSVSS  IPY+YS+TQSLPPLAMPGYPN GAPVTGMSPFTIPTNSYQ+FQA DGNFYNQSSSMPMAPISRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSS--IPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

A0A6J1DG44 UPF0400 protein C337.033.4e-24061.24Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFN  ILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGKH+ETGNRNGKQFS+KLKQS S SLDKIV+GYQVVYG+E+DED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
        VVLSKCRNSISYLEKLDKEIGADV+SGQYHGSS++EDL+ HHTILR CIEQLTAIE+SRA+LVSHLREALQEQEFKL++VRNQLQ               
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS
                                 ASH+QSEQTQNL RQFLN ENVQPM E+ASKDAQTSIAPHSLVPREREQSAPVMYA SLPFPAKPGP EEDPRKS
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMTEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKS

Query:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS
        AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP KE SSDYPSEKRPKLENDQ PYTLPPN QRPPVSSFPHPESLQHNASSTSQQYTP DPPPPPSS
Subjt:  AAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSS

Query:  SPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SPPPMPPLPPV QFPLPQFTQNAGSVSS+PY+YSLTQ L PLAMPGYPNVG PVTGMSPFTIPTNSYQ+FQASDGNFYNQSSSMPMAP+SRQ
Subjt:  SPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

A0A6J1FFD5 UPF0400 protein C337.03-like4.2e-23060.33Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIE+GD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GRNAALRL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED
                                          IGIWEERKVFGSRGQSLKEEIMGK +ETGNRNGK FS KLKQS S+SLDKIV GYQVVY SEVDED
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDED

Query:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF
         VLSKCRNSISYLEKLDKEIGADV+SGQY G+S AEDLRGHH ILRDCIEQLT IETSRASLVSHLREALQEQEFKLEQVRNQLQV              
Subjt:  VVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAF

Query:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW
                                                                                                            
Subjt:  VLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSW

Query:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMT-EDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRK
                                  SH+QSEQTQNLCRQFLN ENV+ MT E+ASKDAQTSIAPH+LVPRER+QSAPVMYA SLPFPAKPGP+EEDPRK
Subjt:  ICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPMT-EDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRK

Query:  SAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPS
        SAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNP+KEL  DYPSEKR KLENDQSPYTLPPN QRPPV  FPHPESLQHNASSTSQQYTP D PPPPS
Subjt:  SAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPS

Query:  SSPPPMPPLPPVAQFPLPQFTQN-AGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ
        SSPPP+PPLPPV Q PLPQFTQN AGSVSSI Y+YS+TQSL PLA PGYPN+GAPVTGMSP TIPTNSYQSFQ SDGNFYN SSSMPMAPISRQ
Subjt:  SSPPPMPPLPPVAQFPLPQFTQN-AGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSFQASDGNFYNQSSSMPMAPISRQ

SwissProt top hitse value%identityAlignment
Q0P5J9 Regulation of nuclear pre-mRNA domain-containing protein 1A5.9e-1640Show/hide
Query:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    DE
Subjt:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE

Q8VDS4 Regulation of nuclear pre-mRNA domain-containing protein 1A5.9e-1640Show/hide
Query:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    DE
Subjt:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE

Q96P16 Regulation of nuclear pre-mRNA domain-containing protein 1A5.9e-1640Show/hide
Query:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE
        F+   L  KL+ L+NSQ S++TLS W I H   ++ +V  W+++   +   ++L +LYLAND++QNS+RKG EF  +F  V+ +A + V    DE
Subjt:  FNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE

Q9CSU0 Regulation of nuclear pre-mRNA domain-containing protein 1B4.1e-1743.75Show/hide
Query:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE
        +F+   L  KL+ L+NSQ S++TLS W I H   A  +V  W ++   +   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    DE
Subjt:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE

Q9NQG5 Regulation of nuclear pre-mRNA domain-containing protein 1B4.1e-1743.75Show/hide
Query:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE
        +F+   L  KL+ L+NSQ S++TLS W I H   A  +V  W ++   +   ++L +LYLAND++QNS+RKG EF  EF  VL DA   V    DE
Subjt:  TFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDE

Arabidopsis top hitse value%identityAlignment
AT3G26990.1 ENTH/VHS family protein2.4e-10536.4Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        MG +FN QILV+KLA+LNNSQASIETLSHWCIFHMNKAK VVETW +QFHC+PREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRD+IENGD+F
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        GR +A RL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKL----KQSASVSLDKIVSGYQVVYGSE
                                          + IWEERKVFGSRGQ LKEE++G+  E G RNG    +KL    +Q    +L+K+VS  +V++G +
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKL----KQSASVSLDKIVSGYQVVYGSE

Query:  VDEDVVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRI
        +DED ++ K  N+  YLEK  +E+  D+SSG   G ++ ++L+G H ILRDCIEQL A+ETSR SL+SHLREALQEQE KLEQVRN LQ+          
Subjt:  VDEDVVLSKCRNSISYLEKLDKEIGADVSSGQYHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRI

Query:  LKAFVLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHT
                                                 +RF                                                        
Subjt:  LKAFVLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRFPLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHT

Query:  EDSWICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLN--CENVQPMTEDAS-----KDAQTSIAPHSLVPREREQSAPVMYACSLPFPAK
                                         QS++T +LCRQ L+    +  P TE+       K + T+ AP S    + EQSAPVM+A      + 
Subjt:  EDSWICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLN--CENVQPMTEDAS-----KDAQTSIAPHSLVPREREQSAPVMYACSLPFPAK

Query:  PGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGN-----PSKELSS-DYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNAS
        P    EDPRK+AAAAV AKLTASTSS +MLSYVLSSLASEG+IGN      ++ LSS D+P EKRPKL+N    Y              PH ++    +S
Subjt:  PGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGN-----PSKELSS-DYPSEKRPKLENDQSPYTLPPNLQRPPVSSFPHPESLQHNAS

Query:  STSQQYTPIDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSL------TQSLPPLAMPGYPNVGAPVTGMSPFTIPT-NSYQSFQASDGN
        ST  Q  P+ PPPP    P  + PL P             G V+  P+ Y++      TQ       P  P     +T +S  + P+ NSYQ FQ  DG 
Subjt:  STSQQYTPIDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSL------TQSLPPLAMPGYPNVGAPVTGMSPFTIPT-NSYQSFQASDGN

Query:  FYNQSSSMPMAPISRQ
        FY  +SS+P+ P++RQ
Subjt:  FYNQSSSMPMAPISRQ

AT5G10060.1 ENTH/VHS family protein6.0e-4829.85Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        M   F+ QIL+DKLA+LN+SQ SIETLSHWCIF+ +KA+ +V TW+KQFH +  +Q++  LYLANDILQNS+R+G+EFV EFW VLP AL+D++  GD+ 
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        G++A  R+                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHM--------------ETGNRNGKQFSIKLKQSASVSLDKIV
                                          I IWEER+VFGSR +SLK+ ++G+ +              ++  R  K    KL  S  V+ +KI 
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHM--------------ETGNRNGKQFSIKLKQSASVSLDKIV

Query:  SGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEIGADVSSGQYH--GSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL
        S Y +V     +E+  ++KC++++  + K++K++    S+ + +    S+A++L     +LR CIE+L +++ SR+SLV+ L++AL+EQE +L+ ++ Q+
Subjt:  SGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEIGADVSSGQYH--GSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQL

Query:  QV
        QV
Subjt:  QV

AT5G65180.1 ENTH/VHS family protein2.8e-4529.82Show/hide
Query:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF
        M   F+ +IL+D LA+LN++Q SI+TLS WCI H ++A+ VV TW+KQFH +   Q++  LYLANDILQNS+R+G+EFV EFWKVLP AL+D++  GD++
Subjt:  MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEF

Query:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV
        G+    RL                                                                                            
Subjt:  GRNAALRLPLVPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSV

Query:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIM----------GKHMETGNRNGKQ--FSIKLKQSASVSLDKIVSG
                                          + IWEER+VFGSR +SLK+ ++           K    G+++ K+   S K K S+    +KIVS 
Subjt:  IWCKFSNSFYNYSISDILAAGAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIM----------GKHMETGNRNGKQ--FSIKLKQSASVSLDKIVSG

Query:  YQVVYGSEVDEDVVLSKCRNSISYLEKLDKEIGADVSSGQ-YHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQV
        + +V     +E+  ++KC++++  + K++K++    S+ +     S+A++L     ILR  +E+L ++E SR SLV+HLREAL+EQE +LE +++Q+QV
Subjt:  YQVVYGSEVDEDVVLSKCRNSISYLEKLDKEIGADVSSGQ-YHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQV

AT5G65180.2 ENTH/VHS family protein8.8e-1536.75Show/hide
Query:  SIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEIGADVSSGQ-YHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREA
        S K K S+    +KIVS + +V     +E+  ++KC++++  + K++K++    S+ +     S+A++L     ILR  +E+L ++E SR SLV+HLREA
Subjt:  SIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEIGADVSSGQ-YHGSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREA

Query:  LQEQEFKLEQVRNQLQV
        L+EQE +LE +++Q+QV
Subjt:  LQEQEFKLEQVRNQLQV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTGGTACATTCAATCCACAGATTTTGGTGGACAAGCTAGCCAGGCTCAACAATTCACAGGCGAGCATTGAGACTTTATCCCATTGGTGTATATTTCACATGAACAA
AGCCAAGCAAGTGGTAGAAACATGGGATAAGCAGTTTCATTGTTCTCCACGCGAGCAGAGATTGGCCTATCTGTATCTTGCAAATGACATTTTGCAGAACAGTAGGCGAA
AAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCAGATGCACTTCGTGATGTAATTGAGAATGGGGATGAGTTTGGAAGAAATGCTGCCCTACGACTGCCACTA
GTTCCATTTATGACTCCACATCATTCAATTGGAGATATTATGGACTACTGTGATCTTGAACTAAAAAATCGGCTCCTTGATCATTTTGCAACACTGCCCTTATATTGTTC
TCATCTTGCCAATATTAAACCCTACCACCACTCTGAGTGCTTCAAGCTTGAGAAAAGGGATTATGGGGGTGTTTCTCTCTCATGGGCCTCATTGAAGGAGTGGCCCTTTC
GGCTACTTATGAATAATGACACTTGGTTACATCCCCTTGACCCTTCAGTTATTTGGTGTAAGTTTAGTAATTCCTTTTATAATTACTCAATTTCTGATATCTTGGCTGCT
GGAGCCTTCTTGTGGCTTAGGCTAGAGGCTTCCCTCTCTTTTATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGGCAGAGTCTTAAGGAAGAGATAATGGG
AAAGCATATGGAAACTGGGAATCGGAATGGGAAGCAATTCAGCATTAAACTGAAACAATCTGCCAGCGTATCATTAGATAAAATAGTTTCTGGTTACCAAGTTGTTTATG
GAAGTGAGGTAGATGAAGATGTGGTACTGAGCAAATGCAGGAATTCTATTAGCTATCTTGAGAAACTGGACAAAGAAATTGGTGCTGATGTCAGTTCAGGGCAATACCAT
GGATCTTCAATTGCAGAGGATCTGAGGGGACATCATACCATTTTGAGGGACTGCATCGAACAATTAACAGCAATTGAAACATCAAGGGCAAGTCTCGTATCTCATCTGAG
AGAGGCTCTTCAAGAACAGGAATTCAAATTGGAGCAAGTCCGAAATCAACTTCAGGTTTGCATTTCTGTTTCACAATATCATCGGATCTTGAAGGCTTTCGTTCTAGATG
CTCAACACATCCTCATCACCAGTCACATTGCTCATCAGCTTGGAAACGGAGGCGATACCTTCTTTTGGACGGACTCATGGATTAATAGTATCCCTCTAGCCTCCAGGTTT
CCCCTTCTTTATCCGCTTTCTCATTGTAAAAATGCATCAGTAAAAGAGGTTTGGAACTCATCAAACAATTTTTGGGACCTCAAACTGGGAAGGAATTTAAAAGACACCGA
GACTACAGAATGGGCTGAGCTTAGTGTTGAACTTGACAACATCACTTTATCCCACACAGAAGACTCATGGATATGCCCCTTAGTCCAAGCGGATCCTTCTCAACAAAGTC
TCTCATCATTGACATGGCAGCAACTGGATCTTGCGGCTTCCCATACCCAGTCGGAACAAACTCAGAATCTCTGCCGTCAGTTTCTAAATTGTGAAAATGTGCAACCTATG
ACTGAGGATGCCTCAAAAGATGCTCAGACCTCGATAGCACCACACAGTCTTGTACCGAGGGAGAGAGAACAGTCAGCGCCAGTAATGTATGCATGCTCATTACCTTTTCC
TGCAAAACCTGGGCCTATCGAGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCTAAGCTAACTGCATCGACATCCTCAGTTCAGATGCTCTCTTATGTCCTAT
CTTCCCTGGCGTCAGAGGGTGTAATTGGAAATCCAAGTAAAGAGTTATCCAGTGATTATCCATCTGAGAAGAGGCCCAAACTTGAAAATGACCAGTCACCCTACACATTG
CCTCCGAATCTGCAGCGACCACCAGTCTCTTCCTTCCCACACCCGGAGTCACTCCAACATAATGCCTCATCCACCAGTCAACAATACACTCCTATTGACCCTCCTCCTCC
CCCATCATCATCTCCACCGCCGATGCCTCCGTTACCTCCTGTAGCGCAGTTCCCTCTGCCCCAGTTCACACAGAATGCTGGGTCAGTAAGTAGCATACCTTACACTTACA
GTTTGACACAGTCGCTGCCTCCATTAGCGATGCCTGGCTATCCAAATGTAGGTGCCCCGGTAACTGGGATGTCTCCTTTTACAATACCAACAAATTCTTACCAGAGTTTT
CAGGCTTCAGATGGTAATTTCTATAATCAATCATCATCCATGCCGATGGCACCAATTTCTAGGCAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGGTGGTACATTCAATCCACAGATTTTGGTGGACAAGCTAGCCAGGCTCAACAATTCACAGGCGAGCATTGAGACTTTATCCCATTGGTGTATATTTCACATGAACAA
AGCCAAGCAAGTGGTAGAAACATGGGATAAGCAGTTTCATTGTTCTCCACGCGAGCAGAGATTGGCCTATCTGTATCTTGCAAATGACATTTTGCAGAACAGTAGGCGAA
AAGGCTCAGAGTTTGTTGGTGAATTTTGGAAAGTCCTTCCAGATGCACTTCGTGATGTAATTGAGAATGGGGATGAGTTTGGAAGAAATGCTGCCCTACGACTGCCACTA
GTTCCATTTATGACTCCACATCATTCAATTGGAGATATTATGGACTACTGTGATCTTGAACTAAAAAATCGGCTCCTTGATCATTTTGCAACACTGCCCTTATATTGTTC
TCATCTTGCCAATATTAAACCCTACCACCACTCTGAGTGCTTCAAGCTTGAGAAAAGGGATTATGGGGGTGTTTCTCTCTCATGGGCCTCATTGAAGGAGTGGCCCTTTC
GGCTACTTATGAATAATGACACTTGGTTACATCCCCTTGACCCTTCAGTTATTTGGTGTAAGTTTAGTAATTCCTTTTATAATTACTCAATTTCTGATATCTTGGCTGCT
GGAGCCTTCTTGTGGCTTAGGCTAGAGGCTTCCCTCTCTTTTATTGGCATTTGGGAAGAGAGAAAAGTTTTTGGATCTCGAGGGCAGAGTCTTAAGGAAGAGATAATGGG
AAAGCATATGGAAACTGGGAATCGGAATGGGAAGCAATTCAGCATTAAACTGAAACAATCTGCCAGCGTATCATTAGATAAAATAGTTTCTGGTTACCAAGTTGTTTATG
GAAGTGAGGTAGATGAAGATGTGGTACTGAGCAAATGCAGGAATTCTATTAGCTATCTTGAGAAACTGGACAAAGAAATTGGTGCTGATGTCAGTTCAGGGCAATACCAT
GGATCTTCAATTGCAGAGGATCTGAGGGGACATCATACCATTTTGAGGGACTGCATCGAACAATTAACAGCAATTGAAACATCAAGGGCAAGTCTCGTATCTCATCTGAG
AGAGGCTCTTCAAGAACAGGAATTCAAATTGGAGCAAGTCCGAAATCAACTTCAGGTTTGCATTTCTGTTTCACAATATCATCGGATCTTGAAGGCTTTCGTTCTAGATG
CTCAACACATCCTCATCACCAGTCACATTGCTCATCAGCTTGGAAACGGAGGCGATACCTTCTTTTGGACGGACTCATGGATTAATAGTATCCCTCTAGCCTCCAGGTTT
CCCCTTCTTTATCCGCTTTCTCATTGTAAAAATGCATCAGTAAAAGAGGTTTGGAACTCATCAAACAATTTTTGGGACCTCAAACTGGGAAGGAATTTAAAAGACACCGA
GACTACAGAATGGGCTGAGCTTAGTGTTGAACTTGACAACATCACTTTATCCCACACAGAAGACTCATGGATATGCCCCTTAGTCCAAGCGGATCCTTCTCAACAAAGTC
TCTCATCATTGACATGGCAGCAACTGGATCTTGCGGCTTCCCATACCCAGTCGGAACAAACTCAGAATCTCTGCCGTCAGTTTCTAAATTGTGAAAATGTGCAACCTATG
ACTGAGGATGCCTCAAAAGATGCTCAGACCTCGATAGCACCACACAGTCTTGTACCGAGGGAGAGAGAACAGTCAGCGCCAGTAATGTATGCATGCTCATTACCTTTTCC
TGCAAAACCTGGGCCTATCGAGGAAGATCCCCGCAAGTCTGCTGCTGCTGCAGTGGCAGCTAAGCTAACTGCATCGACATCCTCAGTTCAGATGCTCTCTTATGTCCTAT
CTTCCCTGGCGTCAGAGGGTGTAATTGGAAATCCAAGTAAAGAGTTATCCAGTGATTATCCATCTGAGAAGAGGCCCAAACTTGAAAATGACCAGTCACCCTACACATTG
CCTCCGAATCTGCAGCGACCACCAGTCTCTTCCTTCCCACACCCGGAGTCACTCCAACATAATGCCTCATCCACCAGTCAACAATACACTCCTATTGACCCTCCTCCTCC
CCCATCATCATCTCCACCGCCGATGCCTCCGTTACCTCCTGTAGCGCAGTTCCCTCTGCCCCAGTTCACACAGAATGCTGGGTCAGTAAGTAGCATACCTTACACTTACA
GTTTGACACAGTCGCTGCCTCCATTAGCGATGCCTGGCTATCCAAATGTAGGTGCCCCGGTAACTGGGATGTCTCCTTTTACAATACCAACAAATTCTTACCAGAGTTTT
CAGGCTTCAGATGGTAATTTCTATAATCAATCATCATCCATGCCGATGGCACCAATTTCTAGGCAATAG
Protein sequenceShow/hide protein sequence
MGGTFNPQILVDKLARLNNSQASIETLSHWCIFHMNKAKQVVETWDKQFHCSPREQRLAYLYLANDILQNSRRKGSEFVGEFWKVLPDALRDVIENGDEFGRNAALRLPL
VPFMTPHHSIGDIMDYCDLELKNRLLDHFATLPLYCSHLANIKPYHHSECFKLEKRDYGGVSLSWASLKEWPFRLLMNNDTWLHPLDPSVIWCKFSNSFYNYSISDILAA
GAFLWLRLEASLSFIGIWEERKVFGSRGQSLKEEIMGKHMETGNRNGKQFSIKLKQSASVSLDKIVSGYQVVYGSEVDEDVVLSKCRNSISYLEKLDKEIGADVSSGQYH
GSSIAEDLRGHHTILRDCIEQLTAIETSRASLVSHLREALQEQEFKLEQVRNQLQVCISVSQYHRILKAFVLDAQHILITSHIAHQLGNGGDTFFWTDSWINSIPLASRF
PLLYPLSHCKNASVKEVWNSSNNFWDLKLGRNLKDTETTEWAELSVELDNITLSHTEDSWICPLVQADPSQQSLSSLTWQQLDLAASHTQSEQTQNLCRQFLNCENVQPM
TEDASKDAQTSIAPHSLVPREREQSAPVMYACSLPFPAKPGPIEEDPRKSAAAAVAAKLTASTSSVQMLSYVLSSLASEGVIGNPSKELSSDYPSEKRPKLENDQSPYTL
PPNLQRPPVSSFPHPESLQHNASSTSQQYTPIDPPPPPSSSPPPMPPLPPVAQFPLPQFTQNAGSVSSIPYTYSLTQSLPPLAMPGYPNVGAPVTGMSPFTIPTNSYQSF
QASDGNFYNQSSSMPMAPISRQ