; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0014401 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0014401
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF620)
Genome locationchr12:442075..444575
RNA-Seq ExpressionLag0014401
SyntenyLag0014401
Gene Ontology termsNA
InterPro domainsIPR006873 - Protein of unknown function DUF620


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572489.1 hypothetical protein SDJN03_29217, partial [Cucurbita argyrosperma subsp. sororia]5.8e-20187.08Show/hide
Query:  MATAVVIGGGG-GGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL
        MATAVVIGGGG       SSKSRR+QIWYS QPL PLME  GPDPQFQDQE NKKDSSASNWEF RDWFKI R NLPPSLP ++ ++  +S T DLKLLL
Subjt:  MATAVVIGGGG-GGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL

Query:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ
        GVLACPLAPIPL     H +SPP  SHFP DTPLETSVAHYIIQQYLAATGCLKQQK  KNMY TGS++MIRCETEVSSGK+VKTVGTR EDTGCFVLWQ
Subjt:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ

Query:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI
        MLP MWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI
Subjt:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI

Query:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF
        RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEEVWSIDDVMFNVAGLSIDYF
Subjt:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF

Query:  IPPADIFDTLHSHSHSHP
        +PPADI DTLHSHS   P
Subjt:  IPPADIFDTLHSHSHSHP

XP_004142167.1 uncharacterized protein LOC101217200 [Cucumis sativus]1.4e-20286.05Show/hide
Query:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL
        MATAV IG GG      SSKSRR KQIWYS QPL PLME  GPDPQFQDQEPNKKDSS SNWEF RDWFKI R NL PS+  ++ ++  +S T DLKLLL
Subjt:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL

Query:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ
        GVLACPLAPIPL       +SPP TS+FP   PLETSV HYIIQQYLAATGCLKQQK  KNMYATGS+KMIRCETEVSSGK+VKTVGTR EDTGCFVLWQ
Subjt:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ

Query:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI
        MLPAMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVI
Subjt:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI

Query:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF
        RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEE+WSIDDVMFNVAGLS+DYF
Subjt:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF

Query:  IPPADIFDTLHSHSHSHPHPHSP
        IPPADIFD+LHSHSH H H HSP
Subjt:  IPPADIFDTLHSHSHSHPHPHSP

XP_008449811.1 PREDICTED: uncharacterized protein LOC103491587 [Cucumis melo]7.4e-20486.76Show/hide
Query:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL
        MATAV I  G GG    SSKSRR KQIWYS QPL PLME  GPDPQFQDQEPNKKDSS SNWEF RDWFKI R NL PS+  ++ ++  +S T DLKLLL
Subjt:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL

Query:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ
        GVLACPLAPI     PLH +SPP TSHFP   PLETSVAHYIIQQYLAATGCLKQQK  KNMYATGS+KMIRCETEVSSGK+VKTVGTR EDTGCFVLWQ
Subjt:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ

Query:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI
        MLPAMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVI
Subjt:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI

Query:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF
        RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEE+WSIDDVMFNVAGLS+DYF
Subjt:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF

Query:  IPPADIFDTLHSHSHSHPHPHSP
        IPPADIFD+LHSHSH H H HSP
Subjt:  IPPADIFDTLHSHSHSHPHPHSP

XP_022952934.1 uncharacterized protein LOC111455462 [Cucurbita moschata]1.2e-20187.53Show/hide
Query:  MATAVVIGGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLLG
        MATAVVI  GGGG    SSKSRR+QIWYS QPL PLME  GPDPQFQDQE NKKDSSASNWEF RDWFKI R NLPPSLP ++ ++  +S T DLKLLLG
Subjt:  MATAVVIGGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLLG

Query:  VLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQM
        VLACPLAPIPL     H +SPP  SHFP DTPLETSVAHYIIQQYLAATGCLKQQK  KNMY TGS+KMIRCETEVSSGK+VKTVGTR EDTGCFVLWQM
Subjt:  VLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQM

Query:  LPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIR
        LP MWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIR
Subjt:  LPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIR

Query:  HVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFI
        HVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEEVWSIDDVMFNVAGLSIDYF+
Subjt:  HVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFI

Query:  PPADIFDTLHSHSHSHP
        PPADI DTLHSHS   P
Subjt:  PPADIFDTLHSHSHSHP

XP_038887014.1 uncharacterized protein LOC120077181 [Benincasa hispida]8.1e-20386.42Show/hide
Query:  MATAVVIGGGG--GGGGGGSSKSRR--KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLP-PSLPAAAPSSSSSSFTNDL
        MATAVVIG GG        SSKSRR  KQIWYS QPL PLME  GPDPQFQDQE NKKDSSASNWEF RDWFKI RN  P  S+P ++ ++  +S T DL
Subjt:  MATAVVIGGGG--GGGGGGSSKSRR--KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLP-PSLPAAAPSSSSSSFTNDL

Query:  KLLLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCF
        KLLLGVLACPLAPI     PLH HSPP  SHFP DTPLETSVAHYIIQQYLAATGCLKQQK  KNMYATGS+KMIRCETEVSSGK+VKTVGTR ED+GCF
Subjt:  KLLLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCF

Query:  VLWQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGP
        VLWQMLP MWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIG+DECFVLKVSAEREAVMERNEGP
Subjt:  VLWQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGP

Query:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLS
        AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCI DY+DVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEEVWSIDDVMFNVAGLS
Subjt:  AEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLS

Query:  IDYFIPPADIFDTLHSHSHSHPHPHSP
        IDYFIPPADIFDTLHSHSH H H HSP
Subjt:  IDYFIPPADIFDTLHSHSHSHPHPHSP

TrEMBL top hitse value%identityAlignment
A0A0A0L1R2 Uncharacterized protein6.7e-20386.05Show/hide
Query:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL
        MATAV IG GG      SSKSRR KQIWYS QPL PLME  GPDPQFQDQEPNKKDSS SNWEF RDWFKI R NL PS+  ++ ++  +S T DLKLLL
Subjt:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL

Query:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ
        GVLACPLAPIPL       +SPP TS+FP   PLETSV HYIIQQYLAATGCLKQQK  KNMYATGS+KMIRCETEVSSGK+VKTVGTR EDTGCFVLWQ
Subjt:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ

Query:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI
        MLPAMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVI
Subjt:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI

Query:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF
        RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEE+WSIDDVMFNVAGLS+DYF
Subjt:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF

Query:  IPPADIFDTLHSHSHSHPHPHSP
        IPPADIFD+LHSHSH H H HSP
Subjt:  IPPADIFDTLHSHSHSHPHPHSP

A0A1S3BNV0 uncharacterized protein LOC1034915873.6e-20486.76Show/hide
Query:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL
        MATAV I  G GG    SSKSRR KQIWYS QPL PLME  GPDPQFQDQEPNKKDSS SNWEF RDWFKI R NL PS+  ++ ++  +S T DLKLLL
Subjt:  MATAVVIGGGGGGGGGGSSKSRR-KQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLL

Query:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ
        GVLACPLAPI     PLH +SPP TSHFP   PLETSVAHYIIQQYLAATGCLKQQK  KNMYATGS+KMIRCETEVSSGK+VKTVGTR EDTGCFVLWQ
Subjt:  GVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQ

Query:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI
        MLPAMWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGED+CFVLKVSAEREAVMERNEGPAEVI
Subjt:  MLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVI

Query:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF
        RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEE+WSIDDVMFNVAGLS+DYF
Subjt:  RHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYF

Query:  IPPADIFDTLHSHSHSHPHPHSP
        IPPADIFD+LHSHSH H H HSP
Subjt:  IPPADIFDTLHSHSHSHPHPHSP

A0A6J1F0N4 uncharacterized protein LOC1114412523.2e-19784.22Show/hide
Query:  MATAVVIGGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRN--NLPPSLPAAAPSSS--SSSFTNDLK
        MAT  VI  GG G    SSKSRRK IWYS QPL PLMEGP   PQFQDQEPNKKD S SNWEFFRDWFKI RN  +LPPS      S++  +S ++ DLK
Subjt:  MATAVVIGGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRN--NLPPSLPAAAPSSS--SSSFTNDLK

Query:  LLLGVLACPLAPIPLSPAPLHLHSPPYTSHFPH---DTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTG
        LLLGVLACPLAPIP       LHS P+  HFP    DTPLETSVAHYIIQQYLAATGCLKQQK  KNMYATGS+KMIRCETEVSSGK VKTVGTRIED G
Subjt:  LLLGVLACPLAPIPLSPAPLHLHSPPYTSHFPH---DTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTG

Query:  CFVLWQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNE
        CFVLWQM+PAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRI+QGLDPKSTARLFEKAQCLGEKRIG+D+CFVLKVSAEREAVMERNE
Subjt:  CFVLWQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNE

Query:  GPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAG
        GPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEEVWSIDDVMFNVAG
Subjt:  GPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAG

Query:  LSIDYFIPPADIFDTLHSHSHSHPH--PHSP
        LS+DYFIPPAD FDT+HSHSHSH H   HSP
Subjt:  LSIDYFIPPADIFDTLHSHSHSHPH--PHSP

A0A6J1GN81 uncharacterized protein LOC1114554625.7e-20287.53Show/hide
Query:  MATAVVIGGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLLG
        MATAVVI  GGGG    SSKSRR+QIWYS QPL PLME  GPDPQFQDQE NKKDSSASNWEF RDWFKI R NLPPSLP ++ ++  +S T DLKLLLG
Subjt:  MATAVVIGGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLLG

Query:  VLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQM
        VLACPLAPIPL     H +SPP  SHFP DTPLETSVAHYIIQQYLAATGCLKQQK  KNMY TGS+KMIRCETEVSSGK+VKTVGTR EDTGCFVLWQM
Subjt:  VLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQM

Query:  LPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIR
        LP MWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIR
Subjt:  LPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIR

Query:  HVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFI
        HVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEEVWSIDDVMFNVAGLSIDYF+
Subjt:  HVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFI

Query:  PPADIFDTLHSHSHSHP
        PPADI DTLHSHS   P
Subjt:  PPADIFDTLHSHSHSHP

A0A6J1I0T2 uncharacterized protein LOC1114684005.3e-20086.43Show/hide
Query:  MATAVVIGGGG---GGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKL
        MATAVVIGGGG         SSKSRR+QIWYS QPL PLME  GPDPQFQDQE NKKDSSASNWEF RDWFKI R NLPPSLP ++ ++  +S T DLKL
Subjt:  MATAVVIGGGG---GGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKL

Query:  LLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVL
        LLGVLACPLAPIPL     + +SPP  SHFP DTPLETSVAHYIIQQYLAATGCLKQQK  KNMY TGS+KMIRCETEVSSGK+VK VGTR EDTGCFVL
Subjt:  LLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVL

Query:  WQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAE
        WQMLP MWSLELVVGGSKVVAGSDG TVWRHTPWLGTHAAKGPQRPLRRIIQGLDPK TARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAE
Subjt:  WQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSID
        VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGS IGDYRDVDGVLIAHRGRSIATVFKFGEMS+QFSRTRMEEVWSIDDVMFNVAGLSID
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSID

Query:  YFIPPADIFDTLHSHSHSHP
        YF+PPADI DTLHSHS   P
Subjt:  YFIPPADIFDTLHSHSHSHP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G27690.1 Protein of unknown function (DUF620)1.4e-9947.99Show/hide
Query:  RRKQIWYSHQP-------LAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSS--FTNDLKLLLGVLACPLAPIPLS
        RR++  Y  QP       LAP++EGP PD +          SS     F R W+    N +   LP A PS SSSS     DL+LLLGVL  PL P+ +S
Subjt:  RRKQIWYSHQP-------LAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSS--FTNDLKLLLGVLACPLAPIPLS

Query:  PAPLHLHSPPYTSHFPH----DTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEV-SSGKAVKTVGTRIEDTGCFVLWQMLPAMWSL
           L           PH    +TP+ETS A YI+QQY AA+G  K   S +N Y  G ++ +  E E  S G   K   ++  ++G FVLW M P MW +
Subjt:  PAPLHLHSPPYTSHFPH----DTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEV-SSGKAVKTVGTRIEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        ELV+GGSKV+AG DGK VWRHTPWLG HAAKGP RPLRR +QGLDP++TA +F  A+C+GEK+I  ++CF+LK+ A+   +  R+EG +E IRH L+GYF
Subjt:  ELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSS-QFSRTRMEEVWSIDDVMFNVAGLSIDYFIPPADI
         QK+G+LV+LEDS LTR+Q   G+AVYWETTI S + DY+ V+G++IAH GRS+AT+ +FG+MSS   ++T M+E W ID++ FNV GLSID FIPP+++
Subjt:  CQKSGVLVYLEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSS-QFSRTRMEEVWSIDDVMFNVAGLSIDYFIPPADI

Query:  -FD-------------TLHSHSH
         FD             TLH H +
Subjt:  -FD-------------TLHSHSH

AT1G49840.1 Protein of unknown function (DUF620)5.6e-10147.92Show/hide
Query:  MATAVVIGGGG--GGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPD-PQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKL
        +A++   GGGG  GG  GG   S+ +        L P+MEGP PD  +    +  +  S  S+W     W +       PS+ +  P+   S    DL+L
Subjt:  MATAVVIGGGG--GGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPD-PQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKL

Query:  LLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVL
        LLGV+  PLAPI +S +   LH          D+P ETS A YI+QQY AA G  K   + KN YA G LKMI  E E  +G  V+   +   +TG FVL
Subjt:  LLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVL

Query:  WQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAE
        WQM P MW +EL VGGSKV AG +GK VWRHTPWLG+H AKGP RPLRR +QGLDP++TA +F +++C+GE+++  ++CF+LK+  + E +  R+EGPAE
Subjt:  WQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAE

Query:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQT-EGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSI
        ++RH+L+GYF Q++G+L  +EDS LTR+Q+ +GDAVYWETTI S + DY+ V+G++IAH GRS+ T+F+FGE++   +RT+MEE W+I++V FNV GLS+
Subjt:  VIRHVLYGYFCQKSGVLVYLEDSHLTRVQT-EGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSI

Query:  DYFIPPADI
        D FIPPAD+
Subjt:  DYFIPPADI

AT1G79420.1 Protein of unknown function (DUF620)4.1e-14463.88Show/hide
Query:  RKQIWYS-HQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHR----NNLPPSLPAAAPSSSSSSFTNDLKLLLGVLACPLAPIPLSPAPLH
        RKQ W +  Q L PLMEGP PD Q    E  KK+SS   WE  R+WFK+H+    N   PS+     S    +   DL+LLLGVL CPLAPI +  + L 
Subjt:  RKQIWYS-HQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHR----NNLPPSLPAAAPSSSSSSFTNDLKLLLGVLACPLAPIPLSPAPLH

Query:  LHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVG----TRIEDTGCFVLWQMLPAMWSLELVVGG
           P   S    + P ETS AHYIIQQYLAATGCLK+ K+ KNMYATG +KM  CETE+++GK+VKT+G     R  D+GCFVLWQM P MWSLELV+GG
Subjt:  LHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVG----TRIEDTGCFVLWQMLPAMWSLELVVGG

Query:  SKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNE--GPAEVIRHVLYGYFCQKS
        +K+++GSDGKTVWRHTPWLGTHAAKGPQRPLRR+IQGLDPK+TA LF KAQCLGE+RIG+D+CFVLKVSA+R++++ERN+   PAEVIRH LYGYFCQKS
Subjt:  SKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNE--GPAEVIRHVLYGYFCQKS

Query:  GVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFIPPADIFDT
        G+LVYLEDSHLTRV T   E +AVYWETTIG+ IGDYRDVDGV +AH GR++ATVF+FGE S Q+SRTRMEE+W IDDV+F+V GLS+D FIPPADIF+ 
Subjt:  GVLVYLEDSHLTRVQT---EGDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFIPPADIFDT

Query:  LHSHSHS
         + ++++
Subjt:  LHSHSHS

AT3G19540.1 Protein of unknown function (DUF620)8.3e-10549.62Show/hide
Query:  GGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLLGVLACPLA
        GGGGGGGGGG   ++ +Q+      L P+MEGP PD         +     S       W K   +  P      + +++++   NDL+LLLGV+  PLA
Subjt:  GGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLLGVLACPLA

Query:  PIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQMLPAMWSL
        PI +S       S P       +TP+ETS A YI+QQY AA+G  K Q S KN YA G LKMI  E E ++ + V+       +TG FVLWQM P MW +
Subjt:  PIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQMLPAMWSL

Query:  ELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIRHVLYGYF
        EL VGGSKV AG +GK VWRHTPWLG+H AKGP RPLRR +QGLDP++TA +F +A+C+GEK++  ++CF+LK+  + E +  R+EGPAE+IRHVL+GYF
Subjt:  ELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIRHVLYGYF

Query:  CQKSGVLVYLEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFIPPADI
         QK+G+LV++EDSHLTR+Q+  G+ V+WETT  S + DYR V+G++IAH G S+ T+F+FGE+++  +RT+MEE W+I++V FNV GLS+D FIPPAD+
Subjt:  CQKSGVLVYLEDSHLTRVQTE-GDAVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFIPPADI

AT5G05840.1 Protein of unknown function (DUF620)2.0e-8245.65Show/hide
Query:  DLKLLLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCE---------TEVSSGKAVKT
        +++LLLGV+  PL P+P+ P   + +  P       D PLE S+A YI++QY+AA G  +   + ++MYA G ++M   E         +++   +++K+
Subjt:  DLKLLLGVLACPLAPIPLSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCE---------TEVSSGKAVKT

Query:  VGTRIEDTGCFVLWQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAE
         G  +   G FVLWQ    +W LELVV G K+ AGSD K  WR TPW  +HA++GP RPLRR +QGLDPKSTA LF ++ C+GEK+I +++CF+LK+ AE
Subjt:  VGTRIEDTGCFVLWQMLPAMWSLELVVGGSKVVAGSDGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAE

Query:  REAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWS
          A+  R+    E+IRH ++G F Q++G+L+ LEDSHL R++ + D +++WETT+ S I DYR VDG+L+AH G+S  ++F+FGE S   SRTRMEE W 
Subjt:  REAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD-AVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWS

Query:  IDDVMFNVAGLSIDYFIPPADI
        I+++ FN+ GLS+D F+PP+D+
Subjt:  IDDVMFNVAGLSIDYFIPPADI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGACGGCGGTTGTGATCGGCGGCGGGGGCGGGGGCGGGGGCGGAGGCTCATCGAAGTCGAGGAGAAAGCAAATTTGGTACTCTCATCAGCCTCTGGCTCCGCTGAT
GGAAGGGCCCGGACCCGACCCGCAGTTCCAAGACCAAGAACCCAACAAGAAAGACTCCTCCGCTTCCAACTGGGAATTCTTCCGCGACTGGTTCAAGATCCACCGCAACA
ACCTCCCGCCCTCTCTCCCCGCCGCCGCCCCATCCTCCTCCTCCTCCTCCTTTACCAATGACTTGAAGCTCTTGCTCGGCGTCCTCGCCTGCCCACTCGCTCCCATTCCC
CTCTCTCCCGCTCCGCTTCACCTCCATTCCCCTCCTTATACTTCCCACTTTCCACACGATACCCCACTTGAAACCTCCGTGGCTCATTACATTATACAGCAATACTTGGC
TGCCACCGGATGTCTCAAACAGCAAAAGTCTCCCAAGAACATGTATGCCACGGGAAGCCTCAAGATGATTCGCTGCGAAACAGAGGTTTCTTCGGGCAAGGCTGTGAAGA
CCGTAGGCACAAGAATCGAGGACACCGGCTGCTTTGTTCTTTGGCAGATGCTGCCGGCCATGTGGTCCCTCGAATTGGTGGTCGGAGGCAGTAAGGTGGTCGCCGGCAGC
GACGGCAAGACCGTCTGGCGTCACACTCCCTGGCTCGGCACCCATGCCGCCAAAGGCCCCCAACGACCTCTCCGTCGCATCATTCAGGGTCTAGACCCAAAGAGCACAGC
GAGGCTGTTCGAGAAAGCCCAATGCCTGGGGGAAAAGCGGATCGGGGAAGACGAGTGCTTCGTGCTGAAAGTATCGGCAGAGCGGGAGGCAGTGATGGAGAGGAACGAGG
GGCCTGCGGAAGTGATAAGGCACGTGCTGTACGGATACTTCTGCCAGAAGAGCGGAGTGCTTGTGTACCTAGAGGACTCTCACCTGACCAGGGTCCAGACCGAAGGCGAC
GCCGTCTACTGGGAAACCACCATCGGGAGCTGCATCGGAGACTACAGGGACGTGGACGGCGTCCTCATCGCCCATCGGGGCAGGTCCATAGCCACCGTGTTCAAGTTTGG
GGAAATGTCGAGCCAATTTAGCAGGACCCGGATGGAAGAGGTCTGGAGCATTGACGATGTGATGTTCAACGTTGCTGGCCTTAGCATCGACTACTTTATTCCTCCCGCTG
ATATTTTTGATACTCTTCATTCTCATTCTCATTCTCATCCTCATCCTCATTCTCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGACGGCGGTTGTGATCGGCGGCGGGGGCGGGGGCGGGGGCGGAGGCTCATCGAAGTCGAGGAGAAAGCAAATTTGGTACTCTCATCAGCCTCTGGCTCCGCTGAT
GGAAGGGCCCGGACCCGACCCGCAGTTCCAAGACCAAGAACCCAACAAGAAAGACTCCTCCGCTTCCAACTGGGAATTCTTCCGCGACTGGTTCAAGATCCACCGCAACA
ACCTCCCGCCCTCTCTCCCCGCCGCCGCCCCATCCTCCTCCTCCTCCTCCTTTACCAATGACTTGAAGCTCTTGCTCGGCGTCCTCGCCTGCCCACTCGCTCCCATTCCC
CTCTCTCCCGCTCCGCTTCACCTCCATTCCCCTCCTTATACTTCCCACTTTCCACACGATACCCCACTTGAAACCTCCGTGGCTCATTACATTATACAGCAATACTTGGC
TGCCACCGGATGTCTCAAACAGCAAAAGTCTCCCAAGAACATGTATGCCACGGGAAGCCTCAAGATGATTCGCTGCGAAACAGAGGTTTCTTCGGGCAAGGCTGTGAAGA
CCGTAGGCACAAGAATCGAGGACACCGGCTGCTTTGTTCTTTGGCAGATGCTGCCGGCCATGTGGTCCCTCGAATTGGTGGTCGGAGGCAGTAAGGTGGTCGCCGGCAGC
GACGGCAAGACCGTCTGGCGTCACACTCCCTGGCTCGGCACCCATGCCGCCAAAGGCCCCCAACGACCTCTCCGTCGCATCATTCAGGGTCTAGACCCAAAGAGCACAGC
GAGGCTGTTCGAGAAAGCCCAATGCCTGGGGGAAAAGCGGATCGGGGAAGACGAGTGCTTCGTGCTGAAAGTATCGGCAGAGCGGGAGGCAGTGATGGAGAGGAACGAGG
GGCCTGCGGAAGTGATAAGGCACGTGCTGTACGGATACTTCTGCCAGAAGAGCGGAGTGCTTGTGTACCTAGAGGACTCTCACCTGACCAGGGTCCAGACCGAAGGCGAC
GCCGTCTACTGGGAAACCACCATCGGGAGCTGCATCGGAGACTACAGGGACGTGGACGGCGTCCTCATCGCCCATCGGGGCAGGTCCATAGCCACCGTGTTCAAGTTTGG
GGAAATGTCGAGCCAATTTAGCAGGACCCGGATGGAAGAGGTCTGGAGCATTGACGATGTGATGTTCAACGTTGCTGGCCTTAGCATCGACTACTTTATTCCTCCCGCTG
ATATTTTTGATACTCTTCATTCTCATTCTCATTCTCATCCTCATCCTCATTCTCCATGA
Protein sequenceShow/hide protein sequence
MATAVVIGGGGGGGGGGSSKSRRKQIWYSHQPLAPLMEGPGPDPQFQDQEPNKKDSSASNWEFFRDWFKIHRNNLPPSLPAAAPSSSSSSFTNDLKLLLGVLACPLAPIP
LSPAPLHLHSPPYTSHFPHDTPLETSVAHYIIQQYLAATGCLKQQKSPKNMYATGSLKMIRCETEVSSGKAVKTVGTRIEDTGCFVLWQMLPAMWSLELVVGGSKVVAGS
DGKTVWRHTPWLGTHAAKGPQRPLRRIIQGLDPKSTARLFEKAQCLGEKRIGEDECFVLKVSAEREAVMERNEGPAEVIRHVLYGYFCQKSGVLVYLEDSHLTRVQTEGD
AVYWETTIGSCIGDYRDVDGVLIAHRGRSIATVFKFGEMSSQFSRTRMEEVWSIDDVMFNVAGLSIDYFIPPADIFDTLHSHSHSHPHPHSP