; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr025170 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr025170
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function (DUF688)
Genome locationtig00003412:1909460..1922627
RNA-Seq ExpressionSgr025170
SyntenySgr025170
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR007789 - Protein of unknown function DUF688
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572395.1 hypothetical protein SDJN03_29123, partial [Cucurbita argyrosperma subsp. sororia]5.6e-26068.5Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        MISLRNLME+KQLD NQPLLSVRRFTSTGTSS+T EK R E +IP  PVYKSELKSGPVRNPGTVPF+WEQTPGKPKDEST  TQ    PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA
           + QEVPTSINASLDENARET EE  SC+SGND+EEE+DEVYRDANDIFSRSESFFLNCSISGVSGLD SEIKPSG S MDP TRNFMMDRFLPAAKA
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA

Query:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR
        MASETP HT RKQTV  ERP+EIK++TNGDRQS+ NLH+QTQHV+EIF EES+DEDDD+D+S Y SARGCGFLP+FCLKG  VLLNPVP MRMQATS+RR
Subjt:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR

Query:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR
         RNS  GSSKDAVNE++SSHGQGITR ++EE A         SNIQE D FSLYRHLQGEGIS YPNEPS+A   +VNPSLG   K T    NEYDE  R
Subjt:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR

Query:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------
        R L+SF ALLGDESGSASP VEKTLY+DSVHKI SP+SSSNSL+ KGISYSGDMIDDASI+S + KEL  L+S TQD++                     
Subjt:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------

Query:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII
           T +     EV   +NFS+                    +K +LENQFPLKPS+RGD+NGLA D T L SS G ++EK+NL R QPEKS Y  NN+I+
Subjt:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII

Query:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS
        PEYGKK ES+G+KL           P +DSP LRTSE ATNG++DS NQ LKRVGNEDGSRGGY  SRL FAPPPPKSPS+SWLKRTLPTSSRN+ FLQS
Subjt:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS

Query:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        S  M++NP+S T S D K+ STVKSS+TNH HLQFSK    S+ ++
Subjt:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

XP_022147966.1 uncharacterized protein LOC111016765 [Momordica charantia]4.9e-28071.79Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        M SLRNLMED QLDFNQPLLSVRRF+STGT SETKEKT TE KIPPLPVYKSELKSGPVRNPGTVPF+WEQTPGKPKDEST KT+ LT PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESG-NDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAK
        VPNVKQEVP S+NAS DENARE GE++A CESG  +EEE++DEV+RDAND FSRSESFF NCSISGVSGL+GSEIKPSGAS MDPQ RNFM+DRFLPAAK
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESG-NDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAK

Query:  AMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMR
        AMASETP HTTRKQT T+ERP+EIKM+TNG RQS+PNLH+QT HVQEI  EES+DEDD Y  SGY SARGCG LP+FCLKG FVLLNPVP +RMQ TS+R
Subjt:  AMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMR

Query:  RNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETC
        + RNS IG+ KDA NE++ S+GQGITRLQMEE ASKSSK NGK+  QERD FSLYR+LQGEGISNYP E SRA  T+VNP LG  E  T  ATNEYDE  
Subjt:  RNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETC

Query:  RRILSSFQALLG-DESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ-------------------
        RR LSSFQALLG DESGSASP VEKTLY+DSVHKI+SPNSSSNS +MKGIS+S DMIDDAS++S + KE YLL+SATQDI+                   
Subjt:  RRILSSFQALLG-DESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ-------------------

Query:  -----TGTYSGCCEVVKLANFSQ----------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGN
             T +     EV   +N+S+                      +K DLENQFPLKP+NR DSN +A+DT  L SSDGT+NEKI + R QPEKSSYEGN
Subjt:  -----TGTYSGCCEVVKLANFSQ----------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGN

Query:  NLIIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFL
        NLIIP+YGKKLES+GK L+ET IQESFH P +DSPDLRTSE ATNG RDS N+ LKR GNE  SR GYSRL FAPPPPKSPSDSWLKRTLPT SRNSSFL
Subjt:  NLIIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFL

Query:  QSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        QSSL MR NP+S+TT  DPKSES VKSS+TNHQHLQF K    S+ ++
Subjt:  QSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

XP_022952581.1 uncharacterized protein LOC111455232 [Cucurbita moschata]8.4e-26468.9Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        MISLRNLME+KQLD NQPLLSVRRFTSTGTSS+T EK R E +IP  PVYKSELKSGPVRNPGTVPF+WEQTPGKPKDEST KTQ    PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA
           + QEVPTSINASLDENARET EE  SC+SGND+EEE+DEVYRDANDIFSRSESFFLNCSISGVSGLD SEIKPSG S MDP TRNFMMDRFLPAAKA
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA

Query:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR
        MASETP HT RKQTV  ERP+EIK++TNGDRQS+PNLH+QTQHV+EIF EES+DEDDD+DES Y S RGCGFLP+FCLKG  VLLNPVP MRMQATS+RR
Subjt:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR

Query:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR
         RNS  GSSKDAVNE++SSHGQGITR ++EE A         SNIQE D FSLYRHLQGEGIS YPNEPS+A   +VNPSLG   K T   TNEYDE  R
Subjt:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR

Query:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------
        R L+SF ALLGDESGSASP VEKTLY+DSVHKI SP+SSSNSL+ KGISYSGDMIDDASI+S + KEL  L+S TQD++                     
Subjt:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------

Query:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII
           T +     EV   +NFS+                    +K +LENQFPLKPS+RGD+NGLA D T L SS G ++EK+NL R QPEKS Y  NN+I+
Subjt:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII

Query:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS
        PEYGKK ES+G+KL           P +DSP LRTSE ATNG++DS NQ LKRVGNEDGSRGGY  SRL FAPPPPKSPS+SWLKRTLPTSSRN+ FLQS
Subjt:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS

Query:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        S  M++NP+S T S D K+ STVKSS+TNH HLQFSK    S+ ++
Subjt:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

XP_023511564.1 uncharacterized protein LOC111776366 [Cucurbita pepo subsp. pepo]5.6e-26068.41Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        MISLRNLME+KQLD NQPLLSVRRFTSTGTSS+T EKTR E +IP  PVYKSELKSGPVRNPGTVPF+WEQTPGKPKDEST KTQ    PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESGN-DEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAK
           + QEVPTSINASLDENARET EE  SC+SGN D++EE+DEVYRDANDIFSRSESFFLNCSISGVSGLD SEIKPSG S MDP TR+FMMDRFLPAAK
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESGN-DEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAK

Query:  AMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMR
        AMASETP HT RKQTV  ERP+EIK++TNGDRQS+PNLH+QTQHV+EIF EES+DEDDD+DESGY S RGCGFLP+FCLKG  VLLNPVP MRMQATS+R
Subjt:  AMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMR

Query:  RNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETC
        R RNS  GSSKDAVNE++SSHGQGITR ++EE A         SNIQE D F LYRHLQGEGIS YP EPS+A   +VNPSLG   K T   TNEYDE  
Subjt:  RNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETC

Query:  RRILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ--------------------
        RR L+SF ALLGDESGSASP VEKTLY+DSVHKI SP+SSSNSL+ KGISYSGDMIDDASI+S + KEL  L+SATQD++                    
Subjt:  RRILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ--------------------

Query:  ----TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLI
            T +     EV   +NFS+                    +K +LENQFPLKPS+RGD+NGLA D T L SS   ++EK+NL R QPEKS Y  NN+I
Subjt:  ----TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLI

Query:  IPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQ
        +PEYGKK ES+G+K            P +DSP LRTSE ATNG++DS NQ LKRVGNEDGSRGGY  SRL FAPPPPKSPS+SWLKRTLPTSSRN+ FLQ
Subjt:  IPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQ

Query:  SSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        SS  M+LNP+S T S D K+ STVKSS+ NH HLQFSK    S+ ++
Subjt:  SSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

XP_038888421.1 uncharacterized protein LOC120078273 isoform X1 [Benincasa hispida]7.4e-26067.69Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        MISLRNLME+KQLDFNQPLLSVRRFTST TSSET EK R E KIPPLPVYKS+LKSGPVR PGTVPFIWEQTPGKPKDES+TKTQ    PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA
        V  V QEVPTS++ASLDENARE  EE  SC+SGND+EEEE+EVY+DAND FSRSESFFLNCSISGVSGLD SEIK SGAS MDP TR+FMMDRFLPAAKA
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA

Query:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR
        MASETP HT RKQTV++ERP+EIK++TNGDR+S+PNLH+QT+HV+EIF EES+DEDDDYDESGY S RGCGFLP+ CLKG FVLLNPV  MRMQATS+R+
Subjt:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR

Query:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR
         RNS I SSKDAVN+++SSHGQ IT+  +EE A KSSK  G+ NIQE D FSLYRHLQ EGISNY NEPS+A   +VNPSLG  EK T  ATNEYD+  R
Subjt:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR

Query:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------
        + L+SFQALL DESGSASP VEKTLY+DSVHKIKSP+SSSNS  MKGISY GDMIDD  I+S + KEL  L+SAT D++                     
Subjt:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------

Query:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII
           T +     +V   +N+S+                    +K DLENQFPLKPS+RGDSNGLA D T              L R QPEKSSYEGN++++
Subjt:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII

Query:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS
        PE+GKK E +G+KL          GP  DSPD+RTSEGATNGK+DS NQFLKRVGNEDGS  GY  SRLPFAPPPPKSPS+SWLKRTLPTSS+N+SFLQS
Subjt:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS

Query:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        S  MR+NP+SM  S D K  STVKSSN NH HLQFSK    S+ ++
Subjt:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

TrEMBL top hitse value%identityAlignment
A0A0A0K1U5 Uncharacterized protein9.4e-25366.71Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        MISLRNLME+KQLDFNQPLLSVRRFTST TS ET EKTR E KIPPLPVYKSELKSGPVR PGTVPF+WE+TPGKPKDES+T TQ    PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA
        V  V QEV TS+NASLDENARE  EE  SC+SGND+EEEE+EVYRDAND FSRSESFFLNCSISGVSGLD SEIKPS AS MDP TR+FMMDRFLPAAKA
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA

Query:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR
        MASETP HT RKQTV+ ERP+E+K++TN DRQS+PNLH+QT+HV+EIF EES+DEDDDYDESGY S +GCGFLP+FCLKG F LLNPVP MRMQATS+RR
Subjt:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR

Query:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR
         RNS IG SKDAVNE++  HGQGIT+ Q+EE A+KSSK +GKSNIQE D FSLYRHLQ E +SNYPNEPS+A   +VNPSL    K TK  TNEY+E+ R
Subjt:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR

Query:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTYSG---------------
        R L+SFQALL DESGSASP VEKTLY+DSVHKIKSP+SSSNSL++KGISYSGDMIDD  I+S + KEL  L+SAT D++     G               
Subjt:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTYSG---------------

Query:  ----CCEV----VKL--------------------ANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIP
            C ++    VKL                    + F+ +K DLENQFPLKPS+R DSN L  D T                R QPEKSSYEGNN+  P
Subjt:  ----CCEV----VKL--------------------ANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIP

Query:  EYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYS--RLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQSS
        EYGKK E + +K+          GP + S DLRTSEGATNGK DS N+F KRVGN DGS  GYS  RLPFAPPPPKSPS+SWLKRTLPTSSRN+SFLQSS
Subjt:  EYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYS--RLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQSS

Query:  LVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
          MR+NPIS+TTS +    STV++ +TN+ HLQFSK    S+ ++
Subjt:  LVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

A0A5A7SPV3 Uncharacterized protein3.8e-25466.98Show/hide
Query:  NYHMISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLP
        N HMISLRNLME+KQLDFNQPLLSVRRFTST TSSET EKTR E KIPPLPVYKSELKSGPVR PGTVPF+WE+TPGKPKDES+  TQ    PPLVPKLP
Subjt:  NYHMISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLP

Query:  PGRVPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPA
        PGRV  + QEV TS+NASLDEN RE  EE ASC SGNDEEEEEDEVYRDAND FSRSESFFLNCSISGVSGLD SEIKPS AS MDP TR+FMMDRFLPA
Subjt:  PGRVPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPA

Query:  AKAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATS
        AKAMASETP HT RKQTV+ ERP+E+K++TN DRQS+PNLH+QT+HV+EIF EES+DEDDDYDESGY S +GCGFLP+FCLKG F LLNPVP MRMQATS
Subjt:  AKAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATS

Query:  MRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDE
        +RR RNS IGSSKDAVNE+Q  HGQGIT+ Q+EE A+KSSK +GKSNIQE D FSLYRHLQGE +SNYPNEPS+A   +VN SLG   K TK  TNE++E
Subjt:  MRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDE

Query:  TCRRILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTYSG------------
        + RR L+SFQALL DESGS SP VEKTLY+DSVHKIKSP+SSSNSL+MKGISYSGDMIDD  I+S + KEL  L+SAT D++     G            
Subjt:  TCRRILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTYSG------------

Query:  -------CCEV----VKL--------------------ANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNL
               C ++    VKL                    + F+ +K DLENQFPLKPS+R DSN L  D T                R QPEKSSYEGNN+
Subjt:  -------CCEV----VKL--------------------ANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNL

Query:  IIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYS--RLPFAPPPPKSPSDSWLKRTLPTSSRNSSFL
        I PEYGK  E + +KL          GP + S DLRTSEGATNG  DS N + KRVGNEDGS  GYS  RLPFAPPPPKSPS+SWLKRTLPTSSRN+SFL
Subjt:  IIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYS--RLPFAPPPPKSPSDSWLKRTLPTSSRNSSFL

Query:  QSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        QSS  MR+NP+S+T S +    STVK+ +TN+ HLQFSK    S+ ++
Subjt:  QSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

A0A6J1D3Y5 uncharacterized protein LOC1110167652.4e-28071.79Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        M SLRNLMED QLDFNQPLLSVRRF+STGT SETKEKT TE KIPPLPVYKSELKSGPVRNPGTVPF+WEQTPGKPKDEST KT+ LT PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESG-NDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAK
        VPNVKQEVP S+NAS DENARE GE++A CESG  +EEE++DEV+RDAND FSRSESFF NCSISGVSGL+GSEIKPSGAS MDPQ RNFM+DRFLPAAK
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESG-NDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAK

Query:  AMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMR
        AMASETP HTTRKQT T+ERP+EIKM+TNG RQS+PNLH+QT HVQEI  EES+DEDD Y  SGY SARGCG LP+FCLKG FVLLNPVP +RMQ TS+R
Subjt:  AMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMR

Query:  RNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETC
        + RNS IG+ KDA NE++ S+GQGITRLQMEE ASKSSK NGK+  QERD FSLYR+LQGEGISNYP E SRA  T+VNP LG  E  T  ATNEYDE  
Subjt:  RNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETC

Query:  RRILSSFQALLG-DESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ-------------------
        RR LSSFQALLG DESGSASP VEKTLY+DSVHKI+SPNSSSNS +MKGIS+S DMIDDAS++S + KE YLL+SATQDI+                   
Subjt:  RRILSSFQALLG-DESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ-------------------

Query:  -----TGTYSGCCEVVKLANFSQ----------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGN
             T +     EV   +N+S+                      +K DLENQFPLKP+NR DSN +A+DT  L SSDGT+NEKI + R QPEKSSYEGN
Subjt:  -----TGTYSGCCEVVKLANFSQ----------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGN

Query:  NLIIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFL
        NLIIP+YGKKLES+GK L+ET IQESFH P +DSPDLRTSE ATNG RDS N+ LKR GNE  SR GYSRL FAPPPPKSPSDSWLKRTLPT SRNSSFL
Subjt:  NLIIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFL

Query:  QSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        QSSL MR NP+S+TT  DPKSES VKSS+TNHQHLQF K    S+ ++
Subjt:  QSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

A0A6J1GL08 uncharacterized protein LOC1114552324.1e-26468.9Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        MISLRNLME+KQLD NQPLLSVRRFTSTGTSS+T EK R E +IP  PVYKSELKSGPVRNPGTVPF+WEQTPGKPKDEST KTQ    PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA
           + QEVPTSINASLDENARET EE  SC+SGND+EEE+DEVYRDANDIFSRSESFFLNCSISGVSGLD SEIKPSG S MDP TRNFMMDRFLPAAKA
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA

Query:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR
        MASETP HT RKQTV  ERP+EIK++TNGDRQS+PNLH+QTQHV+EIF EES+DEDDD+DES Y S RGCGFLP+FCLKG  VLLNPVP MRMQATS+RR
Subjt:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR

Query:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR
         RNS  GSSKDAVNE++SSHGQGITR ++EE A         SNIQE D FSLYRHLQGEGIS YPNEPS+A   +VNPSLG   K T   TNEYDE  R
Subjt:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR

Query:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------
        R L+SF ALLGDESGSASP VEKTLY+DSVHKI SP+SSSNSL+ KGISYSGDMIDDASI+S + KEL  L+S TQD++                     
Subjt:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------

Query:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII
           T +     EV   +NFS+                    +K +LENQFPLKPS+RGD+NGLA D T L SS G ++EK+NL R QPEKS Y  NN+I+
Subjt:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII

Query:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS
        PEYGKK ES+G+KL           P +DSP LRTSE ATNG++DS NQ LKRVGNEDGSRGGY  SRL FAPPPPKSPS+SWLKRTLPTSSRN+ FLQS
Subjt:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS

Query:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        S  M++NP+S T S D K+ STVKSS+TNH HLQFSK    S+ ++
Subjt:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

A0A6J1HZQ3 uncharacterized protein LOC1114681308.8e-25967.56Show/hide
Query:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR
        MISLRNLME+KQLD NQPLLSVRRFTSTGTSS+T EK R E +IP  PVYKSELKSGPVRNPGTVPF+WEQTPGKPKDEST KTQ    PPLVPKLPPGR
Subjt:  MISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGR

Query:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA
           + QEVPTSINASLDEN RET EE  SC+SGND+EEE+DEVYRDANDIFSRSESFFLNCSISGVSGLD SEIKPSG S MDP +R+FMMDRFLPAAKA
Subjt:  VPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASCMDPQTRNFMMDRFLPAAKA

Query:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR
        MASETP HT RKQTV  ERP+EIK++ NGDRQS+ NLH+QTQHV+EIF EES+ EDDD+DESGY S RGCGFLP+FCLKG  VL+NPVP MRMQATS+RR
Subjt:  MASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQATSMRR

Query:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR
         RNS  GSSKDAVNE++SSHGQGITR ++EE A         SNIQE D FSLYRHLQGEGIS YPNEPS+A   +VNPSLG   K T   TNEYDE CR
Subjt:  NRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCR

Query:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------
        R L+SF ALLGDESGSASP VEKTLY+DSVHKI SP+SSSNSL+ KGISY+GD+IDDASI+S + KEL  L+SATQD++                     
Subjt:  RILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQ---------------------

Query:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII
           T +     EV   +NFS+                    +K +LENQFPLKPS+RGD NGLA D T L SS G  +EK+NL + QPEKS Y  N++I+
Subjt:  ---TGTYSGCCEVVKLANFSQ--------------------QKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLII

Query:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS
        PEY KK ES+G+KL           P +DSP LRTSE ATNG++DS NQ LKRVGNEDGSRGGY  SRL FAPPPPKSPS+SWLKRTLPTSSRN+ FLQS
Subjt:  PEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGY--SRLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQS

Query:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI
        S  M++NP+S T S D K+ STVKSS+T H HLQFSK    S+ ++
Subjt:  SLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQI

SwissProt top hitse value%identityAlignment
Q9LJG8 Trihelix transcription factor ASIL22.2e-2030.12Show/hide
Query:  VAVYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSLRTNIQCKNRIDTLKKRYKIEKARIDESGGSYVSTWPFFSCLDDLTGNSQK-PSTPVAVS
        V + AWGER+L+L+RGNL+ KHW+EVA+ V+SR  +  K  +T+IQCKNRIDT+KK+YK EK RI   GG   S W FF  LD L G++ K P+    VS
Subjt:  VAVYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSLRTNIQCKNRIDTLKKRYKIEKARIDESGGSYVSTWPFFSCLDDLTGNSQK-PSTPVAVS

Query:  ----------------------NCKASPVTTP-----------------------------GLSLFSKVPVGPRS---GTKKRRSTHVYRSLATRTFAGM
                              + +A   T P                              +++   +P+  RS   G + R      R+L  +   GM
Subjt:  ----------------------NCKASPVTTP-----------------------------GLSLFSKVPVGPRS---GTKKRRSTHVYRSLATRTFAGM

Query:  QIR----------------------------------------KRVK-ETTDRLLG------YRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAK
         ++                                        KR+K E   +  G      +R+L RAI    + YE+ E AK + +VE+E +RM+F K
Subjt:  QIR----------------------------------------KRVK-ETTDRLLG------YRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAK

Query:  DLEYQRMQLLMEMHLQIQKIKR
        +LE QRMQ  ++  L+I ++K+
Subjt:  DLEYQRMQLLMEMHLQIQKIKR

Q9SYG2 Trihelix transcription factor ASIL11.2e-1833.6Show/hide
Query:  VAVYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSLRTNIQCKNRIDTLKKRYKIEKARIDESGGSYVSTWPFFSCLDDLTGNSQKPSTPVAVSN
        V + AWG+R  +  +G L+ +HW+EVA+ VN       K  +T+IQCKNRIDT+KK+YK EKA+I  + G   S W FF  L+ L G      T   +++
Subjt:  VAVYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSLRTNIQCKNRIDTLKKRYKIEKARIDESGGSYVSTWPFFSCLDDLTGNSQKPSTPVAVSN

Query:  CKASPVTTPG-------LSLFSKVPVG---PRSGTKKRRST----HVYRSLATRTFA----------------------------GMQIRKRVKETTDRL
         KAS     G        S+F +   G    +   +KR S     H  +  A+ T +                             M  R +V ++    
Subjt:  CKASPVTTPG-------LSLFSKVPVG---PRSGTKKRRST----HVYRSLATRTFA----------------------------GMQIRKRVKETTDRL

Query:  LGYRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAKDLEYQRMQLL
         G   +ARAI   T+ YE+ E AK + M ELE +RM+FAK++E QRMQ L
Subjt:  LGYRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAKDLEYQRMQLL

Arabidopsis top hitse value%identityAlignment
AT2G30990.1 Protein of unknown function (DUF688)2.5e-5631.65Show/hide
Query:  LMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTR-----TELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGRV
        +ME+KQLDFN+PL+S+RR T T   SE++ KTR     T +  P  PVYKS++KSGPVRNPGTVPF WE  PGKPKDE     Q    P  VPKLPPGR 
Subjt:  LMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTR-----TELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGRV

Query:  PNVK-QEVPTSINASLDENARETGE----EEASCESG--NDEEEEEDEVYRDANDIFSRSESFFLNCS-ISGVSGLDGSEI--KPSGASCMDPQTRNFMM
          V+    P S  A        + +    E+A   S   +D++++ D  Y DA D  SR+ESFF NCS +SG SGLDGS I  +P G    D QT++ MM
Subjt:  PNVK-QEVPTSINASLDENARETGE----EEASCESG--NDEEEEEDEVYRDANDIFSRSESFFLNCS-ISGVSGLDGSEI--KPSGASCMDPQTRNFMM

Query:  DRFLPAAKAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEM
         RFLPAAKA+ SE+P H  RK     E  K++        +  P            F    + E++D + S  +++  CG LPQ CL+    LLNPVP +
Subjt:  DRFLPAAKAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEM

Query:  RMQ---ATSMRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGT
        RMQ   A S+RR R+    S+    NE  +   +   +L++ E  +K S                    QGE +                 S+    +G 
Subjt:  RMQ---ATSMRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGT

Query:  KPATNEYDETCRRILSSFQALLGD-----ESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTY
        +   N    +  +I  +F  LL       E  S +PV EKTLYVD VH +                       D  ++    K+  L  S + DI     
Subjt:  KPATNEYDETCRRILSSFQALLGD-----ESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTY

Query:  SGCCEVVKLANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIPEYGKKLESEGKKLVETSIQESFHGPP
                               P+K     D     +    +   +G  +E      SQ  +   +   ++       +E   +K ++  +Q       
Subjt:  SGCCEVVKLANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIPEYGKKLESEGKKLVETSIQESFHGPP

Query:  QDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPT-SSRNSSF--LQSSLVMRLNPISMTTSSDPKSESTVKSS
         +S  L                           R  Y  +P  PP PK+PSDSWLKRTLPT   +N+SF  LQS      N  +  T ++PK E+ VK+S
Subjt:  QDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPT-SSRNSSF--LQSSLVMRLNPISMTTSSDPKSESTVKSS

Query:  NTNHQHLQFSK
        NT    + FSK
Subjt:  NTNHQHLQFSK

AT2G30990.2 Protein of unknown function (DUF688)2.5e-5631.65Show/hide
Query:  LMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTR-----TELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGRV
        +ME+KQLDFN+PL+S+RR T T   SE++ KTR     T +  P  PVYKS++KSGPVRNPGTVPF WE  PGKPKDE     Q    P  VPKLPPGR 
Subjt:  LMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTR-----TELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGRV

Query:  PNVK-QEVPTSINASLDENARETGE----EEASCESG--NDEEEEEDEVYRDANDIFSRSESFFLNCS-ISGVSGLDGSEI--KPSGASCMDPQTRNFMM
          V+    P S  A        + +    E+A   S   +D++++ D  Y DA D  SR+ESFF NCS +SG SGLDGS I  +P G    D QT++ MM
Subjt:  PNVK-QEVPTSINASLDENARETGE----EEASCESG--NDEEEEEDEVYRDANDIFSRSESFFLNCS-ISGVSGLDGSEI--KPSGASCMDPQTRNFMM

Query:  DRFLPAAKAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEM
         RFLPAAKA+ SE+P H  RK     E  K++        +  P            F    + E++D + S  +++  CG LPQ CL+    LLNPVP +
Subjt:  DRFLPAAKAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEM

Query:  RMQ---ATSMRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGT
        RMQ   A S+RR R+    S+    NE  +   +   +L++ E  +K S                    QGE +                 S+    +G 
Subjt:  RMQ---ATSMRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGT

Query:  KPATNEYDETCRRILSSFQALLGD-----ESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTY
        +   N    +  +I  +F  LL       E  S +PV EKTLYVD VH +                       D  ++    K+  L  S + DI     
Subjt:  KPATNEYDETCRRILSSFQALLGD-----ESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTY

Query:  SGCCEVVKLANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIPEYGKKLESEGKKLVETSIQESFHGPP
                               P+K     D     +    +   +G  +E      SQ  +   +   ++       +E   +K ++  +Q       
Subjt:  SGCCEVVKLANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIPEYGKKLESEGKKLVETSIQESFHGPP

Query:  QDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPT-SSRNSSF--LQSSLVMRLNPISMTTSSDPKSESTVKSS
         +S  L                           R  Y  +P  PP PK+PSDSWLKRTLPT   +N+SF  LQS      N  +  T ++PK E+ VK+S
Subjt:  QDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPT-SSRNSSF--LQSSLVMRLNPISMTTSSDPKSESTVKSS

Query:  NTNHQHLQFSK
        NT    + FSK
Subjt:  NTNHQHLQFSK

AT2G30990.3 Protein of unknown function (DUF688)8.1e-4730.26Show/hide
Query:  LMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTR-----TELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGRV
        +ME+KQLDFN+PL+S+RR T T   SE++ KTR     T +  P  PVYKS++KSGPVRNPGTVPF WE  PGKPKDE     Q    P  VPKLPPGR 
Subjt:  LMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTR-----TELKIPPLPVYKSELKSGPVRNPGTVPFIWEQTPGKPKDESTTKTQYLTLPPLVPKLPPGRV

Query:  PNVKQEVPTSINASLDENARETG-EEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEI--KPSGASCMDPQTRNFMMDRFLPAA
          V+          L      TG + +    S +D+   ED          ++S S   + ++SG SGLDGS I  +P G    D QT++ MM RFLPAA
Subjt:  PNVKQEVPTSINASLDENARETG-EEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEI--KPSGASCMDPQTRNFMMDRFLPAA

Query:  KAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQ---A
        KA+ SE+P H  RK     E  K++        +  P            F    + E++D + S  +++  CG LPQ CL+    LLNPVP +RMQ   A
Subjt:  KAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEMRMQ---A

Query:  TSMRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEY
         S+RR R+    S+    NE  +   +   +L++ E  +K S                    QGE +                 S+    +G +   N  
Subjt:  TSMRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEY

Query:  DETCRRILSSFQALLGD-----ESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTYSGCCEVV
          +  +I  +F  LL       E  S +PV EKTLYVD VH +                       D  ++    K+  L  S + DI            
Subjt:  DETCRRILSSFQALLGD-----ESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTYSGCCEVV

Query:  KLANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLR
                        P+K     D     +    +   +G  +E      SQ  +   +   ++       +E   +K ++  +Q        +S  L 
Subjt:  KLANFSQQKVDLENQFPLKPSNRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLR

Query:  TSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPT-SSRNSSF--LQSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHL
                                  R  Y  +P  PP PK+PSDSWLKRTLPT   +N+SF  LQS      N  +  T ++PK E+ VK+SNT    +
Subjt:  TSEGATNGKRDSINQFLKRVGNEDGSRGGYSRLPFAPPPPKSPSDSWLKRTLPT-SSRNSSF--LQSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHL

Query:  QFSK
         FSK
Subjt:  QFSK

AT3G11100.1 sequence-specific DNA binding transcription factors4.3e-3242.36Show/hide
Query:  VYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSLRTNIQCKNRIDTLKKRYKIEKARIDESGGSYVSTWPFFSCLDDLTGNSQKPSTPVAVSNCK
        + AWG+R+++LNRGNLR   W+EVADAVNS HG+ R   +T++QCKNRIDTLKK+YK EKA+        +S W FF  LD L G   K S+   V    
Subjt:  VYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSLRTNIQCKNRIDTLKKRYKIEKARIDESGGSYVSTWPFFSCLDDLTGNSQKPSTPVAVSNCK

Query:  ASPVTTPGLSLFSKVPVGPRSGTKKRRSTHVYRSLATRTFAGMQIRKRVK-ETTDRLLG--YRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAKD
         S +  P L+       G                       G  +RK  K E  D   G  +R+LAR+I  + + +ER+E  KQ+ M+ELE QRM+ AK+
Subjt:  ASPVTTPGLSLFSKVPVGPRSGTKKRRSTHVYRSLATRTFAGMQIRKRVK-ETTDRLLG--YRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAKD

Query:  LEYQRMQLLMEMHLQIQKIK--RARRASG
        LE QRM +LMEM L+++K K  + R ASG
Subjt:  LEYQRMQLLMEMHLQIQKIK--RARRASG

AT3G58630.1 sequence-specific DNA binding transcription factors1.6e-4238.33Show/hide
Query:  SCRPTPTELSRRRQPSVDSAARNGSVAPVAVYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSL--------RTNIQCKNRIDTLKKRYKIEKAR
        S RP+P  LSR    S ++           + AWG R++DL+RGNLR KHWQEVA+AVN RH +  +++        RT++QCKNRIDTLKK+YK+EKAR
Subjt:  SCRPTPTELSRRRQPSVDSAARNGSVAPVAVYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSL--------RTNIQCKNRIDTLKKRYKIEKAR

Query:  IDESG-GSYVSTWPFFSCLDDLTGNS-QKPSTPVAVSNCKASPVTTPGLSLFSKVPVGPRSGTKKRRST-------------------HVYRSLA-----
        + ES  G+Y+S WPFFS LDDL   S    S P +  N     ++ P     + VPV PRS   +R +T                   + + + A     
Subjt:  IDESG-GSYVSTWPFFSCLDDLTGNS-QKPSTPVAVSNCKASPVTTPGLSLFSKVPVGPRSGTKKRRST-------------------HVYRSLA-----

Query:  -----------TRTFAGMQIRKRVKETTDRLLGYRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAKDLEYQRMQLLMEMHLQIQKIKRARRASGP
                   +R+       K+ +   ++  GY+++A AI  +  IYERVE  K++ MVELE QRM+FAK+LE  RMQL  EM +++ K++R   + GP
Subjt:  -----------TRTFAGMQIRKRVKETTDRLLGYRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAKDLEYQRMQLLMEMHLQIQKIKRARRASGP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAAATCTGCTCTCTAGTAGGACGTATTTCCGTGACGTACTTACAAAAGAGAGCAGAATTAGCCGCTTTACTATCCCCAGATACGGATAAACTCGCACGCTCTAGAGA
ACTGAGAAGTATTCATCCTCGACCGGAAACAGCTCAAGACAATCATTTAATCGTTTGGGAGCTCCGGCGAATGAGGATCCGATGCACCAACAGATCGATAATTCTGAATC
AGAATCCTCTCCTCTATGCAAATAGTGAATTGATTAAGACATTAACCCTCCCGCATCCACAGCCTTGCCGCTCTCATCATATCCCTCGGGCTCCCGGTGATGGAAACGAG
GACGAAGCAGCTGATCGACGAATATGGCAATGGCTTTTTACTGGCAGAGAGGCAGTGGTTACAGTCGTGCGCGGAACCCCCACCCCCACGACCACGAGGACGGTCAGGAA
CTACCACATGATTTCATTAAGGAATCTGATGGAAGACAAACAACTGGATTTTAATCAACCTCTTCTTTCTGTGAGGCGTTTTACATCTACAGGAACGTCATCAGAGACAA
AAGAAAAAACCAGAACTGAACTCAAAATTCCTCCTCTTCCTGTATACAAATCAGAATTGAAATCTGGTCCAGTGAGGAATCCTGGAACTGTTCCTTTTATATGGGAGCAA
ACTCCAGGTAAACCCAAGGATGAAAGCACAACAAAAACTCAGTATTTGACACTGCCTCCACTTGTCCCAAAACTCCCACCGGGAAGGGTCCCAAATGTAAAGCAAGAAGT
TCCTACTTCCATTAATGCTTCATTAGATGAAAATGCCAGAGAGACAGGAGAAGAGGAGGCCAGTTGTGAGTCCGGGAATGATGAAGAAGAAGAAGAAGACGAGGTTTACA
GGGATGCAAATGATATATTTTCTCGATCCGAATCCTTCTTCTTGAATTGTAGTATTAGTGGAGTGAGTGGATTGGATGGTTCAGAAATTAAACCTTCAGGGGCTTCCTGT
ATGGATCCACAGACTCGGAATTTCATGATGGATCGATTTCTGCCTGCAGCAAAGGCAATGGCTTCAGAAACACCTCTACATACTACCAGAAAACAAACTGTTACCATTGA
GCGACCGAAGGAAATAAAGATGATAACAAATGGGGATAGGCAGAGTCAGCCAAACCTGCACATTCAGACACAACATGTCCAAGAAATATTTGGGGAAGAAAGTGAAGATG
AAGATGATGATTATGATGAGTCTGGATATGTTTCTGCTCGGGGCTGTGGTTTTCTTCCTCAGTTTTGCTTGAAAGGTCCATTTGTCCTTTTGAATCCAGTACCCGAAATG
AGAATGCAGGCCACTTCGATGCGTAGAAATCGAAATAGCTTAATCGGGAGTTCCAAGGATGCTGTAAATGAGAAACAATCAAGTCATGGTCAAGGAATAACTCGGCTGCA
AATGGAAGAAAAAGCAAGTAAATCCAGCAAAGGCAATGGCAAAAGTAATATTCAGGAGCGAGATATATTTTCTTTATACAGGCATTTGCAGGGTGAGGGCATATCCAACT
ACCCGAATGAACCTTCCCGAGCTGCTCAAACAGATGTAAATCCTTCTCTTGGTTGCAATGAAAAAGGTACCAAGCCAGCAACAAATGAATATGATGAAACATGTAGAAGA
ATCCTAAGTAGCTTTCAAGCATTACTGGGTGACGAATCTGGTTCAGCTAGTCCTGTTGTTGAGAAAACTCTCTACGTAGACTCTGTACATAAGATTAAATCTCCCAATTC
AAGTTCAAATTCTTTGGAAATGAAAGGCATAAGTTACAGTGGAGATATGATTGACGATGCTTCTATAAGAAGTATCAAAACGAAAGAACTGTATCTACTGAATTCTGCAA
CCCAAGATATTCAAACCGGAACATATTCAGGATGCTGTGAAGTTGTCAAGCTTGCAAATTTCTCACAACAAAAGGTTGATTTGGAAAACCAATTTCCCTTGAAACCAAGC
AACCGAGGTGATTCCAATGGTCTTGCCAATGATACCACTAAATTGACAAGCTCAGATGGAACTTACAATGAGAAGATTAATTTAGGCAGGAGCCAACCTGAGAAATCAAG
TTATGAAGGTAATAACCTTATTATTCCGGAATATGGCAAGAAGCTCGAGTCGGAGGGCAAAAAGCTTGTAGAAACCAGCATTCAAGAAAGTTTCCATGGCCCTCCTCAGG
ACTCCCCCGACCTGAGAACTTCAGAAGGTGCTACTAATGGAAAGAGGGATTCAATAAATCAATTTCTCAAGAGGGTAGGTAATGAAGACGGTTCTCGTGGCGGCTACTCA
CGGCTGCCTTTTGCTCCGCCTCCACCAAAATCTCCATCAGACTCTTGGCTAAAGCGTACTTTGCCAACTTCTTCAAGAAACTCTTCGTTTTTGCAGTCTTCTCTTGTAAT
GCGACTAAACCCCATCTCAATGACCACGTCTTCCGATCCCAAGTCGGAGTCTACTGTTAAAAGCTCCAATACAAACCATCAGCATCTGCAGTTTTCAAAGAATAATGGAG
GAAGCCTTCTCCAAATTTCACTTGTCTGTGAGTATGCTTTACTTTCCACTGGGTTTGATGGGTTCCGAGGTTCAGAAGATATCTCCTCTCACTTTGAGTCTTCTTCTTCT
TCTTCTCTTTTTGGCTTGCTTCAGAATCAGTGTAAAAATAACTGTAAGTGGGCATTTCCACTGCTAGGCAAAATTGATAACTCGCCTGTTGTAAATTTCAAACATTGCGC
GCCGAGGCCCAAAAACCAACCTCAGCATGCGGAAGGGAGATCGCGCCAATGGATCCAAGTGCCGCCTAAAGGAAAGGAAATGCCTGGACACGTTCGCTTTCCAGGAGTGC
TCACCGCTTTCCTTTCTCCTATTGCACTGGCTCGCCCACGCATGCAAGGGCAGTCGGCGAACTTCATTTTAAAACCCATCGCTTCGTGCCGCCCAACTCCCACTGAACTG
AGTCGTCGCCGGCAACCATCTGTGGACTCAGCCGCCCGTAATGGCTCTGTTGCACCCGTCGCCGTCTACGCCTGGGGAGAACGTCATCTCGACTTGAACCGCGGCAACCT
TCGTCACAAACACTGGCAAGAGGTCGCCGACGCTGTCAACTCCCGACACGGTCATGATAGGAAGTCCCTTCGTACGAATATTCAGTGCAAAAACCGCATCGACACGCTTA
AGAAGAGGTACAAGATCGAGAAGGCAAGAATTGATGAATCCGGTGGCTCGTACGTCTCCACTTGGCCATTCTTCTCTTGCCTCGACGATCTTACAGGCAACAGCCAGAAG
CCTTCGACGCCGGTCGCCGTATCTAACTGTAAAGCCTCGCCAGTAACGACTCCAGGGCTGTCATTGTTCTCGAAGGTTCCGGTTGGCCCTCGATCCGGTACGAAGAAACG
TCGTTCAACTCATGTCTATAGGAGTTTGGCGACTCGCACCTTCGCCGGGATGCAAATTCGAAAGAGGGTGAAGGAAACGACGGACAGATTGCTAGGGTATAGAAAACTGG
CTCGGGCAATAGGGACGATTACTGATATATACGAGAGAGTGGAGGTAGCAAAACAGAGGCATATGGTAGAATTGGAGATGCAACGGATGCAATTTGCGAAGGATTTGGAG
TATCAGAGGATGCAACTACTTATGGAGATGCACCTTCAGATTCAAAAGATCAAGCGCGCCAGGCGAGCTTCAGGACCCGCTTGTTTTGGCGGAGGATTTGGTGGAGCTGC
TTGGAAGTGGATCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCAAATCTGCTCTCTAGTAGGACGTATTTCCGTGACGTACTTACAAAAGAGAGCAGAATTAGCCGCTTTACTATCCCCAGATACGGATAAACTCGCACGCTCTAGAGA
ACTGAGAAGTATTCATCCTCGACCGGAAACAGCTCAAGACAATCATTTAATCGTTTGGGAGCTCCGGCGAATGAGGATCCGATGCACCAACAGATCGATAATTCTGAATC
AGAATCCTCTCCTCTATGCAAATAGTGAATTGATTAAGACATTAACCCTCCCGCATCCACAGCCTTGCCGCTCTCATCATATCCCTCGGGCTCCCGGTGATGGAAACGAG
GACGAAGCAGCTGATCGACGAATATGGCAATGGCTTTTTACTGGCAGAGAGGCAGTGGTTACAGTCGTGCGCGGAACCCCCACCCCCACGACCACGAGGACGGTCAGGAA
CTACCACATGATTTCATTAAGGAATCTGATGGAAGACAAACAACTGGATTTTAATCAACCTCTTCTTTCTGTGAGGCGTTTTACATCTACAGGAACGTCATCAGAGACAA
AAGAAAAAACCAGAACTGAACTCAAAATTCCTCCTCTTCCTGTATACAAATCAGAATTGAAATCTGGTCCAGTGAGGAATCCTGGAACTGTTCCTTTTATATGGGAGCAA
ACTCCAGGTAAACCCAAGGATGAAAGCACAACAAAAACTCAGTATTTGACACTGCCTCCACTTGTCCCAAAACTCCCACCGGGAAGGGTCCCAAATGTAAAGCAAGAAGT
TCCTACTTCCATTAATGCTTCATTAGATGAAAATGCCAGAGAGACAGGAGAAGAGGAGGCCAGTTGTGAGTCCGGGAATGATGAAGAAGAAGAAGAAGACGAGGTTTACA
GGGATGCAAATGATATATTTTCTCGATCCGAATCCTTCTTCTTGAATTGTAGTATTAGTGGAGTGAGTGGATTGGATGGTTCAGAAATTAAACCTTCAGGGGCTTCCTGT
ATGGATCCACAGACTCGGAATTTCATGATGGATCGATTTCTGCCTGCAGCAAAGGCAATGGCTTCAGAAACACCTCTACATACTACCAGAAAACAAACTGTTACCATTGA
GCGACCGAAGGAAATAAAGATGATAACAAATGGGGATAGGCAGAGTCAGCCAAACCTGCACATTCAGACACAACATGTCCAAGAAATATTTGGGGAAGAAAGTGAAGATG
AAGATGATGATTATGATGAGTCTGGATATGTTTCTGCTCGGGGCTGTGGTTTTCTTCCTCAGTTTTGCTTGAAAGGTCCATTTGTCCTTTTGAATCCAGTACCCGAAATG
AGAATGCAGGCCACTTCGATGCGTAGAAATCGAAATAGCTTAATCGGGAGTTCCAAGGATGCTGTAAATGAGAAACAATCAAGTCATGGTCAAGGAATAACTCGGCTGCA
AATGGAAGAAAAAGCAAGTAAATCCAGCAAAGGCAATGGCAAAAGTAATATTCAGGAGCGAGATATATTTTCTTTATACAGGCATTTGCAGGGTGAGGGCATATCCAACT
ACCCGAATGAACCTTCCCGAGCTGCTCAAACAGATGTAAATCCTTCTCTTGGTTGCAATGAAAAAGGTACCAAGCCAGCAACAAATGAATATGATGAAACATGTAGAAGA
ATCCTAAGTAGCTTTCAAGCATTACTGGGTGACGAATCTGGTTCAGCTAGTCCTGTTGTTGAGAAAACTCTCTACGTAGACTCTGTACATAAGATTAAATCTCCCAATTC
AAGTTCAAATTCTTTGGAAATGAAAGGCATAAGTTACAGTGGAGATATGATTGACGATGCTTCTATAAGAAGTATCAAAACGAAAGAACTGTATCTACTGAATTCTGCAA
CCCAAGATATTCAAACCGGAACATATTCAGGATGCTGTGAAGTTGTCAAGCTTGCAAATTTCTCACAACAAAAGGTTGATTTGGAAAACCAATTTCCCTTGAAACCAAGC
AACCGAGGTGATTCCAATGGTCTTGCCAATGATACCACTAAATTGACAAGCTCAGATGGAACTTACAATGAGAAGATTAATTTAGGCAGGAGCCAACCTGAGAAATCAAG
TTATGAAGGTAATAACCTTATTATTCCGGAATATGGCAAGAAGCTCGAGTCGGAGGGCAAAAAGCTTGTAGAAACCAGCATTCAAGAAAGTTTCCATGGCCCTCCTCAGG
ACTCCCCCGACCTGAGAACTTCAGAAGGTGCTACTAATGGAAAGAGGGATTCAATAAATCAATTTCTCAAGAGGGTAGGTAATGAAGACGGTTCTCGTGGCGGCTACTCA
CGGCTGCCTTTTGCTCCGCCTCCACCAAAATCTCCATCAGACTCTTGGCTAAAGCGTACTTTGCCAACTTCTTCAAGAAACTCTTCGTTTTTGCAGTCTTCTCTTGTAAT
GCGACTAAACCCCATCTCAATGACCACGTCTTCCGATCCCAAGTCGGAGTCTACTGTTAAAAGCTCCAATACAAACCATCAGCATCTGCAGTTTTCAAAGAATAATGGAG
GAAGCCTTCTCCAAATTTCACTTGTCTGTGAGTATGCTTTACTTTCCACTGGGTTTGATGGGTTCCGAGGTTCAGAAGATATCTCCTCTCACTTTGAGTCTTCTTCTTCT
TCTTCTCTTTTTGGCTTGCTTCAGAATCAGTGTAAAAATAACTGTAAGTGGGCATTTCCACTGCTAGGCAAAATTGATAACTCGCCTGTTGTAAATTTCAAACATTGCGC
GCCGAGGCCCAAAAACCAACCTCAGCATGCGGAAGGGAGATCGCGCCAATGGATCCAAGTGCCGCCTAAAGGAAAGGAAATGCCTGGACACGTTCGCTTTCCAGGAGTGC
TCACCGCTTTCCTTTCTCCTATTGCACTGGCTCGCCCACGCATGCAAGGGCAGTCGGCGAACTTCATTTTAAAACCCATCGCTTCGTGCCGCCCAACTCCCACTGAACTG
AGTCGTCGCCGGCAACCATCTGTGGACTCAGCCGCCCGTAATGGCTCTGTTGCACCCGTCGCCGTCTACGCCTGGGGAGAACGTCATCTCGACTTGAACCGCGGCAACCT
TCGTCACAAACACTGGCAAGAGGTCGCCGACGCTGTCAACTCCCGACACGGTCATGATAGGAAGTCCCTTCGTACGAATATTCAGTGCAAAAACCGCATCGACACGCTTA
AGAAGAGGTACAAGATCGAGAAGGCAAGAATTGATGAATCCGGTGGCTCGTACGTCTCCACTTGGCCATTCTTCTCTTGCCTCGACGATCTTACAGGCAACAGCCAGAAG
CCTTCGACGCCGGTCGCCGTATCTAACTGTAAAGCCTCGCCAGTAACGACTCCAGGGCTGTCATTGTTCTCGAAGGTTCCGGTTGGCCCTCGATCCGGTACGAAGAAACG
TCGTTCAACTCATGTCTATAGGAGTTTGGCGACTCGCACCTTCGCCGGGATGCAAATTCGAAAGAGGGTGAAGGAAACGACGGACAGATTGCTAGGGTATAGAAAACTGG
CTCGGGCAATAGGGACGATTACTGATATATACGAGAGAGTGGAGGTAGCAAAACAGAGGCATATGGTAGAATTGGAGATGCAACGGATGCAATTTGCGAAGGATTTGGAG
TATCAGAGGATGCAACTACTTATGGAGATGCACCTTCAGATTCAAAAGATCAAGCGCGCCAGGCGAGCTTCAGGACCCGCTTGTTTTGGCGGAGGATTTGGTGGAGCTGC
TTGGAAGTGGATCTGA
Protein sequenceShow/hide protein sequence
MQICSLVGRISVTYLQKRAELAALLSPDTDKLARSRELRSIHPRPETAQDNHLIVWELRRMRIRCTNRSIILNQNPLLYANSELIKTLTLPHPQPCRSHHIPRAPGDGNE
DEAADRRIWQWLFTGREAVVTVVRGTPTPTTTRTVRNYHMISLRNLMEDKQLDFNQPLLSVRRFTSTGTSSETKEKTRTELKIPPLPVYKSELKSGPVRNPGTVPFIWEQ
TPGKPKDESTTKTQYLTLPPLVPKLPPGRVPNVKQEVPTSINASLDENARETGEEEASCESGNDEEEEEDEVYRDANDIFSRSESFFLNCSISGVSGLDGSEIKPSGASC
MDPQTRNFMMDRFLPAAKAMASETPLHTTRKQTVTIERPKEIKMITNGDRQSQPNLHIQTQHVQEIFGEESEDEDDDYDESGYVSARGCGFLPQFCLKGPFVLLNPVPEM
RMQATSMRRNRNSLIGSSKDAVNEKQSSHGQGITRLQMEEKASKSSKGNGKSNIQERDIFSLYRHLQGEGISNYPNEPSRAAQTDVNPSLGCNEKGTKPATNEYDETCRR
ILSSFQALLGDESGSASPVVEKTLYVDSVHKIKSPNSSSNSLEMKGISYSGDMIDDASIRSIKTKELYLLNSATQDIQTGTYSGCCEVVKLANFSQQKVDLENQFPLKPS
NRGDSNGLANDTTKLTSSDGTYNEKINLGRSQPEKSSYEGNNLIIPEYGKKLESEGKKLVETSIQESFHGPPQDSPDLRTSEGATNGKRDSINQFLKRVGNEDGSRGGYS
RLPFAPPPPKSPSDSWLKRTLPTSSRNSSFLQSSLVMRLNPISMTTSSDPKSESTVKSSNTNHQHLQFSKNNGGSLLQISLVCEYALLSTGFDGFRGSEDISSHFESSSS
SSLFGLLQNQCKNNCKWAFPLLGKIDNSPVVNFKHCAPRPKNQPQHAEGRSRQWIQVPPKGKEMPGHVRFPGVLTAFLSPIALARPRMQGQSANFILKPIASCRPTPTEL
SRRRQPSVDSAARNGSVAPVAVYAWGERHLDLNRGNLRHKHWQEVADAVNSRHGHDRKSLRTNIQCKNRIDTLKKRYKIEKARIDESGGSYVSTWPFFSCLDDLTGNSQK
PSTPVAVSNCKASPVTTPGLSLFSKVPVGPRSGTKKRRSTHVYRSLATRTFAGMQIRKRVKETTDRLLGYRKLARAIGTITDIYERVEVAKQRHMVELEMQRMQFAKDLE
YQRMQLLMEMHLQIQKIKRARRASGPACFGGGFGGAAWKWI