CuGenDBv2

Sgr016425 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016425
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00152909:860517..864885
RNA-Seq Expression: Sgr016425
Synteny: Sgr016425
Gene Ontology terms: NA
InterPro domains: IPR012337 - Ribonuclease H-like superfamily
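
The genome location field follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the value could be parsed and the locus span computed as below. The function name parse_location is hypothetical and not part of CuGenDB.

```python
import re


def parse_location(loc: str):
    """Split a 'contig:start..end' string into its parts.

    Assumption: coordinates are 1-based and inclusive, so the span
    is end - start + 1. This convention is not stated on the page.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc.strip())
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    # Normalize in case the coordinates are listed in reverse order.
    if start > end:
        start, end = end, start
    return contig, start, end, end - start + 1


contig, start, end, span = parse_location("tig00152909:860517..864885")
print(contig, start, end, span)
```

Under that assumption, tig00152909:860517..864885 covers 4369 bp of the contig.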


Homology
GenBank top hits (e value, % identity, alignment)
KAG7017109.1 hypothetical protein SDJN02_22221, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 0.0e+00 | % identity: 87.63
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPS SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNASASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSP-PPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLEN
        IS SNASASP+PISSIPSPT HNHKKRSS MNAPILTASYQVHSLAMIEPTRSYAPLISSP  PV QNP        L+QHQLVLSGGKDDLGALEMLEN
Subjt:  ISGSNASASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSP-PPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLEN

Query:  SVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGWK
        SVKKLKSPHASPGPRLSKEQIDSAIELLTDW IESCGSVSLSCLEHPKFKALL+QLGLPS+PRTDILGARLDSKFEEAKADSEARIRD+  FQIASDGWK
Subjt:  SVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGWK

Query:  NKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLIK
        NKNC    CGEESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGS LQ+CVGIIAD+YKAKALRNLEIK HWMVNLSCQLQGFISLIK
Subjt:  NKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLIK

Query:  DFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEVS
        DFNKELPLFRVVTENCLKVANFV+TK  VRN LNKYKVQELEGH L HVPSPNCDTSKNFSPVYAMLDD+LSCAHVLQMVVLDESYKLACMED LA+EVS
Subjt:  DFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEVS

Query:  SLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKYL
        SLIQNERFWDEVEAVHS VKMIRGMA+EIEAERPLIGQCLPLWEELR+KVKEWCAK++IAE PVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKYL
Subjt:  SLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKYL

Query:  PPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSCG
        PPFKCLSQ+QEKDVDSL+NRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFK L KVALRLIFLHSTSCG
Subjt:  PPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSCG

Query:  YKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTLD
        YKCKCSI+NLVCSHRHSRVGLE+AQKM+FVAAHAKLER DFSNE DKDAELF+MADGENDMLNEVFSDAPSM+GYEFLD+QNRGH               
Subjt:  YKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTLD

Query:  CHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVP
                               V+D+FDQT+PEL EVNLGLGTISLQLKMHF D IPDISPVVP
Subjt:  CHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVP

KGN50377.1 hypothetical protein Csa_000462 [Cucumis sativus] | e value: 0.0e+00 | % identity: 86.8
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S+LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNP+ SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS S AS ASP+PISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP P  QN +GMASKMG  QHQLVLSGGKDDLGALEMLE
Subjt:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSC +HPKFKALL+QLGLPSLPRTDILGARLDSKFEEAKADSEARIRD+ FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNC    C EESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR VTENCLKVANFVNTK  VRN +NKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDML+CAHVLQMVVLDESYK+ACMED LA+EV
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EAVHS VKMIR MAQEIEAERPLIGQCLPLWEELRTKVKEWC KF+IAE PVEKI+EKRFRKNYHPAWS AFILDPLYLRRD+NGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAH+AFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLS FK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTL
        G+KCKCSI+NLVCS+RHSRVGLERAQKM+FVAAHAKLER DFSNEEDKDAELFAMADGENDMLNEVFSDAPS+                           
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTL

Query:  DCHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVPTYVTF
                               +V+DVFDQT+PEL  VNLGLGTISLQLKMH ADRIPDISPV+PTYVTF
Subjt:  DCHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVPTYVTF

XP_008467279.1 PREDICTED: uncharacterized protein LOC103504669 isoform X1 [Cucumis melo] | e value: 0.0e+00 | % identity: 92.78
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S+LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNP+ SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS SNAS ASP+PISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP PV QN +GM SKMG  QHQLVLSGGKDDLGALEMLE
Subjt:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRD+ FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNC    C EESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR VTENCLKVANFVNTK  VRN +NKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDML+C HVLQMVVLDESYK+ACMED LA+EV
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EAVHS VKMI  MAQEIEAERPLIGQCLPLWEELRTKVKEWC KF+IAEGPVEKI+EKRFRKNYHPAWS AFILDPLYLRRD+NGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAH+AFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVK
        G+KCKCSI+NLVCSHRHSRVGLERAQKM+FVAAHAKLER DFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFL LQNRGHKKVK
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVK

XP_038907184.1 uncharacterized protein LOC120092979 isoform X1 [Benincasa hispida] | e value: 0.0e+00 | % identity: 93.78
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S+LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNP+ SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS SNAS ASP+PISSIPSPTLHNHKKRSSQMNA ILTASYQVHSLAMIEPTRSYAPLISSPP PV QN LGM SKMG  QHQ VLSGGKDDLGALEMLE
Subjt:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASP PRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALL QLGLPSLPRTDILGARLDSKFEEAKADSEARIRD+ FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNC    CGEESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR VTENCLKVANFVNTK  VRN +NKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSC HVLQMVVLDES+KLACMED LA+EV
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWC KF+IAEGPVEK++EKRFRKNYHPAWSAAFILDPLYLRRDINGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAH+AFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKK
        GYKCKCSI+NLVCSHRHSRVGLERAQKM+FVAAHAKLER DFSNEE+KDAELFAMADGENDMLNEVFSDAPSMNGYEFL LQNRGHKK
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKK

XP_038907185.1 uncharacterized protein LOC120092979 isoform X2 [Benincasa hispida] | e value: 0.0e+00 | % identity: 93.78
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S+LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNP+ SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS SNAS ASP+PISSIPSPTLHNHKKRSSQMNA ILTASYQVHSLAMIEPTRSYAPLISSPP PV QN LGM SKMG  QHQ VLSGGKDDLGALEMLE
Subjt:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASP PRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALL QLGLPSLPRTDILGARLDSKFEEAKADSEARIRD+ FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNC    CGEESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR VTENCLKVANFVNTK  VRN +NKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSC HVLQMVVLDES+KLACMED LA+EV
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWC KF+IAEGPVEK++EKRFRKNYHPAWSAAFILDPLYLRRDINGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAH+AFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKK
        GYKCKCSI+NLVCSHRHSRVGLERAQKM+FVAAHAKLER DFSNEE+KDAELFAMADGENDMLNEVFSDAPSMNGYEFL LQNRGHKK
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KPN2 Uncharacterized protein | e value: 0.0e+00 | % identity: 86.8
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S+LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNP+ SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS S AS ASP+PISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP P  QN +GMASKMG  QHQLVLSGGKDDLGALEMLE
Subjt:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSC +HPKFKALL+QLGLPSLPRTDILGARLDSKFEEAKADSEARIRD+ FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNC    C EESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR VTENCLKVANFVNTK  VRN +NKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDML+CAHVLQMVVLDESYK+ACMED LA+EV
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EAVHS VKMIR MAQEIEAERPLIGQCLPLWEELRTKVKEWC KF+IAE PVEKI+EKRFRKNYHPAWS AFILDPLYLRRD+NGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAH+AFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLS FK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTL
        G+KCKCSI+NLVCS+RHSRVGLERAQKM+FVAAHAKLER DFSNEEDKDAELFAMADGENDMLNEVFSDAPS+                           
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTL

Query:  DCHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVPTYVTF
                               +V+DVFDQT+PEL  VNLGLGTISLQLKMH ADRIPDISPV+PTYVTF
Subjt:  DCHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVPTYVTF

A0A1S3CT64 uncharacterized protein LOC103504669 isoform X1 | e value: 0.0e+00 | % identity: 92.78
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S+LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNP+ SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS SNAS ASP+PISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP PV QN +GM SKMG  QHQLVLSGGKDDLGALEMLE
Subjt:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRD+ FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNC    C EESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR VTENCLKVANFVNTK  VRN +NKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDML+C HVLQMVVLDESYK+ACMED LA+EV
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EAVHS VKMI  MAQEIEAERPLIGQCLPLWEELRTKVKEWC KF+IAEGPVEKI+EKRFRKNYHPAWS AFILDPLYLRRD+NGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAH+AFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVK
        G+KCKCSI+NLVCSHRHSRVGLERAQKM+FVAAHAKLER DFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFL LQNRGHKKVK
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVK

A0A1S3CTD6 uncharacterized protein LOC103504669 isoform X2 | e value: 0.0e+00 | % identity: 92.77
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S+LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNP+ SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS SNAS ASP+PISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP PV QN +GM SKMG  QHQLVLSGGKDDLGALEMLE
Subjt:  ISGSNAS-ASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPP-PVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSC +HPKFKALL+QLGLPSLP+TDILGARLDSKFEEAKADSEARIRD+ FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNC    C EESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQ+CVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR VTENCLKVANFVNTK  VRN +NKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDML+C HVLQMVVLDESYK+ACMED LA+EV
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EAVHS VKMI  MAQEIEAERPLIGQCLPLWEELRTKVKEWC KF+IAEGPVEKI+EKRFRKNYHPAWS AFILDPLYLRRD+NGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAH+AFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKK
        G+KCKCSI+NLVCSHRHSRVGLERAQKM+FVAAHAKLER DFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFL LQNRGHKK
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKK

A0A6J1DW05 uncharacterized protein LOC111024962 isoform X1 | e value: 0.0e+00 | % identity: 92.31
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP I+PS+LTEDLA KALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNASASPMPISSIPSPTLHNHKKRSSQ--MNAPILTASYQVHSLAMIEPTRSYAPLISSPPPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE
        IS SNAS SP+PISSIPSPTLHNHKKRSSQ  M+AP+LTASYQVHSLAMIEPTRSYAPLISS  PV QNPLGMA K GL QHQLVLSGGKDDLGALEMLE
Subjt:  ISGSNASASPMPISSIPSPTLHNHKKRSSQ--MNAPILTASYQVHSLAMIEPTRSYAPLISSPPPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLE

Query:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW
        NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKA+LNQLGLPSLPRTDILGARLD+KFEEAKADS+ARIRD++FFQIASDGW
Subjt:  NSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGW

Query:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI
        KNKNCCG+YCGEES+V+FMVNLPNGTTVFQKALFTGGL+SSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQG +SLI
Subjt:  KNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLI

Query:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV
        KDFNKELPLFR+VTENCLKVANFVNTK  +RN LNKYKVQELEGHWLLHVPSPNCDTSKNFSPVY+MLDDML+CAHVLQMVVLDESYKL CMEDPLASE+
Subjt:  KDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEV

Query:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY
        SSLIQNERFWDE+EA HSLVKMIRGMAQEIE ERPLIGQCLPLWEELRTKVKEWC KF+IAEGPVEKI+EKRFRKNYHPAWSAAFILDPLYLRRDINGKY
Subjt:  SSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKY

Query:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC
        LPPFKCLSQ+QEKDVDSLINRLVSREEAHVAF+EL KWRSEGLDPLYAQAVQVKQRDPLTGKMKI NPQSRRLVWETCLSEFK LGKVALRLIFLHSTSC
Subjt:  LPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSC

Query:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLD
        GYKCKCS++NLVCSHRHSRVGLERAQKM+FVAAHAKLERRDFSNEED+DAELFAM DGENDMLNEVFSDAPS+      D
Subjt:  GYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLD

A0A6J1EL65 uncharacterized protein LOC111435659 isoform X1 | e value: 0.0e+00 | % identity: 86.32
Query:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS
        MASTNSPP ID S LTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPS SLPKAVKLKCSLCD+VFSASNPSRTASEHLKRGTCPNLSS
Subjt:  MASTNSPP-IDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSS

Query:  ISGSNASASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSP-PPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLEN
        IS SNASASP+PISSIPSPT HNHKKRSS MNAPILTASYQVHSLAMIEPTRSYAPLISSP  PV QNP        L+QHQLVLSGGKDDLGALEMLEN
Subjt:  ISGSNASASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSP-PPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLEN

Query:  SVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGWK
        SVKKLKSPHASPGPRLSKEQIDSAIELLTDW IESCGSVSLSCLEHPKFKALL+QLGLPS+PRTDILGARLDSKFEEAKADSEARIRD+  FQIASDGWK
Subjt:  SVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGWK

Query:  NKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLIK
        NKNC    CGEESVV+FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGS LQ+CVGIIAD+YKAKALRNLEIK HWMVNLSCQLQGFISLIK
Subjt:  NKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLIK

Query:  DFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEVS
        DFNKELPLFRVVTENCLKVANFV+TK  VRN LNKYKVQELEGH L HVPSPNCDTSKNFSPVYAMLDD+LSCAHVLQMVVLDESYKLACMED LA+EVS
Subjt:  DFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEVS

Query:  SLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKYL
        SLIQNERFWDEVEAVHS VKMIRGMA+EIEAERPLIGQCLPLWEELR+KVKEWCAK++IAE PVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKYL
Subjt:  SLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKYL

Query:  PPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSCG
        PPFKCLSQ+QEKDVDSL+NRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFK L KVALRLIFLHSTSCG
Subjt:  PPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSCG

Query:  YKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTLD
        YKCKCSI+NLVCSHRHSRVGLE+AQKM+FVAAHAKLER DFSNE DKDAELF+MADGENDMLNEVFSDAPS+N                           
Subjt:  YKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTLD

Query:  CHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVPTYVTF
                               V+D+FDQT+PEL EVNLGLGTISLQLKMHF D IPDISPVVP YVTF
Subjt:  CHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVPTYVTF

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G12380.1 unknown protein | e value: 3.7e-256 | % identity: 56.13
Query:  SPPIDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSSISGSNA
        +PP      T++L  KALNKRYE L+TVRTKA+KGKGAWYW HLEP+L+RN  T LPKAVKL+CSLCDAVFSASNPSRTASEHLKRGTCPN +S++   +
Subjt:  SPPIDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSSISGSNA

Query:  SASPMPISSIPSPTLHNHKKRS---------SQMNAPILTASYQVHSLAMIEPTRSYAPLI--SSPPPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALE
        + +P P SS  SP  H+ K+ S         S++N P +  SY V  + +++P+R     +  S+PPP               QH L+LSGGKDDLG L 
Subjt:  SASPMPISSIPSPTLHNHKKRS---------SQMNAPILTASYQVHSLAMIEPTRSYAPLI--SSPPPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALE

Query:  MLENSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIAS
        MLE+SVKKLKSP  S    L++ QI+SA++ L+DW  ESCGSVSLS LEHPKF+A L Q+GLP + + D    RLD K EEA+A++E+RIRD++FFQI+S
Subjt:  MLENSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIAS

Query:  DGWKNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFI
        DGWK           ES+V  +VNLPNGT+++++A+   G V S YAEEV+L+TV  ICG+  QRCVGI++D++K KALRNLE ++ WMVNLSCQ QG  
Subjt:  DGWKNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFI

Query:  SLIKDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVP--------SPNCDTSKN-------FSPVYAMLDDMLSCAHVLQMVV
        SLIKDF KELPLF+ V++NC+++A F+N    +RN+  KY++QE     +L +P          +C +S +       + P++ +L+D+LS A  +Q+VV
Subjt:  SLIKDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVP--------SPNCDTSKN-------FSPVYAMLDDMLSCAHVLQMVV

Query:  LDESYKLACMEDPLASEVSSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWS
         D++ K+  MED +A EV  ++ +E FW+EVEAVH+L+K+++ MA+ IE E+ L+GQCLPLW+ELR KVK+W +KFN+ EG VEK++E+RF+K+YHPAW+
Subjt:  LDESYKLACMEDPLASEVSSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWS

Query:  AAFILDPLYLRRDINGKYLPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEF
        AAFILDPLYL RD +GKYLPPFKCLS +QEKDVD LI RLVSR+EAH+A MELMKWR+EGLDP+YA+AVQ+K+RDP++GKM+IANPQS RLVWET LSEF
Subjt:  AAFILDPLYLRRDINGKYLPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEF

Query:  KVLGKVALRLIFLHSTSCGYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSM
        + LGKVA+RLIFLH+T+ G+KC  S+L  V S+  S   ++RAQK+IF++A++K ERRDFSNEED+DAEL AMA+G++ MLN+V  D  S+
Subjt:  KVLGKVALRLIFLHSTSCGYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSM

AT1G62870.1 unknown protein | e value: 1.6e-254 | % identity: 57.07
Query:  TNSPPIDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSSISGS
        T SP   PS+  E+LATKAL KRYE L+ VRTKA+KGKGAWYW+HLEP+L+ N  T  PKAVKL+CSLCDAVFSASNPSRTASEHLKRGTCPN +S+   
Subjt:  TNSPPIDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTCPNLSSISGS

Query:  NASASPMPISSIPSPTLHNHKKRSSQMNAPI----------LTASYQVHSLAMIEPTRSYAPLISSPPPVTQNPLGMASKMGLTQHQLVLSGGKDDLGAL
         ++ SP P    P P   +H+KR+S     +             SY V  L++++P+R          PVTQ P             L+LSGGKDDLG L
Subjt:  NASASPMPISSIPSPTLHNHKKRSSQMNAPI----------LTASYQVHSLAMIEPTRSYAPLISSPPPVTQNPLGMASKMGLTQHQLVLSGGKDDLGAL

Query:  EMLENSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIA
         MLE+SVKKLKSP  S    L+K QIDSA++ L+DW  ESCGSVSLS LEHPK +A L Q+GLP + R D +  RLD K+E+++A++E+RI D++FFQIA
Subjt:  EMLENSVKKLKSPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIA

Query:  SDGWKNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGF
        SDGWK      F    E++V  +VNLPNGT+++++A+F  G V S YAEEV+ +TV  ICG+  QRCVGI++DR+ +KALRNLE ++ WMVNLSCQ QGF
Subjt:  SDGWKNKNCCGFYCGEESVVRFMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGF

Query:  ISLIKDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPL
         SLI+DF KELPLF+ V+++C ++ NFVN+   +RN++ KY++QE     +LH+P      S  F P+Y +L+D+LS A  +Q+V+ D+  K   MED +
Subjt:  ISLIKDFNKELPLFRVVTENCLKVANFVNTKYPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPL

Query:  ASEVSSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIA-EGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRD
        A EV  ++ +  FW+EVEAV+ L+K+++ MA+ IE ERPL+GQCLPLW+ELR+K+K+W AKFN+  E  VEKI+E+RF+K+YHPAW+AAFILDPLYL +D
Subjt:  ASEVSSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLIGQCLPLWEELRTKVKEWCAKFNIA-EGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRD

Query:  INGKYLPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFL
         +GKYLPPFKCLS +QEKDVD LI RLVSR+EAH+A MELMKWR+EGLDP+YA+AVQ+K+RDP++GKM+IANPQS RLVWET LSEF+ LG+VA+RLIFL
Subjt:  INGKYLPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLYAQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFL

Query:  HSTSCGYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSM
        H+TSCG+KC  S+L  V S+  SR  ++RAQK+IF++A++K ERRDFSNEE++DAEL AMA+GE+D+LN+V  D  S+
Subjt:  HSTSCGYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMADGENDMLNEVFSDAPSM
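The %identity column can be roughly cross-checked against the alignment blocks above. The following is a minimal sketch, not the database's own method: it assumes the unlabelled middle row of each block is a BLAST-style match line (a letter for an identical residue, `+` for a similar one, a space for a mismatch or gap) and that both strings have had the `Query:` label and its padding removed so their columns line up. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query: str, midline: str) -> float:
    """Percent of alignment columns where the match line repeats the query residue.

    Assumption: `midline` is a BLAST-style match line that is column-aligned
    with `query` (label prefix and its padding already stripped).
    """
    if not query:
        raise ValueError("query row is empty")
    if len(midline) > len(query):
        raise ValueError("match line is longer than the query row")
    # Trailing spaces (mismatches) are often lost when a page is copied as text,
    # so pad the match line back to the width of the query row.
    midline = midline.ljust(len(query))
    # A column counts as identical when the match line shows the same residue
    # as the query; gap characters in the query never count.
    identical = sum(1 for q, m in zip(query, midline) if q == m and q != "-")
    return 100.0 * identical / len(query)


# First ten columns of the last alignment block above.
print(percent_identity("HSTSCGYKCK", "H+TSCG+KC "))
```

The reported value covers the whole alignment, so to compare with it the rows of all blocks for one hit would need to be concatenated before calling the function.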


Sequences
CDS sequence
ATGTCTACAGCTCGTTATTGTGATAGTAATTTGAGTTTTGTTGCTGGGATGGCTTCCACCAATTCACCGCCCATTGATCCTTCTTCGTTAACTGAAGATTTGGCGACGAA
GGCTTTGAACAAGCGCTATGAATGCCTTGTAACTGTTCGAACAAAAGCCATAAAGGGGAAGGGAGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTCATTCGAAATCCCA
GCACTAGTCTCCCGAAAGCGGTAAAGCTCAAGTGTTCCTTGTGCGATGCCGTTTTCTCGGCTTCGAACCCATCGAGAACTGCATCTGAGCATCTCAAAAGAGGCACTTGT
CCCAATTTGAGCTCCATTTCTGGGTCCAATGCGTCGGCGTCGCCGATGCCTATATCGTCGATCCCTTCTCCGACATTGCACAACCACAAAAAGCGAAGCTCTCAAATGAA
TGCTCCCATTCTCACTGCTTCCTATCAAGTTCATTCTCTAGCCATGATTGAGCCGACGCGATCCTATGCTCCGCTAATTTCCTCGCCGCCCCCAGTGACTCAAAATCCGC
TTGGGATGGCGAGTAAGATGGGCTTGACTCAGCATCAGCTGGTGTTATCAGGAGGGAAAGATGATTTGGGTGCACTAGAAATGCTGGAAAACAGCGTCAAGAAACTAAAG
AGTCCACATGCCTCACCTGGACCAAGGCTAAGTAAGGAACAAATTGATTCTGCAATCGAATTGCTAACTGATTGGTTTATCGAGTCATGTGGGTCAGTGTCGCTTTCGTG
CCTTGAGCATCCGAAGTTTAAAGCCTTGCTTAATCAGTTGGGTTTGCCTTCATTACCTCGGACCGACATTTTAGGAGCTCGGCTCGATTCCAAGTTTGAGGAGGCCAAAG
CTGATTCAGAAGCCAGGATTAGAGATTCAGTGTTTTTCCAAATTGCTTCAGATGGCTGGAAGAATAAGAACTGCTGTGGTTTTTATTGTGGCGAAGAGAGTGTAGTTAGA
TTTATGGTTAATCTTCCAAATGGTACTACTGTGTTCCAAAAAGCTCTATTTACAGGAGGATTGGTGTCATCCAAGTATGCCGAGGAGGTTATATTGGATACGGTCAACGA
GATTTGTGGGAGTGGTCTGCAGAGATGTGTGGGGATAATTGCAGATAGGTATAAGGCCAAGGCGTTGAGAAATTTGGAGATCAAGAATCATTGGATGGTAAATCTCTCTT
GCCAGCTTCAGGGTTTTATTAGTTTGATAAAGGATTTTAACAAAGAGCTTCCACTTTTCAGGGTAGTCACTGAAAATTGCCTGAAGGTAGCAAACTTTGTAAATACCAAA
TATCCAGTTAGGAATAGTTTAAACAAGTACAAGGTGCAGGAGCTAGAGGGTCACTGGTTGCTTCACGTTCCTTCGCCAAACTGTGATACGTCCAAAAACTTCTCACCTGT
TTATGCTATGCTTGATGATATGCTTAGCTGTGCTCATGTGCTTCAAATGGTGGTGTTAGACGAGTCCTATAAGCTGGCATGCATGGAGGATCCACTTGCGTCTGAGGTTT
CTAGTCTGATACAAAATGAACGCTTTTGGGATGAAGTGGAGGCAGTTCACTCACTTGTGAAGATGATCCGAGGGATGGCTCAAGAGATTGAAGCGGAAAGGCCACTGATT
GGGCAATGCTTGCCTCTCTGGGAGGAGCTGAGAACAAAAGTGAAGGAATGGTGTGCCAAGTTCAACATTGCTGAAGGGCCAGTGGAGAAAATTATAGAAAAACGGTTTAG
GAAAAATTATCATCCAGCATGGTCAGCTGCATTTATACTGGACCCGCTTTACTTGAGGAGGGACATAAATGGGAAATATCTTCCACCCTTCAAGTGCCTTTCGCAAGACC
AGGAAAAGGATGTGGATTCACTTATTAATCGGTTGGTGTCTAGGGAAGAAGCTCATGTCGCATTCATGGAGCTTATGAAATGGCGATCCGAAGGGCTAGACCCACTTTAT
GCTCAGGCAGTTCAGGTTAAACAACGAGACCCTTTGACTGGAAAGATGAAAATTGCCAACCCCCAGAGTAGGCGACTTGTCTGGGAAACTTGCCTAAGCGAGTTCAAGGT
CCTTGGTAAGGTTGCACTGAGGCTTATTTTCCTTCATTCAACATCTTGTGGCTACAAGTGTAAATGTTCCATCTTGAATTTGGTATGCTCACATCGGCACTCGAGGGTCG
GCTTGGAGAGAGCTCAGAAGATGATATTTGTTGCAGCCCATGCCAAGCTTGAACGGAGAGACTTTTCTAACGAGGAAGACAAAGATGCAGAACTATTTGCAATGGCAGAT
GGTGAAAATGACATGCTCAATGAGGTCTTTTCTGATGCACCCTCAATGAATGGTTATGAATTCCTAGACTTGCAGAACAGAGGGCATAAAAAAGTAAAAACTAGTCAGGA
ATCGAGTATGAACACTCTTGACTGTCACAATTTTGAAGCACTCTTTGGCATTGCTAGGCTAGCTAAACATATTGGTATTCTTAATGACGAAGTGCTGGATGTGTTTGATC
AAACCAAACCTGAGTTGCCAGAAGTCAACCTCGGACTAGGTACCATATCTTTGCAGCTGAAGATGCACTTCGCTGATAGAATCCCAGATATATCTCCTGTTGTGCCTACT
TATGTTACTTTTTGA
mRNA sequence
ATGTCTACAGCTCGTTATTGTGATAGTAATTTGAGTTTTGTTGCTGGGATGGCTTCCACCAATTCACCGCCCATTGATCCTTCTTCGTTAACTGAAGATTTGGCGACGAA
GGCTTTGAACAAGCGCTATGAATGCCTTGTAACTGTTCGAACAAAAGCCATAAAGGGGAAGGGAGCTTGGTATTGGGCTCATTTGGAGCCTGTTCTCATTCGAAATCCCA
GCACTAGTCTCCCGAAAGCGGTAAAGCTCAAGTGTTCCTTGTGCGATGCCGTTTTCTCGGCTTCGAACCCATCGAGAACTGCATCTGAGCATCTCAAAAGAGGCACTTGT
CCCAATTTGAGCTCCATTTCTGGGTCCAATGCGTCGGCGTCGCCGATGCCTATATCGTCGATCCCTTCTCCGACATTGCACAACCACAAAAAGCGAAGCTCTCAAATGAA
TGCTCCCATTCTCACTGCTTCCTATCAAGTTCATTCTCTAGCCATGATTGAGCCGACGCGATCCTATGCTCCGCTAATTTCCTCGCCGCCCCCAGTGACTCAAAATCCGC
TTGGGATGGCGAGTAAGATGGGCTTGACTCAGCATCAGCTGGTGTTATCAGGAGGGAAAGATGATTTGGGTGCACTAGAAATGCTGGAAAACAGCGTCAAGAAACTAAAG
AGTCCACATGCCTCACCTGGACCAAGGCTAAGTAAGGAACAAATTGATTCTGCAATCGAATTGCTAACTGATTGGTTTATCGAGTCATGTGGGTCAGTGTCGCTTTCGTG
CCTTGAGCATCCGAAGTTTAAAGCCTTGCTTAATCAGTTGGGTTTGCCTTCATTACCTCGGACCGACATTTTAGGAGCTCGGCTCGATTCCAAGTTTGAGGAGGCCAAAG
CTGATTCAGAAGCCAGGATTAGAGATTCAGTGTTTTTCCAAATTGCTTCAGATGGCTGGAAGAATAAGAACTGCTGTGGTTTTTATTGTGGCGAAGAGAGTGTAGTTAGA
TTTATGGTTAATCTTCCAAATGGTACTACTGTGTTCCAAAAAGCTCTATTTACAGGAGGATTGGTGTCATCCAAGTATGCCGAGGAGGTTATATTGGATACGGTCAACGA
GATTTGTGGGAGTGGTCTGCAGAGATGTGTGGGGATAATTGCAGATAGGTATAAGGCCAAGGCGTTGAGAAATTTGGAGATCAAGAATCATTGGATGGTAAATCTCTCTT
GCCAGCTTCAGGGTTTTATTAGTTTGATAAAGGATTTTAACAAAGAGCTTCCACTTTTCAGGGTAGTCACTGAAAATTGCCTGAAGGTAGCAAACTTTGTAAATACCAAA
TATCCAGTTAGGAATAGTTTAAACAAGTACAAGGTGCAGGAGCTAGAGGGTCACTGGTTGCTTCACGTTCCTTCGCCAAACTGTGATACGTCCAAAAACTTCTCACCTGT
TTATGCTATGCTTGATGATATGCTTAGCTGTGCTCATGTGCTTCAAATGGTGGTGTTAGACGAGTCCTATAAGCTGGCATGCATGGAGGATCCACTTGCGTCTGAGGTTT
CTAGTCTGATACAAAATGAACGCTTTTGGGATGAAGTGGAGGCAGTTCACTCACTTGTGAAGATGATCCGAGGGATGGCTCAAGAGATTGAAGCGGAAAGGCCACTGATT
GGGCAATGCTTGCCTCTCTGGGAGGAGCTGAGAACAAAAGTGAAGGAATGGTGTGCCAAGTTCAACATTGCTGAAGGGCCAGTGGAGAAAATTATAGAAAAACGGTTTAG
GAAAAATTATCATCCAGCATGGTCAGCTGCATTTATACTGGACCCGCTTTACTTGAGGAGGGACATAAATGGGAAATATCTTCCACCCTTCAAGTGCCTTTCGCAAGACC
AGGAAAAGGATGTGGATTCACTTATTAATCGGTTGGTGTCTAGGGAAGAAGCTCATGTCGCATTCATGGAGCTTATGAAATGGCGATCCGAAGGGCTAGACCCACTTTAT
GCTCAGGCAGTTCAGGTTAAACAACGAGACCCTTTGACTGGAAAGATGAAAATTGCCAACCCCCAGAGTAGGCGACTTGTCTGGGAAACTTGCCTAAGCGAGTTCAAGGT
CCTTGGTAAGGTTGCACTGAGGCTTATTTTCCTTCATTCAACATCTTGTGGCTACAAGTGTAAATGTTCCATCTTGAATTTGGTATGCTCACATCGGCACTCGAGGGTCG
GCTTGGAGAGAGCTCAGAAGATGATATTTGTTGCAGCCCATGCCAAGCTTGAACGGAGAGACTTTTCTAACGAGGAAGACAAAGATGCAGAACTATTTGCAATGGCAGAT
GGTGAAAATGACATGCTCAATGAGGTCTTTTCTGATGCACCCTCAATGAATGGTTATGAATTCCTAGACTTGCAGAACAGAGGGCATAAAAAAGTAAAAACTAGTCAGGA
ATCGAGTATGAACACTCTTGACTGTCACAATTTTGAAGCACTCTTTGGCATTGCTAGGCTAGCTAAACATATTGGTATTCTTAATGACGAAGTGCTGGATGTGTTTGATC
AAACCAAACCTGAGTTGCCAGAAGTCAACCTCGGACTAGGTACCATATCTTTGCAGCTGAAGATGCACTTCGCTGATAGAATCCCAGATATATCTCCTGTTGTGCCTACT
TATGTTACTTTTTGA
Protein sequence
MSTARYCDSNLSFVAGMASTNSPPIDPSSLTEDLATKALNKRYECLVTVRTKAIKGKGAWYWAHLEPVLIRNPSTSLPKAVKLKCSLCDAVFSASNPSRTASEHLKRGTC
PNLSSISGSNASASPMPISSIPSPTLHNHKKRSSQMNAPILTASYQVHSLAMIEPTRSYAPLISSPPPVTQNPLGMASKMGLTQHQLVLSGGKDDLGALEMLENSVKKLK
SPHASPGPRLSKEQIDSAIELLTDWFIESCGSVSLSCLEHPKFKALLNQLGLPSLPRTDILGARLDSKFEEAKADSEARIRDSVFFQIASDGWKNKNCCGFYCGEESVVR
FMVNLPNGTTVFQKALFTGGLVSSKYAEEVILDTVNEICGSGLQRCVGIIADRYKAKALRNLEIKNHWMVNLSCQLQGFISLIKDFNKELPLFRVVTENCLKVANFVNTK
YPVRNSLNKYKVQELEGHWLLHVPSPNCDTSKNFSPVYAMLDDMLSCAHVLQMVVLDESYKLACMEDPLASEVSSLIQNERFWDEVEAVHSLVKMIRGMAQEIEAERPLI
GQCLPLWEELRTKVKEWCAKFNIAEGPVEKIIEKRFRKNYHPAWSAAFILDPLYLRRDINGKYLPPFKCLSQDQEKDVDSLINRLVSREEAHVAFMELMKWRSEGLDPLY
AQAVQVKQRDPLTGKMKIANPQSRRLVWETCLSEFKVLGKVALRLIFLHSTSCGYKCKCSILNLVCSHRHSRVGLERAQKMIFVAAHAKLERRDFSNEEDKDAELFAMAD
GENDMLNEVFSDAPSMNGYEFLDLQNRGHKKVKTSQESSMNTLDCHNFEALFGIARLAKHIGILNDEVLDVFDQTKPELPEVNLGLGTISLQLKMHFADRIPDISPVVPT
YVTF
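
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code and a CDS that starts in frame at its first base; the names `CODON_TABLE` and `translate` are hypothetical and not part of the database.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing characters
    other than T, C, A, G are reported as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 60 bases and last 15 bases of the CDS listed above.
print(translate("ATGTCTACAGCTCGTTATTGTGATAGTAATTTGAGTTTTGTTGCTGGGATGGCTTCCACC"))
print(translate("TATGTTACTTTTTGA"))
```

Pasting the full CDS, line breaks included, into `translate` should yield a string that can be compared directly with the protein sequence after its own line breaks are removed.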