CuGenDBv2

Tan0015242 (gene) of Snake gourd v1 genome

Gene ID: Tan0015242
Organism: Trichosanthes anguina (Snake gourd v1)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110419695
Genome location: LG01:108047839..108049710
RNA-Seq Expression: Tan0015242
Synteny: Tan0015242
Gene Ontology terms: GO:0009793 - embryo development ending in seed dormancy (biological process)
InterPro domains: NA


Homology
GenBank top hits (columns: e-value, % identity, alignment)
KAG6591919.1 hypothetical protein SDJN03_14265, partial [Cucurbita argyrosperma subsp. sororia]; e-value 3.2e-260; identity 79.63%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT QFPL KTLNPSSPFL STS  PFSNPLLQTLTLKSHQT KPLSIISG PN   LPI RQI QFPFA  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNS+SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS

Query:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------
        EVKDHEEWE+IEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE                               
Subjt:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------

Query:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
                 RFEPQ EFFQA+MPFLDPKTE+D+LFEL++DEG+VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLS+EKKSKCLEFLLTNHPVPLLHPYT
Subjt:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF
        KEWKAKLEEEELGCDAP DD EN+  D+NV+ EWIETD+ +++ ED+ EDVV  MET EE   EDE+D  E ++ EEDEDYWDERFRKAISSPEELEKL 
Subjt:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF

Query:  KRSAEVSDELYEKQKGN-------ME-DGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        KRS E SDE YEKQKG        ME DGDETELRGKRAKVK  EWE+IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  KRSAEVSDELYEKQKGN-------ME-DGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

KAG7024792.1 hypothetical protein SDJN02_13611, partial [Cucurbita argyrosperma subsp. argyrosperma]; e-value 1.9e-260; identity 79.8%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT QFPL KTLNPSSPFL STS  PFSNPLLQTLTLKSHQT KPLSIISG PN   LPI RQI QFPFA  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNS+SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS

Query:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------
        EVKDHEEWE+IEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE                               
Subjt:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------

Query:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
                 RFEPQ EFFQA+MPFLDPKTE+D+LFEL++DEG+VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF
        KEWKAKLEEEELGCDAP DD EN+  D+NV+ EWIETD+ +++ ED  EDVV  MET EE   EDE+D  E ++ EEDEDYWDERFRKAISSPEELEKL 
Subjt:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF

Query:  KRSAEVSDELYEKQKGN-------ME-DGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        KRS E SDE YEKQKG        ME DGDETELRGKRAKVK  EWE+IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  KRSAEVSDELYEKQKGN-------ME-DGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_022142781.1 uncharacterized protein LOC111012814 [Momordica charantia]; e-value 6.3e-264; identity 80.63%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MATL F + KTLNPSSP L      PFSNPLLQTLTLK H++HKPLSI+S  PNP +LPISRQI QFPFA I RDIRTFAGRSKKKGGG SPGRIEGNAE
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
        FRR+LRQNARRKSQK AESHFYRRKNS+SNYADNF+EDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGE+GPYSWRGVVVGEPIRGRFTDERVTIIS
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS

Query:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------
        EVKDHEEWE+IEQSEMASDFSEGLQRMD+SKGFRYFWVFVRHPRWRIS+LPWQQWTLIAEVVLEAGKE                                
Subjt:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------

Query:  --------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
                RFEPQ EFFQAIMPFLDPKTE+D LFEL+NDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
Subjt:  --------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK

Query:  EWKAKLEEEELGCDAPDDIENQRGDD--NVITEWIETDEENEE--DEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK
        EWKAKLEEEELGCDAPDD E +RG D  NVI EWIETD++N+E  DEDQ +D+VME    E G+++  D K +DRS EEDEDYWDERFRKAISSPEE+EK
Subjt:  EWKAKLEEEELGCDAPDDIENQRGDD--NVITEWIETDEENEE--DEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK

Query:  LFKRSAEVSDELYEKQ------KGNMEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        LFKRSAEVSDELYEKQ      K  MEDGDETE+RGKRAKV+A EWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  LFKRSAEVSDELYEKQ------KGNMEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_022937202.1 uncharacterized protein LOC111443567 [Cucurbita moschata]; e-value 1.9e-260; identity 79.3%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MA  QFPL KTLNPSSPFL STS  PFSNPLLQTLTLKSHQT KPLSIISG PN   LPI RQI QFPFA  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNS+SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS

Query:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------
        EVKDHEEWE+IEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE                               
Subjt:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------

Query:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
                 RFEPQ EFFQA+MPFLDPKTE+D+LFEL++DEG+VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF
        KEWKAKLEEEELGCDAP DD EN+  D+NV+ EWIETD+ +++ ED+ EDVV  MET EE   EDE+D  E ++ EEDEDYWDERFRKAISSPEELEKL 
Subjt:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF

Query:  KRSAEVSDELYEKQKGN--------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        KRS E SDE YEKQKG          +DGDETELRGKRAKVK  EWE+IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  KRSAEVSDELYEKQKGN--------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

XP_038898752.1 uncharacterized protein LOC120086270 [Benincasa hispida]; e-value 1.2e-262; identity 80.7%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MAT QFPLSKTLN SS FL STS +PF +PLLQTLTLKSHQTHKPLSI SG PNP +LPISRQI    FA   R+IRT AGRSKKKGGGPSPGRIEGNAE
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
        FRRKLR NARRKSQKLAESHFYRRK  +SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS

Query:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------
        EVKDHEEWE+IEQSEMASDFS+GL RMD+SKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKE                                
Subjt:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------

Query:  --------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDE-GDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
                RFEPQ EFFQAIMPFLDPKTE+D LFEL++DE GDVEWVTYF GLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  --------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDE-GDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAPDDIENQRGDDNVITEWIETDEENEED--EDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDED--YWDERFRKAISSPEELE
        KEWKAKLEEEELGCDAPDDIE + GD+NVITEWIETD++N ED  EDQPE+ V+ MET +E EDEDEDDKRED + EE+ED  YWDERFRKAISSPEELE
Subjt:  KEWKAKLEEEELGCDAPDDIENQRGDDNVITEWIETDEENEED--EDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDED--YWDERFRKAISSPEELE

Query:  KLFKRSAEVSDELYEKQKGN--------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI
        KLFK SAEV+DE YEK+K +        MEDGDETELRGKRAKVKA EWE IGYGPWRK+IKKS+IPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI
Subjt:  KLFKRSAEVSDELYEKQKGN--------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEI

Query:  G
        G
Subjt:  G

TrEMBL top hits (columns: e-value, % identity, alignment)
A0A1S3CKF2 uncharacterized protein LOC103501814; e-value 9.5e-250; identity 77.3%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTH--KPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGN
        MAT QFP  KTLNPSSPFL STS  PFSNPLLQTLTLKSHQTH  KPLSI+SG  NP+      QI   P    R DIRT AGRSKK  GGPSPGRIEGN
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTH--KPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGN

Query:  AEFRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
        AEFRRKLR NARRKSQKLAESHFYRRK  +SNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
Subjt:  AEFRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-----------------------------
        ISEVKDHEEWE+IEQSEMA+DFS GLQRMD+SKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKE                              
Subjt:  ISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-----------------------------

Query:  ----------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY
                  RFEPQ EFFQA+MPFLDPKTE+D LFEL++DEG+VEWVTYFGGLCKIVR++PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPY
Subjt:  ----------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY

Query:  TKEWKAKLEEEELGCDAPDDIENQRGDDNVITEWIETDEENEEDEDQP-EDVVME-METGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK
        TKEWKAKLEEEELGCDAPD++EN+R DDNVITEWIETD E EE EDQP ED+VME M+  ++ ED+DE ++      EEDE YWDERFRKAISSPEELEK
Subjt:  TKEWKAKLEEEELGCDAPDDIENQRGDDNVITEWIETDEENEEDEDQP-EDVVME-METGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK

Query:  LFKRSAEVSDELYEKQKGN------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        LFKRS E++DELYEK+         M+DGDE E+RGKR KVKA EWE IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  LFKRSAEVSDELYEKQKGN------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A5A7VK56 Uncharacterized protein; e-value 9.5e-250; identity 77.3%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTH--KPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGN
        MAT QFP  KTLNPSSPFL STS  PFSNPLLQTLTLKSHQTH  KPLSI+SG  NP+      QI   P    R DIRT AGRSKK  GGPSPGRIEGN
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTH--KPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGN

Query:  AEFRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
        AEFRRKLR NARRKSQKLAESHFYRRK  +SNYADNFSEDELQQIGLGYDRMVRF+EKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
Subjt:  AEFRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-----------------------------
        ISEVKDHEEWE+IEQSEMA+DFS GLQRMD+SKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKE                              
Subjt:  ISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-----------------------------

Query:  ----------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY
                  RFEPQ EFFQA+MPFLDPKTE+D LFEL++DEG+VEWVTYFGGLCKIVR++PKAFVDDVVNAYEKLSDEKKS CLEFLL+NHPVPLLHPY
Subjt:  ----------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPY

Query:  TKEWKAKLEEEELGCDAPDDIENQRGDDNVITEWIETDEENEEDEDQP-EDVVME-METGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK
        TKEWKAKLEEEELGCDAPD++EN+R DDNVITEWIETD E EE EDQP ED+VME M+  ++ ED+DE ++      EEDE YWDERFRKAISSPEELEK
Subjt:  TKEWKAKLEEEELGCDAPDDIENQRGDDNVITEWIETDEENEEDEDQP-EDVVME-METGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK

Query:  LFKRSAEVSDELYEKQKGN------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        LFKRS E++DELYEK+         M+DGDE E+RGKR KVKA EWE IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  LFKRSAEVSDELYEKQKGN------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A6J1CN80 uncharacterized protein LOC111012814; e-value 3.1e-264; identity 80.63%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MATL F + KTLNPSSP L      PFSNPLLQTLTLK H++HKPLSI+S  PNP +LPISRQI QFPFA I RDIRTFAGRSKKKGGG SPGRIEGNAE
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
        FRR+LRQNARRKSQK AESHFYRRKNS+SNYADNF+EDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGE+GPYSWRGVVVGEPIRGRFTDERVTIIS
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS

Query:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------
        EVKDHEEWE+IEQSEMASDFSEGLQRMD+SKGFRYFWVFVRHPRWRIS+LPWQQWTLIAEVVLEAGKE                                
Subjt:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------

Query:  --------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
                RFEPQ EFFQAIMPFLDPKTE+D LFEL+NDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK
Subjt:  --------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYTK

Query:  EWKAKLEEEELGCDAPDDIENQRGDD--NVITEWIETDEENEE--DEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK
        EWKAKLEEEELGCDAPDD E +RG D  NVI EWIETD++N+E  DEDQ +D+VME    E G+++  D K +DRS EEDEDYWDERFRKAISSPEE+EK
Subjt:  EWKAKLEEEELGCDAPDDIENQRGDD--NVITEWIETDEENEE--DEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEK

Query:  LFKRSAEVSDELYEKQ------KGNMEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        LFKRSAEVSDELYEKQ      K  MEDGDETE+RGKRAKV+A EWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
Subjt:  LFKRSAEVSDELYEKQ------KGNMEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A6J1FAH0 uncharacterized protein LOC111443567; e-value 9.2e-261; identity 79.3%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE
        MA  QFPL KTLNPSSPFL STS  PFSNPLLQTLTLKSHQT KPLSIISG PN   LPI RQI QFPFA  R DIRTFAGRSKKKGGGPSPGRIEGNAE
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAE

Query:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS
        FRRKLR N RRKSQK AESHFYRRKNS+SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+I 
Subjt:  FRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIIS

Query:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------
        EVKDHEEWE+IEQSEMASDFSEGLQRMDRSKGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE                               
Subjt:  EVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-------------------------------

Query:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
                 RFEPQ EFFQA+MPFLDPKTE+D+LFEL++DEG+VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT
Subjt:  ---------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHPYT

Query:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF
        KEWKAKLEEEELGCDAP DD EN+  D+NV+ EWIETD+ +++ ED+ EDVV  MET EE   EDE+D  E ++ EEDEDYWDERFRKAISSPEELEKL 
Subjt:  KEWKAKLEEEELGCDAP-DDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEELEKLF

Query:  KRSAEVSDELYEKQKGN--------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV
        KRS E SDE YEKQKG          +DGDETELRGKRAKVK  EWE+IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+GEIGV
Subjt:  KRSAEVSDELYEKQKGN--------MEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV

A0A6J1INI9 uncharacterized protein LOC111476853; e-value 5.6e-258; identity 78.97%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQ--TLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGN
        MAT QFPL KTLNPSSPFL STS  PFSNPLLQ  TLTLKSH+T KPLSIISG PN   LPI RQI QFPFA  R DIRTFAGRSKKKGGG SPGRIEGN
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQ--TLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGN

Query:  AEFRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI
        AEFRRKLR N RRKSQK AESHFYRRKNS+SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHP+DWYKYGEFGPYSWRGVV+GEPIRGRFTDERVT+
Subjt:  AEFRRKLRQNARRKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTI

Query:  ISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-----------------------------
        I EVKDHEEWE+IEQSEMASDFSEGLQRMDR+KGFR+FWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE                             
Subjt:  ISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE-----------------------------

Query:  -----------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHP
                   RFEPQ EFFQA+MPFLDPKTE+D+LFEL++DEG+VEWVTYFGGLCKI+RVNPKAFVDDV NAYEKLSDEKKSKCLEFLLTNHPVPLLHP
Subjt:  -----------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNHPVPLLHP

Query:  YTKEWKAKLEEEELGCDAPDDIE---NQRGDDNVITEWIETDEENEED-EDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEE
        YTKEWKAKLEEEELGCDAPDD +   N+  D+NVI EWIETD++N+ D ED+ EDVV  MET EE   EDE+D  E ++ EEDEDYWDERFRKAISSPEE
Subjt:  YTKEWKAKLEEEELGCDAPDDIE---NQRGDDNVITEWIETDEENEED-EDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDYWDERFRKAISSPEE

Query:  LEKLFKRSAEVSDELYEKQKG-NM-------EDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG
        LEKL KRS E SDE YEKQKG NM       +DGDETELRGKRAKVK  EWE+IGYGPWRK+IKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAIL+G
Subjt:  LEKLFKRSAEVSDELYEKQKG-NM-------EDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDG

Query:  EIGV
        EIGV
Subjt:  EIGV

SwissProt top hits (columns: e-value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e-value, % identity, alignment)
AT3G14900.1 unknown protein; e-value 6.1e-156; identity 50%
Query:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKK-GGGPSPGRIEGNA
        M +     SKTLNPS  F +S    P ++ + + +++    T +         N  +     ++        RRD+R  AGRSKKK GGG S GRIEG++
Subjt:  MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKK-GGGPSPGRIEGNA

Query:  EFRRKLRQNARRKSQKLAESHFYRRKNSS--------SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRF
        + R+++++NAR KS+KLAES FYR  N+         S++ D F+E+EL+ IGLGYDRMVRFM+KDDP LRHPYDW+KYGEFGPYSWRGVVVG+P+RG  
Subjt:  EFRRKLRQNARRKSQKLAESHFYRRKNSS--------SNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRF

Query:  TDERVTIISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE----------------------
        +DE VT+I EV++HEE+E+IEQ EM   F + ++ +D + G RYFWVFVRHP+WR+SELPW+QWTL++EVV+EA K++                      
Subjt:  TDERVTIISEVKDHEEWEEIEQSEMASDFSEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEE----------------------

Query:  ------------------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNH
                          RFEPQ +FF +++P+L+P TE   + E+E+DEG VE  TY+GGLCK+++V   AFVDDVVNAYEKLSDEKKS+ L+FLL NH
Subjt:  ------------------RFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVNAYEKLSDEKKSKCLEFLLTNH

Query:  PVPLLHPYTKEWKAKLEEEELGCDAPDDIENQ-----RGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEG-----------EDEDEDDKREDRSGE
        P  LLHPYTKEWKAKLEE ELGCDAPD+ E++       +    +EWIE + +N++D+D  +D   E+E  ++            E++  +D+ E+   E
Subjt:  PVPLLHPYTKEWKAKLEEEELGCDAPDDIENQ-----RGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEG-----------EDEDEDDKREDRSGE

Query:  EDEDYWDERFRKAISSPEELEKLFKRSAEVSDELYEKQ--------KGNMEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPF
        EDE YW+E+F KA ++ E +EKL + S  VSD+ YEKQ        KG +E GDE E+RGK+AKVK  EW+ +GYG W K+IKKS+IPPELFLR+ VRPF
Subjt:  EDEDYWDERFRKAISSPEELEKLFKRSAEVSDELYEKQ--------KGNMEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPF

Query:  TYRNLVKEIVLTRHAILDGEIG
         YRNLVKEIVLTRHAIL+GEIG
Subjt:  TYRNLVKEIVLTRHAILDGEIG


Sequences
CDS sequence
ATGGCCACTCTGCAATTCCCTCTTTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCAATCCACCTCCTTCGCACCATTCTCCAATCCTCTTCTTCAAACCCTAACCCT
AAAATCCCATCAAACCCACAAACCACTCTCCATTATATCTGGTTGCCCAAATCCTTTTTATCTTCCGATCTCCCGCCAAATTCCACAATTCCCATTCGCAAAAATCCGCC
GGGATATCCGGACGTTCGCCGGCCGGAGCAAGAAGAAGGGCGGCGGGCCATCACCCGGGCGCATAGAAGGAAACGCCGAGTTCCGGCGGAAGTTGAGGCAAAATGCCCGC
CGGAAGAGCCAGAAGCTCGCGGAGTCTCATTTCTACCGCCGCAAGAACTCGAGCAGTAATTACGCTGATAACTTCAGCGAAGATGAGCTTCAGCAGATCGGCCTCGGCTA
CGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAATCTGCGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGCGTGGTGTCGTCGTCG
GCGAGCCGATTCGCGGGCGGTTCACGGACGAGCGAGTTACGATAATCAGCGAGGTTAAGGATCACGAGGAATGGGAGGAGATCGAGCAATCAGAAATGGCGTCTGATTTC
AGCGAGGGATTACAGCGGATGGACAGAAGCAAAGGGTTTCGGTACTTTTGGGTGTTCGTGAGGCACCCGAGGTGGAGGATTTCGGAGCTGCCATGGCAGCAGTGGACTTT
GATTGCAGAGGTTGTGCTTGAAGCAGGTAAAGAAGAAAGATTCGAGCCTCAGGGCGAGTTTTTCCAGGCGATAATGCCATTTCTCGATCCGAAAACAGAGAAAGATCTCC
TGTTTGAGTTGGAGAATGATGAAGGAGATGTTGAATGGGTGACTTATTTTGGTGGGCTGTGTAAGATTGTGAGGGTGAATCCAAAGGCATTTGTGGATGATGTGGTGAAT
GCTTATGAGAAGCTGAGTGATGAGAAGAAATCCAAGTGCTTGGAGTTTCTTCTGACCAACCACCCTGTTCCATTGCTGCATCCATATACCAAAGAGTGGAAGGCCAAGTT
AGAGGAGGAGGAATTGGGCTGTGATGCTCCGGACGACATCGAGAATCAACGCGGTGACGACAATGTGATCACGGAGTGGATTGAGACTGATGAAGAGAATGAAGAGGATG
AGGATCAGCCTGAGGATGTGGTGATGGAGATGGAGACAGGGGAAGAAGGCGAAGACGAGGACGAGGACGACAAACGAGAGGATCGGAGTGGAGAAGAAGATGAGGATTAT
TGGGATGAGAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAGCTGGAGAAGCTGTTCAAGCGCAGTGCAGAAGTGAGTGATGAATTGTATGAGAAACAGAAGGGGAATAT
GGAAGATGGGGATGAGACTGAATTGAGAGGGAAGAGAGCAAAAGTGAAAGCTTCAGAATGGGAGCAAATTGGGTATGGGCCATGGAGGAAGAGGATAAAGAAAAGTCAAA
TTCCTCCAGAGCTGTTTTTGAGATCTACAGTCAGACCATTTACTTATAGGAACCTTGTGAAGGAGATTGTATTGACCAGACATGCTATTTTGGATGGTGAAATTGGGGTA
TGA
mRNA sequence
CCAAAAACTGTCCAAATGGCCACTCTGCAATTCCCTCTTTCTAAAACCCTAAACCCTTCATCTCCATTTCTCCAATCCACCTCCTTCGCACCATTCTCCAATCCTCTTCT
TCAAACCCTAACCCTAAAATCCCATCAAACCCACAAACCACTCTCCATTATATCTGGTTGCCCAAATCCTTTTTATCTTCCGATCTCCCGCCAAATTCCACAATTCCCAT
TCGCAAAAATCCGCCGGGATATCCGGACGTTCGCCGGCCGGAGCAAGAAGAAGGGCGGCGGGCCATCACCCGGGCGCATAGAAGGAAACGCCGAGTTCCGGCGGAAGTTG
AGGCAAAATGCCCGCCGGAAGAGCCAGAAGCTCGCGGAGTCTCATTTCTACCGCCGCAAGAACTCGAGCAGTAATTACGCTGATAACTTCAGCGAAGATGAGCTTCAGCA
GATCGGCCTCGGCTACGATCGGATGGTCCGATTCATGGAGAAAGACGACCCGAATCTGCGCCATCCCTACGACTGGTACAAGTACGGCGAGTTCGGCCCGTACTCGTGGC
GTGGTGTCGTCGTCGGCGAGCCGATTCGCGGGCGGTTCACGGACGAGCGAGTTACGATAATCAGCGAGGTTAAGGATCACGAGGAATGGGAGGAGATCGAGCAATCAGAA
ATGGCGTCTGATTTCAGCGAGGGATTACAGCGGATGGACAGAAGCAAAGGGTTTCGGTACTTTTGGGTGTTCGTGAGGCACCCGAGGTGGAGGATTTCGGAGCTGCCATG
GCAGCAGTGGACTTTGATTGCAGAGGTTGTGCTTGAAGCAGGTAAAGAAGAAAGATTCGAGCCTCAGGGCGAGTTTTTCCAGGCGATAATGCCATTTCTCGATCCGAAAA
CAGAGAAAGATCTCCTGTTTGAGTTGGAGAATGATGAAGGAGATGTTGAATGGGTGACTTATTTTGGTGGGCTGTGTAAGATTGTGAGGGTGAATCCAAAGGCATTTGTG
GATGATGTGGTGAATGCTTATGAGAAGCTGAGTGATGAGAAGAAATCCAAGTGCTTGGAGTTTCTTCTGACCAACCACCCTGTTCCATTGCTGCATCCATATACCAAAGA
GTGGAAGGCCAAGTTAGAGGAGGAGGAATTGGGCTGTGATGCTCCGGACGACATCGAGAATCAACGCGGTGACGACAATGTGATCACGGAGTGGATTGAGACTGATGAAG
AGAATGAAGAGGATGAGGATCAGCCTGAGGATGTGGTGATGGAGATGGAGACAGGGGAAGAAGGCGAAGACGAGGACGAGGACGACAAACGAGAGGATCGGAGTGGAGAA
GAAGATGAGGATTATTGGGATGAGAGGTTCAGGAAGGCAATAAGTAGTCCAGAAGAGCTGGAGAAGCTGTTCAAGCGCAGTGCAGAAGTGAGTGATGAATTGTATGAGAA
ACAGAAGGGGAATATGGAAGATGGGGATGAGACTGAATTGAGAGGGAAGAGAGCAAAAGTGAAAGCTTCAGAATGGGAGCAAATTGGGTATGGGCCATGGAGGAAGAGGA
TAAAGAAAAGTCAAATTCCTCCAGAGCTGTTTTTGAGATCTACAGTCAGACCATTTACTTATAGGAACCTTGTGAAGGAGATTGTATTGACCAGACATGCTATTTTGGAT
GGTGAAATTGGGGTATGAAATTATCTAATCTCATCATCAGTCTGCTTCTGGTTAGATGGATTTTTATTAAAAAAAGAATAATTCTACTTTTTGAACAAAAAA
Protein sequence
MATLQFPLSKTLNPSSPFLQSTSFAPFSNPLLQTLTLKSHQTHKPLSIISGCPNPFYLPISRQIPQFPFAKIRRDIRTFAGRSKKKGGGPSPGRIEGNAEFRRKLRQNAR
RKSQKLAESHFYRRKNSSSNYADNFSEDELQQIGLGYDRMVRFMEKDDPNLRHPYDWYKYGEFGPYSWRGVVVGEPIRGRFTDERVTIISEVKDHEEWEEIEQSEMASDF
SEGLQRMDRSKGFRYFWVFVRHPRWRISELPWQQWTLIAEVVLEAGKEERFEPQGEFFQAIMPFLDPKTEKDLLFELENDEGDVEWVTYFGGLCKIVRVNPKAFVDDVVN
AYEKLSDEKKSKCLEFLLTNHPVPLLHPYTKEWKAKLEEEELGCDAPDDIENQRGDDNVITEWIETDEENEEDEDQPEDVVMEMETGEEGEDEDEDDKREDRSGEEDEDY
WDERFRKAISSPEELEKLFKRSAEVSDELYEKQKGNMEDGDETELRGKRAKVKASEWEQIGYGPWRKRIKKSQIPPELFLRSTVRPFTYRNLVKEIVLTRHAILDGEIGV