CuGenDBv2

Lsi04G006000 (gene) of Bottle gourd (USVL1VR-Ls) v1 genome

Gene ID: Lsi04G006000
Organism: Lagenaria siceraria USVL1VR-Ls (Bottle gourd (USVL1VR-Ls) v1)
Description: Protein of unknown function (DUF707)
Genome location: chr04:5538652..5544070
RNA-Seq Expression: Lsi04G006000
Synteny: Lsi04G006000
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007877 - Protein of unknown function DUF707
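
The genome location above follows a `chromosome:start..end` pattern. The following is a minimal sketch of parsing it and computing the span of the gene model. It assumes 1-based, inclusive coordinates, which the record itself does not state; the function name `parse_location` is illustrative and not part of CuGenDB.

```python
import re
from typing import Tuple

def parse_location(location: str) -> Tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))

chrom, start, end = parse_location("chr04:5538652..5544070")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the locus spans 5419 bp on chr04; with a 0-based, half-open convention the span would be one base shorter.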


Homology
GenBank top hits (e value, % identity, alignment)
XP_008446151.1 PREDICTED: uncharacterized protein LOC103488962 [Cucumis melo] (e value: 4.1e-225; % identity: 93.8)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP
        MGILYRSSVNRKPNDSMRLIIITF+GVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTD+K+GSS         TR GSGD N+SQN NGTSEIWVP
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP

Query:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
        SNPRGAERLAPGIVAAESD YLHRLWGNPNEDL TKPNYLVTFTVGY QRENID+AVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
Subjt:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY

Query:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVER GWCTEPNLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR

Query:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS
        EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKS+GID SNS
Subjt:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS

Query:  AKQ
         KQ
Subjt:  AKQ

XP_011655601.1 uncharacterized protein LOC101222926 [Cucumis sativus] (e value: 7.8e-224; % identity: 93.05)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP
        MGILYRSSVNRKPNDSMRLIIITF+GVFLGFLIGISFPTLSLTKLGIPSGLIPKID+MYNTD+K+GSS         T  GSGD NNS N NGTSEIWVP
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP

Query:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
        SNPRGAERLAPGIVAAESD YLHRLWGNP+EDL TKPNYLVTFTVGY Q+ENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
Subjt:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY

Query:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDT ERPGWCTEPNLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR

Query:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS
        EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVD+QWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGID SNS
Subjt:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS

Query:  AKQ
         KQ
Subjt:  AKQ

XP_022151879.1 uncharacterized protein LOC111019745 [Momordica charantia] (e value: 6.2e-221; % identity: 91.58)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSG----------SSYTRNGSGDSNNSQNLNGTSEIW
        MGILYRSSVNRKPNDSMRLIIITFVGV +GFLIGISFPTLSLTKLGIPSGLIP ID +YNTDLKSG           SYTR+G+GD NNS+NLNGTS+IW
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSG----------SSYTRNGSGDSNNSQNLNGTSEIW

Query:  VPSNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKW
        VPSNPRGAE LAPGIVAAESD YLHRLWGNPNEDLT KPNYLVTFTVGYNQR+NIDKAVKKFSENFTILLFHYDGRTTEWD+FEWSKRAIHVSARKQSKW
Subjt:  VPSNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKW

Query:  WYAKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVF
        WYAKRFLHPDIVAPYDYIFMWDEDLGVE+FDAEEYIKLVRKHGLEISQPGLEP RGLTWQMTKKRD LEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVF
Subjt:  WYAKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVF

Query:  SREAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSS
        SR+AWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSR+ANAE AYFKSLGID S
Subjt:  SREAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSS

Query:  NSAK
        NSAK
Subjt:  NSAK

XP_023516217.1 uncharacterized protein LOC111780139 [Cucurbita pepo subsp. pepo] (e value: 7.6e-219; % identity: 91.5)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS------YTRNGSGDSNNSQNLNGTSEIWVPSN
        MGILYRSSVNRKPNDSMRLII+TFVGVFLGF IGISFPTLSLTKLG+PSGLIPKID+MYN DLKS SS      +  +GS D  NSQNLNGTSEIWVPSN
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS------YTRNGSGDSNNSQNLNGTSEIWVPSN

Query:  PRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAK
        PRGAERLAPGIVAAESD YL RLWGNP+EDLT KPNYLVTFTVGYNQR+NIDKAVKKFSENFTILLFHYDGRTTEWD+FEWSKRAIHVSARKQSKWWYAK
Subjt:  PRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAK

Query:  RFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREA
        RFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLV+KHGLEISQPGLEPTRGLTWQMTKKRD LEVHKDT ER GWCTEPNLPPCAAFVEIMAPVFSR+A
Subjt:  RFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREA

Query:  WRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSAK
        WRCVWYMIQNDLIHGWGLDF++RKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGID SNSAK
Subjt:  WRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSAK

XP_038891433.1 uncharacterized protein LOC120080853 [Benincasa hispida] (e value: 2.1e-229; % identity: 95.54)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGS---------SYTRNGSGDSNNSQNLNGTSEIWV
        MGILYRSSVNRKPN+SMRLIIITF+GVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQ+YNTDLKSGS         S TRNGSGDSNNS+NLNGTSEIWV
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGS---------SYTRNGSGDSNNSQNLNGTSEIWV

Query:  PSNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWW
        PSNPRGAERLAPGIVAAESD YLHRLWGNPNEDLTTKPNYLVTFTVGY QRENIDKAVKKFSENFTILLFHYDG+TTEWDEFEWSKRAIHVSARKQSKWW
Subjt:  PSNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWW

Query:  YAKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFS
        YAKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFS
Subjt:  YAKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFS

Query:  REAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSN
        REAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSN
Subjt:  REAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSN

Query:  SAKQ
        SAKQ
Subjt:  SAKQ
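
The % identity values listed for each hit can be approximated from the match line that sits between the Query and Subjt rows of an alignment. The sketch below assumes the usual BLAST convention, in which a letter on the match line marks an identical residue, `+` marks a conservative substitution, a space marks a mismatch or gap, and identity is the number of identical columns divided by the total number of alignment columns, gaps included. The function name `percent_identity` is illustrative. In the alignments as captured here the Subjt rows repeat the Query rows, so the match line is the row that shows where the sequences differ.

```python
def percent_identity(match_line: str, aligned_length: int) -> float:
    """Percent identity from a BLAST-style match line.

    Only letters are counted as identities; '+' (conservative substitution)
    and ' ' (mismatch or gap) are not.
    """
    if aligned_length <= 0:
        raise ValueError("aligned_length must be positive")
    identical = sum(1 for ch in match_line if ch.isalpha())
    return 100.0 * identical / aligned_length

# Final segment of the first GenBank alignment: query "AKQ", match line " KQ".
print(round(percent_identity(" KQ", len("AKQ")), 2))
```

For a whole hit, concatenate the match lines of all segments and use the total number of alignment columns as the denominator before comparing with the reported value.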

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KVG3 Uncharacterized protein (e value: 3.8e-224; % identity: 93.05)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP
        MGILYRSSVNRKPNDSMRLIIITF+GVFLGFLIGISFPTLSLTKLGIPSGLIPKID+MYNTD+K+GSS         T  GSGD NNS N NGTSEIWVP
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP

Query:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
        SNPRGAERLAPGIVAAESD YLHRLWGNP+EDL TKPNYLVTFTVGY Q+ENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
Subjt:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY

Query:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDT ERPGWCTEPNLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR

Query:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS
        EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVD+QWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGID SNS
Subjt:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS

Query:  AKQ
         KQ
Subjt:  AKQ

A0A1S4DWR8 uncharacterized protein LOC103488962 (e value: 2.0e-225; % identity: 93.8)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP
        MGILYRSSVNRKPNDSMRLIIITF+GVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTD+K+GSS         TR GSGD N+SQN NGTSEIWVP
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP

Query:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
        SNPRGAERLAPGIVAAESD YLHRLWGNPNEDL TKPNYLVTFTVGY QRENID+AVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
Subjt:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY

Query:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVER GWCTEPNLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR

Query:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS
        EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKS+GID SNS
Subjt:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS

Query:  AKQ
         KQ
Subjt:  AKQ

A0A5D3CXI6 Uncharacterized protein (e value: 2.0e-225; % identity: 93.8)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP
        MGILYRSSVNRKPNDSMRLIIITF+GVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTD+K+GSS         TR GSGD N+SQN NGTSEIWVP
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS--------YTRNGSGDSNNSQNLNGTSEIWVP

Query:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
        SNPRGAERLAPGIVAAESD YLHRLWGNPNEDL TKPNYLVTFTVGY QRENID+AVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY
Subjt:  SNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWY

Query:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR
        AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVER GWCTEPNLPPCAAFVEIMAPVFSR
Subjt:  AKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSR

Query:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS
        EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKS+GID SNS
Subjt:  EAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS

Query:  AKQ
         KQ
Subjt:  AKQ

A0A6J1DFZ5 uncharacterized protein LOC111019745 (e value: 3.0e-221; % identity: 91.58)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSG----------SSYTRNGSGDSNNSQNLNGTSEIW
        MGILYRSSVNRKPNDSMRLIIITFVGV +GFLIGISFPTLSLTKLGIPSGLIP ID +YNTDLKSG           SYTR+G+GD NNS+NLNGTS+IW
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSG----------SSYTRNGSGDSNNSQNLNGTSEIW

Query:  VPSNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKW
        VPSNPRGAE LAPGIVAAESD YLHRLWGNPNEDLT KPNYLVTFTVGYNQR+NIDKAVKKFSENFTILLFHYDGRTTEWD+FEWSKRAIHVSARKQSKW
Subjt:  VPSNPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKW

Query:  WYAKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVF
        WYAKRFLHPDIVAPYDYIFMWDEDLGVE+FDAEEYIKLVRKHGLEISQPGLEP RGLTWQMTKKRD LEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVF
Subjt:  WYAKRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVF

Query:  SREAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSS
        SR+AWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSR+ANAE AYFKSLGID S
Subjt:  SREAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSS

Query:  NSAK
        NSAK
Subjt:  NSAK

A0A6J1JPZ6 uncharacterized protein LOC111486671 (e value: 1.8e-218; % identity: 91.25)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS------YTRNGSGDSNNSQNLNGTSEIWVPSN
        MGILYRSSVNRKPNDSMRLII+TFVGVFLGF IGISFPTLSLTKLG+PSGLIPKID+MYN DLKS SS      +  +GS D  NSQNLNGTSEIWVPSN
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS------YTRNGSGDSNNSQNLNGTSEIWVPSN

Query:  PRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAK
        PRGAERLAPGIVAAESD YL RLWGNP+EDLT KPNYLVTFTVGYNQR+NIDKAVKKFSENFTILLFHYDGRTTEWD+FEWSKRAIHVSARKQSKWWYAK
Subjt:  PRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAK

Query:  RFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREA
        RFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLV+KHGLEISQPGLEPTRGLTWQMTKKRD LEVHKDT ER GWCTEPNLPPCAAFVEIMAPVFSR+A
Subjt:  RFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREA

Query:  WRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSAK
        WRCVWYMIQNDLIHGWGLDF++RKCVEPAHEKIGVVDAQWIVHQ LPSLGSQGET+NGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGID SNSAK
Subjt:  WRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSAK

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G13000.1 Protein of unknown function (DUF707) (e value: 1.8e-162; % identity: 66.58)
Query:  RSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS-----YTRNGSGDSNNSQNLNGTS-EIWVPSNPRGAE
        R S  +KPND MRLII TFVG+ +GF +GISFPTLSLTKL  PSG++P +D  Y  D    +S     +T +  G  + + + +    +IWVPSNPRGAE
Subjt:  RSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSS-----YTRNGSGDSNNSQNLNGTS-EIWVPSNPRGAE

Query:  RLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAKRFLHP
         L PGI+A ESD YL RLWG P ED+  KP YL+ FTVG+ Q+ N+D  VKKFS++FTI+LFHYDGRTTEW+E EWSKRAIHVS  KQ+KWWYAKRFLHP
Subjt:  RLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAKRFLHP

Query:  DIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREAWRCVW
        DIVAPYDY+F+WDEDLG+ENFD EEYI+L++KHGLEISQP +E  + +TW++TK++   EVHKD  E+PG C +P+LPPCAAF+EIMAPVFSR+AWRCVW
Subjt:  DIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREAWRCVW

Query:  YMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSL
        +MIQNDL+HGWGLDFA+RKCVEPAHEKIGVVD+QWI+HQ LPSLGSQGE ++GKA WQGVR+RC++EWTMFQSR+A++EK Y K +
Subjt:  YMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSL

AT1G67850.1 Protein of unknown function (DUF707) (e value: 7.4e-172; % identity: 68.75)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDL-------KSGSSYTRNGSGDSNNSQNLNGTSEIWVPS
        MGI  RSS++RK  D M++I   F GV  GFLIGISFP+LS+TK+ +P+  +P  D  Y  +        +S  +++ +   DS++S  ++  S+IWVPS
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDL-------KSGSSYTRNGSGDSNNSQNLNGTSEIWVPS

Query:  NPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYA
        NPRGAE L PG+VAAESD YL RLWG P+EDL ++P YL TFTVG NQ+ NID  VKKFSENFTI+LFHYDGR TEWDEFEWSK AIH+S RKQ+KWWYA
Subjt:  NPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYA

Query:  KRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSRE
        KRFLHPDIVA YDYIF+WDEDLGVE+F+AEEY+K+V+KHGLEISQPGLEP +GLTWQMTK+R  +EVHK T ERPGWC++P+LPPCAAFVEIMAPVFSR 
Subjt:  KRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSRE

Query:  AWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSA
        AWRCVW++IQNDL+HGWGLDFA+R+CVEPAHEKIGVVD+QW+VHQ  PSLG+QGE  +GKAPWQGVR+RC+KEWTMFQSR+ANAEK YFKSL ++ S+++
Subjt:  AWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSA

AT1G67850.2 Protein of unknown function (DUF707) (e value: 7.4e-172; % identity: 68.75)
Query:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDL-------KSGSSYTRNGSGDSNNSQNLNGTSEIWVPS
        MGI  RSS++RK  D M++I   F GV  GFLIGISFP+LS+TK+ +P+  +P  D  Y  +        +S  +++ +   DS++S  ++  S+IWVPS
Subjt:  MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDL-------KSGSSYTRNGSGDSNNSQNLNGTSEIWVPS

Query:  NPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYA
        NPRGAE L PG+VAAESD YL RLWG P+EDL ++P YL TFTVG NQ+ NID  VKKFSENFTI+LFHYDGR TEWDEFEWSK AIH+S RKQ+KWWYA
Subjt:  NPRGAERLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYA

Query:  KRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSRE
        KRFLHPDIVA YDYIF+WDEDLGVE+F+AEEY+K+V+KHGLEISQPGLEP +GLTWQMTK+R  +EVHK T ERPGWC++P+LPPCAAFVEIMAPVFSR 
Subjt:  KRFLHPDIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSRE

Query:  AWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSA
        AWRCVW++IQNDL+HGWGLDFA+R+CVEPAHEKIGVVD+QW+VHQ  PSLG+QGE  +GKAPWQGVR+RC+KEWTMFQSR+ANAEK YFKSL ++ S+++
Subjt:  AWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSA

AT3G27470.1 Protein of unknown function (DUF707) (e value: 2.3e-165; % identity: 68.7)
Query:  RKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSSYTR-NGSGDSNNSQNLN----------GTSEIWVPSNPRGAE
        R+P+  MRL++ +F GV +GFL+GI+FPTL+LTK+ +PS L P ID  Y  D  S  S  R  GS  S     L             ++IWV +NPRGAE
Subjt:  RKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSSYTR-NGSGDSNNSQNLN----------GTSEIWVPSNPRGAE

Query:  RLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAKRFLHP
        RL P IV  ESD YL RLWG+PNEDLT K  YLVTFTVGY+QR+NID  +KKFS+NF+I+LFHYDGR +EW+EFEWSKRAIHVS RKQ+KWWYAKRFLHP
Subjt:  RLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAKRFLHP

Query:  DIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREAWRCVW
        DIVAPY+YIF+WDEDLGVE+FD+E+Y+ +V+KHGLEISQPGLEP  GLTW+MTKKRD  EVHK   ER GWCT+PNLPPCAAFVEIMAPVFSR+AWRCVW
Subjt:  DIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREAWRCVW

Query:  YMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS
        +MIQNDLIHGWGLDFAVRKCV+ AHEKIGVVDAQWI+HQG+PSLG+QG+ E GK PW+GVRERCR+EWTMFQ RL +AEKAYF++    +++S
Subjt:  YMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS

AT3G27470.2 Protein of unknown function (DUF707) (e value: 2.3e-165; % identity: 68.7)
Query:  RKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSSYTR-NGSGDSNNSQNLN----------GTSEIWVPSNPRGAE
        R+P+  MRL++ +F GV +GFL+GI+FPTL+LTK+ +PS L P ID  Y  D  S  S  R  GS  S     L             ++IWV +NPRGAE
Subjt:  RKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSSYTR-NGSGDSNNSQNLN----------GTSEIWVPSNPRGAE

Query:  RLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAKRFLHP
        RL P IV  ESD YL RLWG+PNEDLT K  YLVTFTVGY+QR+NID  +KKFS+NF+I+LFHYDGR +EW+EFEWSKRAIHVS RKQ+KWWYAKRFLHP
Subjt:  RLAPGIVAAESDLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAKRFLHP

Query:  DIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREAWRCVW
        DIVAPY+YIF+WDEDLGVE+FD+E+Y+ +V+KHGLEISQPGLEP  GLTW+MTKKRD  EVHK   ER GWCT+PNLPPCAAFVEIMAPVFSR+AWRCVW
Subjt:  DIVAPYDYIFMWDEDLGVENFDAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREAWRCVW

Query:  YMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS
        +MIQNDLIHGWGLDFAVRKCV+ AHEKIGVVDAQWI+HQG+PSLG+QG+ E GK PW+GVRERCR+EWTMFQ RL +AEKAYF++    +++S
Subjt:  YMIQNDLIHGWGLDFAVRKCVEPAHEKIGVVDAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNS


Sequences
CDS sequence
ATGGGAATCCTTTATCGCAGTTCTGTGAATAGAAAGCCAAACGACAGCATGAGGCTTATTATAATAACTTTTGTGGGAGTATTTTTGGGCTTTTTGATAGGAATATCTTT
CCCAACATTATCACTAACAAAGCTAGGTATCCCCTCTGGCCTTATCCCGAAAATTGACCAGATGTATAATACTGACCTAAAATCAGGCTCATCTTATACAAGGAATGGTA
GTGGAGACTCGAATAATTCTCAAAATCTGAATGGTACCTCAGAGATTTGGGTTCCATCAAATCCTAGGGGAGCAGAAAGACTAGCCCCAGGAATTGTTGCAGCTGAGTCA
GACTTATATCTGCACAGATTGTGGGGTAATCCCAATGAGGACTTAACTACGAAACCAAATTACCTTGTAACCTTTACTGTCGGTTATAATCAGAGAGAGAACATTGATAA
AGCAGTAAAAAAGTTTTCAGAGAACTTCACAATTCTTCTGTTTCATTATGATGGCCGAACGACAGAATGGGATGAATTTGAATGGTCGAAACGAGCAATTCACGTGAGTG
CTCGAAAGCAGAGTAAATGGTGGTATGCCAAGCGATTTTTGCACCCTGACATTGTGGCACCTTATGACTACATCTTTATGTGGGATGAAGACCTGGGTGTTGAGAACTTC
GATGCTGAAGAATATATAAAGTTGGTGAGAAAGCATGGCTTGGAAATTTCACAGCCTGGTTTGGAACCTACCAGAGGGTTAACATGGCAGATGACAAAGAAAAGGGATGG
TCTTGAAGTTCATAAAGATACGGTGGAAAGGCCAGGATGGTGCACTGAGCCGAATTTGCCTCCCTGTGCCGCTTTTGTAGAGATCATGGCTCCAGTTTTCTCACGAGAAG
CTTGGCGTTGTGTCTGGTATATGATTCAAAACGATTTAATCCATGGTTGGGGTCTTGATTTTGCCGTTAGAAAATGTGTAGAGCCTGCACATGAAAAGATTGGAGTCGTA
GACGCCCAATGGATCGTTCATCAAGGTCTTCCCTCGCTCGGGAGCCAGGGAGAAACTGAAAATGGGAAAGCGCCATGGCAAGGGGTGAGAGAAAGATGTCGGAAGGAGTG
GACAATGTTTCAAAGTCGGTTAGCAAATGCGGAGAAAGCGTATTTCAAGTCATTGGGAATTGATTCTTCAAATTCAGCCAAACAGTAG
mRNA sequence
AAGAACCAAAAACCCCGTTTCTGTGCCGCTACAGTAACAGCAGAACCTTAATTAGAAACGCCCTCAAGGTTTAACATGAAAATAAATATGAAAACTTCCAATCCAATTTG
TTGAGGCCGATTCCTTCGTCTTCCTCCTCTACTGAAATTTTTCTCTGTTGCTTTTGTTCATCTCCAACTTTTTTTCACACCGTAAGGCCAGGCGATCTCAGAAGAAGGAT
TTCTTTTCACTTTTGGAACTGGTTTTCCTCGTGATTCAGAGGTCGCGGAACAGATCTTGTCATATTTGATTCTTTATATCTGAAAACGTAGACTGGTGAACGTTTTCTAG
TAATCAATGTTCTAAGCACCTGCGACTGATTGCTGTTCAAGTTGATTCTTCATTTTTAAATCTTCAAGTTACTTCAACTTTAAGACCATTCTCACTTTGTACATCTGTGA
CTGAGATCATCTTTACAACTTGGAGGATTGTAGATAGCATGGGAATCCTTTATCGCAGTTCTGTGAATAGAAAGCCAAACGACAGCATGAGGCTTATTATAATAACTTTT
GTGGGAGTATTTTTGGGCTTTTTGATAGGAATATCTTTCCCAACATTATCACTAACAAAGCTAGGTATCCCCTCTGGCCTTATCCCGAAAATTGACCAGATGTATAATAC
TGACCTAAAATCAGGCTCATCTTATACAAGGAATGGTAGTGGAGACTCGAATAATTCTCAAAATCTGAATGGTACCTCAGAGATTTGGGTTCCATCAAATCCTAGGGGAG
CAGAAAGACTAGCCCCAGGAATTGTTGCAGCTGAGTCAGACTTATATCTGCACAGATTGTGGGGTAATCCCAATGAGGACTTAACTACGAAACCAAATTACCTTGTAACC
TTTACTGTCGGTTATAATCAGAGAGAGAACATTGATAAAGCAGTAAAAAAGTTTTCAGAGAACTTCACAATTCTTCTGTTTCATTATGATGGCCGAACGACAGAATGGGA
TGAATTTGAATGGTCGAAACGAGCAATTCACGTGAGTGCTCGAAAGCAGAGTAAATGGTGGTATGCCAAGCGATTTTTGCACCCTGACATTGTGGCACCTTATGACTACA
TCTTTATGTGGGATGAAGACCTGGGTGTTGAGAACTTCGATGCTGAAGAATATATAAAGTTGGTGAGAAAGCATGGCTTGGAAATTTCACAGCCTGGTTTGGAACCTACC
AGAGGGTTAACATGGCAGATGACAAAGAAAAGGGATGGTCTTGAAGTTCATAAAGATACGGTGGAAAGGCCAGGATGGTGCACTGAGCCGAATTTGCCTCCCTGTGCCGC
TTTTGTAGAGATCATGGCTCCAGTTTTCTCACGAGAAGCTTGGCGTTGTGTCTGGTATATGATTCAAAACGATTTAATCCATGGTTGGGGTCTTGATTTTGCCGTTAGAA
AATGTGTAGAGCCTGCACATGAAAAGATTGGAGTCGTAGACGCCCAATGGATCGTTCATCAAGGTCTTCCCTCGCTCGGGAGCCAGGGAGAAACTGAAAATGGGAAAGCG
CCATGGCAAGGGGTGAGAGAAAGATGTCGGAAGGAGTGGACAATGTTTCAAAGTCGGTTAGCAAATGCGGAGAAAGCGTATTTCAAGTCATTGGGAATTGATTCTTCAAA
TTCAGCCAAACAGTAGGACCCAAATCACAGATTCTATATACATCCATCCATTATTAGCCTTAATTCCCTTACCATTTCTCACTGAGTTGTGTAAATTCGTTAAACATTAT
AATAATAATTACGTTTAGAATTTATTTCATTTTCACGAATGAACTTTTAGCTCAAACGTTTTGCGATTTTGAATCACTTTTAGGCTGGAAAAACGAGTCAACTCGAGACA
TTCTTAAAGATGAAAATTGGTATATATGCAATTTGGAACAAAGAAAGGGCAGTTGTCAAAGATGGAAGATCCATAAACAGAAAGAAGCTTGCTTGTAATGGTATATATTC
ATATTCAAGAGTATGCTACAAGACGATCACACAGCTGTTGCTCTTTTGTCCATTGCCAGAATGACAATTAAGGGGATAAGTATTTATAGGATAAATACAACATGTCCTAT
ACTTACATATAGGTTCATGAAATTAGTACTTACAGGATAAAGACAACATGTCCTATACTTACATATAGGTTCATAAAATTATTTTACTTTATTATTGGGCAAAAACAATG
ATGAAAAACATTCTATTTACTAAAAAGAAAAAGGAAAAGAATTACTTTTTGTCCTAAA
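
The CDS listed in this record appears as one contiguous stretch inside the mRNA sequence, so the flanking untranslated regions can be measured by locating it. The sketch below assumes both sequences have been joined into single strings with line breaks removed; `utr_lengths` is an illustrative name, and the short sequences in the example are toy data, not taken from this gene.

```python
from typing import Tuple

def utr_lengths(mrna: str, cds: str) -> Tuple[int, int]:
    """Return (5' UTR length, 3' UTR length) for a CDS found inside an mRNA."""
    mrna, cds = mrna.upper(), cds.upper()
    start = mrna.find(cds)
    if start == -1:
        raise ValueError("CDS not found as a contiguous substring of the mRNA")
    return start, len(mrna) - (start + len(cds))

# Toy example: 6 nt upstream, a 9 nt CDS, 5 nt downstream.
five_prime, three_prime = utr_lengths("AAGAACATGGGATAGGACCC", "ATGGGATAG")
print(five_prime, three_prime)
```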
Protein sequence
MGILYRSSVNRKPNDSMRLIIITFVGVFLGFLIGISFPTLSLTKLGIPSGLIPKIDQMYNTDLKSGSSYTRNGSGDSNNSQNLNGTSEIWVPSNPRGAERLAPGIVAAES
DLYLHRLWGNPNEDLTTKPNYLVTFTVGYNQRENIDKAVKKFSENFTILLFHYDGRTTEWDEFEWSKRAIHVSARKQSKWWYAKRFLHPDIVAPYDYIFMWDEDLGVENF
DAEEYIKLVRKHGLEISQPGLEPTRGLTWQMTKKRDGLEVHKDTVERPGWCTEPNLPPCAAFVEIMAPVFSREAWRCVWYMIQNDLIHGWGLDFAVRKCVEPAHEKIGVV
DAQWIVHQGLPSLGSQGETENGKAPWQGVRERCRKEWTMFQSRLANAEKAYFKSLGIDSSNSAKQ
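
The protein sequence should be recoverable by translating the CDS listed in this record. The following is a minimal sketch using the standard genetic code, which is assumed to apply to this nuclear gene; the names `CODON_TABLE` and `translate` are illustrative. Only the first 30 and last 12 nucleotides of the CDS are used here, so the block stays short.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)

# First 30 nt and last 12 nt of the CDS in this record.
print(translate("ATGGGAATCCTTTATCGCAGTTCTGTGAAT"))  # MGILYRSSVN
print(translate("GCCAAACAGTAG"))                    # AKQ
```

Both fragments agree with the start (`MGILYRSSVN`) and end (`AKQ`) of the protein sequence; translating the full CDS after joining its lines gives a complete check.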