CuGenDBv2

ClCG09G019400 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG09G019400
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: Protein of unknown function (DUF707)
Genome location: CG_Chr09:36616131..36620596
RNA-Seq Expression: ClCG09G019400
Synteny: ClCG09G019400
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007877 - Protein of unknown function DUF707
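
The genome location field follows a `sequence:start..end` pattern. As a minimal sketch, the Python below splits such a string into its parts and computes the span length. The function names (`parse_location`, `span_length`) are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

# Assumed layout of the location field: <sequence>:<start>..<end>
LOCATION_RE = re.compile(r"^(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return match["seqid"], int(match["start"]), int(match["end"])


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return abs(end - start) + 1


seqid, start, end = parse_location("CG_Chr09:36616131..36620596")
print(seqid, span_length(start, end))
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 4466 bp.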


Homology
GenBank top hits | e value | % identity
XP_004142781.1 uncharacterized protein LOC101219968 [Cucumis sativus] | 1.3e-232 | 95.06
Query:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW
        ++ GNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRG S QFSLNETKIW
Subjt:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW

Query:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW
        VPTNPRGAERLPPGIVE ESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFT++LFHYDGRASEWEDLEWSKRAIHVSVYKQTKW
Subjt:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW

Query:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
        WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
Subjt:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF

Query:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP
        SRDAWRCVWHLIQNDLVHGWGLDFALRKCV+PAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCR+EWEIFRSRLADAEKAYY G+GIDPP
Subjt:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP

Query:  NSTQV
        NST+V
Subjt:  NSTQV

XP_008458831.1 PREDICTED: uncharacterized protein LOC103498119 [Cucumis melo] | 2.3e-234 | 95.56
Query:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW
        ++ GNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW
Subjt:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW

Query:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW
        VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVG DQKKNIDAAVKKFSENFT++LFHYDGRASEWEDLEWSKRAIHVSVYKQTKW
Subjt:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW

Query:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
        WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRG+SEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
Subjt:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF

Query:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP
        SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCR+EWEIFRSRLADAEKAYY+G+GIDPP
Subjt:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP

Query:  NSTQV
        NST+V
Subjt:  NSTQV

XP_022993333.1 uncharacterized protein LOC111489373 [Cucurbita maxima] | 6.0e-230 | 94.31
Query:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV
        + GNMRK NEIMRI+VTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSL+E KIWV
Subjt:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV

Query:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW
        PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFT++LFHYDGRASEWEDLEWSKRAIHVSV KQTKWW
Subjt:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW

Query:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
        YAKRFLHPDIVASYDYIFVWDEDLGVEHF+AE+YIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
Subjt:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS

Query:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN
        RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGR PWEGVRERCR+EW IF+SRLADAEKAYYQGMGIDPPN
Subjt:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN

Query:  STQV
        ST V
Subjt:  STQV

XP_023550636.1 uncharacterized protein LOC111808719 [Cucurbita pepo subsp. pepo] | 7.8e-230 | 94.31
Query:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV
        + GNMRK NEIMRI+VTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSL FSL+E KIWV
Subjt:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV

Query:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW
        PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFT++LFHYDGRASEWEDLEWSKRAIHVSV KQTKWW
Subjt:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW

Query:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
        YAKRFLHPDIVASYDYIFVWDEDLGVEHF+AE+YIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
Subjt:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS

Query:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN
        RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGR PWEGVRERCR+EW IF+SRLADAEKAYYQGMGIDPPN
Subjt:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN

Query:  STQV
        STQV
Subjt:  STQV

XP_038891060.1 uncharacterized protein LOC120080472 [Benincasa hispida] | 8.9e-234 | 95.56
Query:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW
        ++ GNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLS EAFLNAWSSLKGNRGSSLQFSLNETKIW
Subjt:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW

Query:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW
        VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTV+LFHYDGRASEWEDLEWS RAIHVSVYKQTKW
Subjt:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW

Query:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
        WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRG+SEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
Subjt:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF

Query:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP
        SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCR+EWEIFRSRLADAEKAYY+G+GIDPP
Subjt:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP

Query:  NSTQV
        NST+V
Subjt:  NSTQV

TrEMBL top hits | e value | % identity
A0A1S3C8W3 uncharacterized protein LOC103498119 | 1.1e-234 | 95.56
Query:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW
        ++ GNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW
Subjt:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIW

Query:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW
        VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVG DQKKNIDAAVKKFSENFT++LFHYDGRASEWEDLEWSKRAIHVSVYKQTKW
Subjt:  VPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKW

Query:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
        WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRG+SEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF
Subjt:  WYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVF

Query:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP
        SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCR+EWEIFRSRLADAEKAYY+G+GIDPP
Subjt:  SRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPP

Query:  NSTQV
        NST+V
Subjt:  NSTQV

A0A6J1BZ79 uncharacterized protein LOC111006583 isoform X1 | 7.9e-228 | 92.63
Query:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG-NRGSSLQFSLNETKI
        ++ GNMRKPNEIMRILVTTFVGGVFGFFLGVSFPT+SLSQ       LNFPSSLIPSIDLTYIEDKYSGLS EAFLNAWSSLKG NRGSS+QF  NETKI
Subjt:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG-NRGSSLQFSLNETKI

Query:  WVPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTK
        WVPTNPRGAERLPPGIVESESDFNLRRLWG+P+EDL IKPKYLVTFTVGF+QKKNIDAAVKKFSENFT+VLFHYDGRASEWEDLEWSKRAIHVSVYKQTK
Subjt:  WVPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTK

Query:  WWYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATV
        WWYAKRFLHPDIVASYDY+FVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRG+SEVHKETEEKPGWCTDPHLPPCAAFVEIMATV
Subjt:  WWYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATV

Query:  FSRDAWRCVWHLIQNDLVHGWGLDFALRKCVH-PAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGID
        FSRDAWRCVWHLIQNDLVHGWGLDFALRKCVH PAHEKIGVVD+QWIVHQSVPSLGNQGKAENGRAPWEGVRERC+REWEIF+SRLADAE AYY GMGID
Subjt:  FSRDAWRCVWHLIQNDLVHGWGLDFALRKCVH-PAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGID

Query:  PPNSTQV
        PPNST+V
Subjt:  PPNSTQV

A0A6J1C1L2 uncharacterized protein LOC111006583 isoform X2 | 3.2e-229 | 92.86
Query:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG-NRGSSLQFSLNETKI
        ++ GNMRKPNEIMRILVTTFVGGVFGFFLGVSFPT+SLSQ       LNFPSSLIPSIDLTYIEDKYSGLS EAFLNAWSSLKG NRGSS+QF  NETKI
Subjt:  VKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG-NRGSSLQFSLNETKI

Query:  WVPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTK
        WVPTNPRGAERLPPGIVESESDFNLRRLWG+P+EDL IKPKYLVTFTVGF+QKKNIDAAVKKFSENFT+VLFHYDGRASEWEDLEWSKRAIHVSVYKQTK
Subjt:  WVPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTK

Query:  WWYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATV
        WWYAKRFLHPDIVASYDY+FVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRG+SEVHKETEEKPGWCTDPHLPPCAAFVEIMATV
Subjt:  WWYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATV

Query:  FSRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDP
        FSRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVD+QWIVHQSVPSLGNQGKAENGRAPWEGVRERC+REWEIF+SRLADAE AYY GMGIDP
Subjt:  FSRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDP

Query:  PNSTQV
        PNST+V
Subjt:  PNSTQV

A0A6J1FF00 uncharacterized protein LOC111444878 | 1.4e-229 | 94.06
Query:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV
        + GNMRK NEIMRI+VTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSL FSL+E KIWV
Subjt:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV

Query:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW
        PTNPRGAERLPPGIVE ESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFT++LFHYDGRASEWEDLEWSKRAIHVSV KQTKWW
Subjt:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW

Query:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
        YAKRFLHPDIVASYDYIFVWDEDLGVEHF+AE+YIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
Subjt:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS

Query:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN
        RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGR PWEGVRERCR+EW IF+SRLADAEKAYYQGMGIDPPN
Subjt:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN

Query:  STQV
        STQV
Subjt:  STQV

A0A6J1K1X0 uncharacterized protein LOC111489373 | 2.9e-230 | 94.31
Query:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV
        + GNMRK NEIMRI+VTTFVGGVFGFFLGVSFPTLSLSQ       LNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSL+E KIWV
Subjt:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV

Query:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW
        PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFT++LFHYDGRASEWEDLEWSKRAIHVSV KQTKWW
Subjt:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW

Query:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
        YAKRFLHPDIVASYDYIFVWDEDLGVEHF+AE+YIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
Subjt:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS

Query:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN
        RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGR PWEGVRERCR+EW IF+SRLADAEKAYYQGMGIDPPN
Subjt:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPN

Query:  STQV
        ST V
Subjt:  STQV

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
AT1G13000.1 Protein of unknown function (DUF707) | 2.4e-168 | 66.67
Query:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV
        +C N +KPN+ MR+++ TFVG V GFFLG+SFPTLSL++       LNFPS ++PS+D+TY+ED+    S+E  L+ WSS +G    +        KIWV
Subjt:  KCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWV

Query:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW
        P+NPRGAE L PGI+  ESD+ LRRLWG+P ED+ +KPKYL+ FTVGF QK N+DA VKKFS++FT+VLFHYDGR +EWE+LEWSKRAIHVSV KQTKWW
Subjt:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW

Query:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
        YAKRFLHPDIVA YDY+F+WDEDLG+E+F+ EEYI+L++KHGLEISQP +E  + +TW++TKR+   EVHK+ +EKPG C DPHLPPCAAF+EIMA VFS
Subjt:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS

Query:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAY
        RDAWRCVWH+IQNDLVHGWGLDFALRKCV PAHEKIGVVD+QWI+HQS+PSLG+QG+A++G+A W+GVR+RC+REW +F+SR+A +EK Y
Subjt:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAY

AT1G67850.1 Protein of unknown function (DUF707) | 3.1e-176 | 69.85
Query:  RKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWVPTNPR
        RK  + M+I+ T F G  FGF +G+SFP+LS+++V       + P++ +PS DL+YIE+K S ++T     AWSS KGN  SS    ++++KIWVP+NPR
Subjt:  RKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWVPTNPR

Query:  GAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWWYAKRF
        GAE LPPG+V +ESDF LRRLWG+P EDL  +P+YL TFTVG +QK NIDA VKKFSENFT+VLFHYDGR +EW++ EWSK AIH+SV KQTKWWYAKRF
Subjt:  GAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWWYAKRF

Query:  LHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFSRDAWR
        LHPDIVA YDYIFVWDEDLGVEHFNAEEY+K+V+KHGLEISQPGLEPNQGLTWQMTKRRGD EVHK TEE+PGWC+DPHLPPCAAFVEIMA VFSR+AWR
Subjt:  LHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFSRDAWR

Query:  CVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGID-PPNST
        CVWH+IQNDLVHGWGLDFALR+CV PAHEKIGVVD+QW+VHQS PSLGNQG+A +G+APW+GVR+RC++EW +F+SR+A+AEK Y++ + ++   NST
Subjt:  CVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGID-PPNST

AT1G67850.2 Protein of unknown function (DUF707) | 3.1e-176 | 69.85
Query:  RKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWVPTNPR
        RK  + M+I+ T F G  FGF +G+SFP+LS+++V       + P++ +PS DL+YIE+K S ++T     AWSS KGN  SS    ++++KIWVP+NPR
Subjt:  RKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRGSSLQFSLNETKIWVPTNPR

Query:  GAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWWYAKRF
        GAE LPPG+V +ESDF LRRLWG+P EDL  +P+YL TFTVG +QK NIDA VKKFSENFT+VLFHYDGR +EW++ EWSK AIH+SV KQTKWWYAKRF
Subjt:  GAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWWYAKRF

Query:  LHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFSRDAWR
        LHPDIVA YDYIFVWDEDLGVEHFNAEEY+K+V+KHGLEISQPGLEPNQGLTWQMTKRRGD EVHK TEE+PGWC+DPHLPPCAAFVEIMA VFSR+AWR
Subjt:  LHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFSRDAWR

Query:  CVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGID-PPNST
        CVWH+IQNDLVHGWGLDFALR+CV PAHEKIGVVD+QW+VHQS PSLGNQG+A +G+APW+GVR+RC++EW +F+SR+A+AEK Y++ + ++   NST
Subjt:  CVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGID-PPNST

AT3G27470.1 Protein of unknown function (DUF707) | 1.0e-174 | 69.39
Query:  MRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG----NRGSSLQFSLNETKIWV
        MR+P+++MR+L+T+F G + GF +G++FPTL+L++       +N PS+L PSIDL YIEDKYS +S +    +WSS KG    N      ++ N+TKIWV
Subjt:  MRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG----NRGSSLQFSLNETKIWV

Query:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW
         TNPRGAERLPP IV  ESDF LRRLWG P+EDL +K +YLVTFTVG+DQ+KNID  +KKFS+NF+++LFHYDGRASEWE+ EWSKRAIHVS+ KQTKWW
Subjt:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW

Query:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
        YAKRFLHPDIVA Y+YIF+WDEDLGVEHF++E+Y+ +V+KHGLEISQPGLEP +GLTW+MTK+R D+EVHK  EE+ GWCTDP+LPPCAAFVEIMA VFS
Subjt:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS

Query:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQ
        R AWRCVWH+IQNDL+HGWGLDFA+RKCV  AHEKIGVVDAQWI+HQ VPSLGNQG+ E G+ PWEGVRERCRREW +F+ RL DAEKAY++
Subjt:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQ

AT3G27470.2 Protein of unknown function (DUF707) | 1.0e-174 | 69.39
Query:  MRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG----NRGSSLQFSLNETKIWV
        MR+P+++MR+L+T+F G + GF +G++FPTL+L++       +N PS+L PSIDL YIEDKYS +S +    +WSS KG    N      ++ N+TKIWV
Subjt:  MRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKG----NRGSSLQFSLNETKIWV

Query:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW
         TNPRGAERLPP IV  ESDF LRRLWG P+EDL +K +YLVTFTVG+DQ+KNID  +KKFS+NF+++LFHYDGRASEWE+ EWSKRAIHVS+ KQTKWW
Subjt:  PTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHVSVYKQTKWW

Query:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS
        YAKRFLHPDIVA Y+YIF+WDEDLGVEHF++E+Y+ +V+KHGLEISQPGLEP +GLTW+MTK+R D+EVHK  EE+ GWCTDP+LPPCAAFVEIMA VFS
Subjt:  YAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMATVFS

Query:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQ
        R AWRCVWH+IQNDL+HGWGLDFA+RKCV  AHEKIGVVDAQWI+HQ VPSLGNQG+ E G+ PWEGVRERCRREW +F+ RL DAEKAY++
Subjt:  RDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQ
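
The alignment blocks above appear to use the BLAST-style three-line layout, in which the unlabeled middle line marks each column: a letter for an identical residue, `+` for a conservative substitution, and a space for anything else. Assuming that convention, the sketch below counts identities in a single block. The function name `block_stats` is hypothetical, and a per-block figure is not the same as the overall % identity reported for each hit, which is computed over the whole alignment.

```python
def block_stats(query, midline):
    """Count identical and conservative columns in one alignment block.

    Assumes the BLAST-style midline convention: a letter marks an identical
    residue, '+' a conservative substitution, a space anything else.
    Both strings must be given without the 'Query:' label or its padding.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    identical = sum(1 for ch in midline if ch.isalpha())
    positive = identical + midline.count("+")
    return identical, positive, len(query)


# Final block of the first GenBank hit: query NSTQV, midline NST+V.
identical, positive, columns = block_stats("NSTQV", "NST+V")
print(f"{identical}/{columns} identical ({100 * identical / columns:.1f}%)")
```

To approximate a whole-alignment figure, the counts from every block of a hit would be summed before dividing.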


Sequences
CDS sequence
ATGGGCCATATGATGTGCAAAGCGGCAATGGCATCGTCCCCTCCAACAGCTCACGTAAAGTGTGGAAACATGAGAAAACCTAATGAGATAATGAGGATTCTAGTT
ACAACATTTGTTGGAGGTGTTTTTGGTTTCTTTTTAGGAGTATCCTTTCCCACGCTCTCATTATCCCAGGTACAAAATTTTATTTTCGGCCTAAATTTCCCATCC
AGCCTGATTCCTTCTATTGATCTCACTTACATTGAGGACAAGTACTCAGGCCTCTCCACTGAAGCTTTCTTGAATGCTTGGTCTTCTTTGAAGGGTAATAGAGGC
AGCTCCTTACAATTTTCACTGAACGAGACAAAGATATGGGTTCCTACAAATCCTCGAGGAGCTGAAAGACTACCACCTGGTATTGTTGAGTCTGAATCTGATTTT
AACCTTCGGCGTCTGTGGGGTATGCCAAGCGAGGATTTGGCCATCAAACCTAAGTATCTGGTAACATTTACCGTTGGTTTTGATCAGAAAAAGAATATTGATGCA
GCAGTTAAAAAGTTCTCAGAGAACTTCACAGTCGTGTTGTTTCACTATGATGGACGAGCAAGTGAATGGGAAGATCTTGAGTGGTCGAAGCGGGCTATCCACGTG
AGTGTATACAAGCAAACTAAATGGTGGTATGCTAAACGTTTTCTGCATCCTGACATCGTGGCATCCTATGACTACATATTTGTGTGGGACGAGGATCTTGGAGTA
GAGCATTTTAACGCAGAAGAATACATAAAACTTGTGAGAAAACATGGCTTAGAGATCTCGCAACCTGGTTTGGAACCAAATCAAGGGTTAACGTGGCAGATGACT
AAAAGAAGAGGTGACAGTGAAGTTCACAAGGAGACAGAGGAGAAACCTGGTTGGTGCACTGATCCACATCTTCCACCTTGTGCAGCTTTCGTTGAAATCATGGCA
ACTGTATTTTCTCGGGATGCATGGCGCTGTGTTTGGCATTTGATTCAAAATGACTTGGTTCATGGTTGGGGTCTCGATTTTGCATTAAGAAAATGTGTTCATCCC
GCTCATGAGAAAATAGGGGTCGTAGATGCTCAGTGGATTGTGCATCAAAGTGTTCCTTCTCTTGGGAACCAGGGAAAAGCAGAAAATGGAAGGGCACCATGGGAA
GGGGTGAGAGAGAGATGTAGAAGAGAATGGGAAATTTTTAGGAGCCGGTTGGCTGATGCAGAGAAGGCGTACTATCAAGGAATGGGGATTGATCCGCCAAATTCA
ACTCAAGTGTAG
mRNA sequence
ATGGGCCATATGATGTGCAAAGCGGCAATGGCATCGTCCCCTCCAACAGCTCACGTAAAGTGTGGAAACATGAGAAAACCTAATGAGATAATGAGGATTCTAGTT
ACAACATTTGTTGGAGGTGTTTTTGGTTTCTTTTTAGGAGTATCCTTTCCCACGCTCTCATTATCCCAGGTACAAAATTTTATTTTCGGCCTAAATTTCCCATCC
AGCCTGATTCCTTCTATTGATCTCACTTACATTGAGGACAAGTACTCAGGCCTCTCCACTGAAGCTTTCTTGAATGCTTGGTCTTCTTTGAAGGGTAATAGAGGC
AGCTCCTTACAATTTTCACTGAACGAGACAAAGATATGGGTTCCTACAAATCCTCGAGGAGCTGAAAGACTACCACCTGGTATTGTTGAGTCTGAATCTGATTTT
AACCTTCGGCGTCTGTGGGGTATGCCAAGCGAGGATTTGGCCATCAAACCTAAGTATCTGGTAACATTTACCGTTGGTTTTGATCAGAAAAAGAATATTGATGCA
GCAGTTAAAAAGTTCTCAGAGAACTTCACAGTCGTGTTGTTTCACTATGATGGACGAGCAAGTGAATGGGAAGATCTTGAGTGGTCGAAGCGGGCTATCCACGTG
AGTGTATACAAGCAAACTAAATGGTGGTATGCTAAACGTTTTCTGCATCCTGACATCGTGGCATCCTATGACTACATATTTGTGTGGGACGAGGATCTTGGAGTA
GAGCATTTTAACGCAGAAGAATACATAAAACTTGTGAGAAAACATGGCTTAGAGATCTCGCAACCTGGTTTGGAACCAAATCAAGGGTTAACGTGGCAGATGACT
AAAAGAAGAGGTGACAGTGAAGTTCACAAGGAGACAGAGGAGAAACCTGGTTGGTGCACTGATCCACATCTTCCACCTTGTGCAGCTTTCGTTGAAATCATGGCA
ACTGTATTTTCTCGGGATGCATGGCGCTGTGTTTGGCATTTGATTCAAAATGACTTGGTTCATGGTTGGGGTCTCGATTTTGCATTAAGAAAATGTGTTCATCCC
GCTCATGAGAAAATAGGGGTCGTAGATGCTCAGTGGATTGTGCATCAAAGTGTTCCTTCTCTTGGGAACCAGGGAAAAGCAGAAAATGGAAGGGCACCATGGGAA
GGGGTGAGAGAGAGATGTAGAAGAGAATGGGAAATTTTTAGGAGCCGGTTGGCTGATGCAGAGAAGGCGTACTATCAAGGAATGGGGATTGATCCGCCAAATTCA
ACTCAAGTGTAGGAAAACGGTTGCAACTATTCTTTCCCTTTCTACTGGAAGAGATCTTTGAGTGGAGGCACTGGTAATCGTTTGTCTCTTGAATACACTTTATAG
GAGAATTAATACGAATGCTCTATCTTTTGAAGCTCCTAAAATACTTCCTGTGAGAGAAAAGTACGGCTACGTATGGATTAGATCATAGCTATATAATGGATCAAA
TCTACTGTTTATATAACATATATATTAAAAAATATTCCTTAAAAAGGATCTAGCTTAGACATCTCAGATCTCTATGGAAACCATGCCCTTGAAATCTTTATGTTG
G
Protein sequence
MGHMMCKAAMASSPPTAHVKCGNMRKPNEIMRILVTTFVGGVFGFFLGVSFPTLSLSQVQNFIFGLNFPSSLIPSIDLTYIEDKYSGLSTEAFLNAWSSLKGNRG
SSLQFSLNETKIWVPTNPRGAERLPPGIVESESDFNLRRLWGMPSEDLAIKPKYLVTFTVGFDQKKNIDAAVKKFSENFTVVLFHYDGRASEWEDLEWSKRAIHV
SVYKQTKWWYAKRFLHPDIVASYDYIFVWDEDLGVEHFNAEEYIKLVRKHGLEISQPGLEPNQGLTWQMTKRRGDSEVHKETEEKPGWCTDPHLPPCAAFVEIMA
TVFSRDAWRCVWHLIQNDLVHGWGLDFALRKCVHPAHEKIGVVDAQWIVHQSVPSLGNQGKAENGRAPWEGVRERCRREWEIFRSRLADAEKAYYQGMGIDPPNS
TQV
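
As a consistency check between the CDS and protein sequences listed above, the sketch below translates short CDS fragments with the standard genetic code. It is an illustration, not the pipeline the database uses: the names `CODON_TABLE` and `translate` are hypothetical, and the standard nuclear code is assumed. The two fragments are the first 24 and last 12 nucleotides of the CDS.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Trailing bases that do not fill a codon are ignored; characters other
    than A, C, G and T raise KeyError.
    """
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("ATGGGCCATATGATGTGCAAAGCG"))  # start of the CDS
print(translate("ACTCAAGTGTAG"))              # end of the CDS, including the stop codon
```

The two fragments translate to MGHMMCKA and TQV, matching the first and last residues of the protein sequence listed above.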