CuGenDBv2

Moc08g03600 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc08g03600
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: Protein of unknown function (DUF3537)
Genome location: chr8:2645640..2649020
RNA-Seq Expression: Moc08g03600
Synteny: Moc08g03600
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR021924 - Protein of unknown function DUF3537


Homology
GenBank top hits | e value | % identity | Alignment
KAG7020327.1 hypothetical protein SDJN02_17011, partial [Cucurbita argyrosperma subsp. argyrosperma] | 1.4e-183 | 82.48
Query:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL
        LL   YCRSKSLTSDELRNFRISLKW ALDHSS AGKFVSYFAFV+  VVVP V IV+V  P DDPIS NKLVQVPESGLAAIGFF+L RFFRRYGLRQL
Subjt:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL

Query:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV
        LFLDGLQEDS+YVQRGYARELDK+FRYLAYILFPSFFVELAHKI FF+TVKISFPGGFP+NSI+F+ VL +WVYRTGVFLLVC+LFRLTCELQILRLQGV
Subjt:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV

Query:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
         KLFE S+GSGS+A VIF+EH+RI+KQLWVTSHRYRFFII CL +ITVSQFVALLLVL SKT KNFFNSGD+VVCS VQLCGFFLC+ GAARITHRAQ I
Subjt:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI

Query:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV
         S+A+RWHMLVTS  +AAG + TKPP P P     DGDDTDSDSS  VLIS+S  +PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLFAFEFSLV
Subjt:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV

Query:  MWILSKVVVLS
        MWI+SKVVVLS
Subjt:  MWILSKVVVLS

XP_022132051.1 uncharacterized protein LOC111005015 [Momordica charantia] | 2.2e-232 | 100
Query:  MTSADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTL
        MTSADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTL
Subjt:  MTSADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTL

Query:  SRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRL
        SRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRL
Subjt:  SRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRL

Query:  TCELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLF
        TCELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLF
Subjt:  TCELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLF

Query:  GAARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLL
        GAARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLL
Subjt:  GAARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLL

Query:  HTLFAFEFSLVMWILSKVVVLS
        HTLFAFEFSLVMWILSKVVVLS
Subjt:  HTLFAFEFSLVMWILSKVVVLS

XP_022951427.1 uncharacterized protein LOC111454247 [Cucurbita moschata] | 7.2e-183 | 82.24
Query:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL
        LL   YCRSKSLTSDELRNFRISLKW ALDHSS AGKFVSYFAFV+  VVVP V IV+V  P DDPIS NKLVQVPESGLAAIGFF+L RFFRRYGLRQL
Subjt:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL

Query:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV
        LFLDGLQEDS+YVQRGYARELDK+FRYLAYILFPSFFVELAHKI FF+TVKISFPGGFP+NSI+F+ VL +WVYRTGVFLLVC+LFRLTCELQILRLQGV
Subjt:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV

Query:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
         KLFE S+GSGS+A VIF+EH+RI+KQLWVTSHRYRFFII CL +ITVSQFVALLL L SKT KNFFNSGD+VVCS VQLCGFFLC+ GAARITHRAQ I
Subjt:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI

Query:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV
         S+A+RWHMLVTS  +AAG + TKPP P P     DGDDTDSDSS  VLIS+S  +PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLFAFEFSLV
Subjt:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV

Query:  MWILSKVVVLS
        MWI+SKVVVLS
Subjt:  MWILSKVVVLS

XP_023002362.1 uncharacterized protein LOC111496225 [Cucurbita maxima] | 6.5e-184 | 82.24
Query:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL
        LL   YCRSKSLTSDELRNFRISLKWCALDHSS AGKFVSYFAFV+  VVVP V IV+V  P DDPIS NKLVQVPESGLAAIGFF+L RFFRRYGLRQL
Subjt:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL

Query:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV
        LFLDGLQEDS+YVQRGYARELDK+FRYLAY+LFPSFF ELAHKI FF+TVKISFPGGFP+NSI+F+ VL +WVYRTGVFLLVC+LFRLTCELQILRLQGV
Subjt:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV

Query:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
         KLFE S+GSGS+A VIF+EH+RI+KQLWVTSHRYRFFII CL +ITVSQFVALLLVL SKT+KNFFNSGD+VVCS VQLCGFFLC+ GAARITHRAQ I
Subjt:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI

Query:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV
         S+A+RWHMLVTS  +AAG + TKPP P P     DGDDTDSDSS  VLIS+S  +PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLFAFEFSLV
Subjt:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV

Query:  MWILSKVVVLS
        MWI+SKVVVLS
Subjt:  MWILSKVVVLS

XP_023537461.1 uncharacterized protein LOC111798502 [Cucurbita pepo subsp. pepo] | 1.6e-182 | 82.2
Query:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL
        LL   YCRSKSLTSDELRNFRISLKW ALDHSS AGKFVSYFAFV+  VVVP V IV+V  P DDPIS NKLVQVPESGLAAIGFF+L RFFRRYGLRQL
Subjt:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL

Query:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV
        LFLDGLQEDS+YVQRGYARELDK+FRYLAYILFPSFFVELAHKI FF+TVKISFPGGFP+NSI+F+ VL +WVYRTGVFLLVC+LFRLTCELQILRLQGV
Subjt:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV

Query:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
         KLFE S+GSGS+A VIF+EH+RI+KQLWVTSHRYRFFII CL +ITVSQFVALLLVL SKT KNFFNSGD+VVCS V+LCGFFLC+ GAARITHRAQ I
Subjt:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI

Query:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV
         S+A+RWHMLVTS  +AAG + TKPP P P     DGDDTDSDSS  VLIS+S  +PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLFAFEFSLV
Subjt:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV

Query:  MWILSKVVVL
        MWI+SKVVVL
Subjt:  MWILSKVVVL

TrEMBL top hits | e value | % identity | Alignment
A0A0A0LS65 Uncharacterized protein | 2.8e-172 | 76.74
Query:  ADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRF
        +  SN+ + L H  YCRSKS+TSD+LRN+RISLKWCALDHSS AGKF SY  F L  +VVPAV I+AV  P DDPISFNKLVQVPESGLAA+GF +L RF
Subjt:  ADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRF

Query:  FRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCE
        FRRYGLRQLLFL+GLQEDSVYVQ+GYAREL+K FR +AYILFPSFFVELAHKI FF+TVKI FP GFP+NSI+F+LVLG+WVYRTGVFLLVC+LFRLTCE
Subjt:  FRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCE

Query:  LQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAA
        LQILRLQGV KLFE S+GS S++ +IF+EH+RI+KQL +TSHRYRFFII CLLVITVSQFVALLLVLASKT KNFFNSGDVVVCS VQLCGFFLCLFGAA
Subjt:  LQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAA

Query:  RITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPP-----------TPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVY
        RITHRAQGI S+A+RWHMLVTS AS  GS+ TKPP             +   SG D DDTDSDSS  VLISV   + SSFQTRQALVSYL+HNNGGIT++
Subjt:  RITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPP-----------TPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVY

Query:  GFALDRGLLHTLFAFEFSLVMWILSKVVVL
        G+ALDRGLLHTLFAFEFSLVM ILSKVVVL
Subjt:  GFALDRGLLHTLFAFEFSLVMWILSKVVVL

A0A5A7VCT9 Uncharacterized protein | 1.2e-172 | 77.28
Query:  SNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRR
        SN+ + L H  YCRSKS+TSD+LRNFRISLKWCALDHSS  GKF SY AF    +VVPAV ++AV  P DDPISFNKLVQVPESGLAA+ F +L RFFRR
Subjt:  SNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRR

Query:  YGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQI
        YGLRQLLFL+GLQEDSVYVQ+GYAREL+K FR +AYILFPSFFVELAHKI FF+TVKISFP GFP+NSI+F+ VLG+WVYRTGVFLLVC+LFRLTCELQI
Subjt:  YGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQI

Query:  LRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARIT
        LRLQGV KLFE S+GS S++ +IF+EH+RI+KQL +TSHRYRFFII CLLVITVSQFVALLLVLASK  KNFFNSGDVVVCS VQLCGFFLCLFGAARIT
Subjt:  LRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARIT

Query:  HRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHP----------DGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFAL
        HRAQGI SIA+RWHMLVTS AS  GS+ TK P   P            SG D DDTDSDSS  VLISV   + SSFQTRQALVSYL+HNNGGIT++G+AL
Subjt:  HRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHP----------DGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFAL

Query:  DRGLLHTLFAFEFSLVMWILSKVVVLS
        DRGLLHTLFAFEFSLVMWILSKVVVLS
Subjt:  DRGLLHTLFAFEFSLVMWILSKVVVLS

A0A6J1BR58 uncharacterized protein LOC111005015 | 1.1e-232 | 100
Query:  MTSADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTL
        MTSADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTL
Subjt:  MTSADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTL

Query:  SRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRL
        SRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRL
Subjt:  SRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRL

Query:  TCELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLF
        TCELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLF
Subjt:  TCELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLF

Query:  GAARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLL
        GAARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLL
Subjt:  GAARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLL

Query:  HTLFAFEFSLVMWILSKVVVLS
        HTLFAFEFSLVMWILSKVVVLS
Subjt:  HTLFAFEFSLVMWILSKVVVLS

A0A6J1GHK9 uncharacterized protein LOC111454247 | 3.5e-183 | 82.24
Query:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL
        LL   YCRSKSLTSDELRNFRISLKW ALDHSS AGKFVSYFAFV+  VVVP V IV+V  P DDPIS NKLVQVPESGLAAIGFF+L RFFRRYGLRQL
Subjt:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL

Query:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV
        LFLDGLQEDS+YVQRGYARELDK+FRYLAYILFPSFFVELAHKI FF+TVKISFPGGFP+NSI+F+ VL +WVYRTGVFLLVC+LFRLTCELQILRLQGV
Subjt:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV

Query:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
         KLFE S+GSGS+A VIF+EH+RI+KQLWVTSHRYRFFII CL +ITVSQFVALLL L SKT KNFFNSGD+VVCS VQLCGFFLC+ GAARITHRAQ I
Subjt:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI

Query:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV
         S+A+RWHMLVTS  +AAG + TKPP P P     DGDDTDSDSS  VLIS+S  +PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLFAFEFSLV
Subjt:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV

Query:  MWILSKVVVLS
        MWI+SKVVVLS
Subjt:  MWILSKVVVLS

A0A6J1KJB4 uncharacterized protein LOC111496225 | 3.1e-184 | 82.24
Query:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL
        LL   YCRSKSLTSDELRNFRISLKWCALDHSS AGKFVSYFAFV+  VVVP V IV+V  P DDPIS NKLVQVPESGLAAIGFF+L RFFRRYGLRQL
Subjt:  LLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQL

Query:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV
        LFLDGLQEDS+YVQRGYARELDK+FRYLAY+LFPSFF ELAHKI FF+TVKISFPGGFP+NSI+F+ VL +WVYRTGVFLLVC+LFRLTCELQILRLQGV
Subjt:  LFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGV

Query:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI
         KLFE S+GSGS+A VIF+EH+RI+KQLWVTSHRYRFFII CL +ITVSQFVALLLVL SKT+KNFFNSGD+VVCS VQLCGFFLC+ GAARITHRAQ I
Subjt:  QKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGI

Query:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV
         S+A+RWHMLVTS  +AAG + TKPP P P     DGDDTDSDSS  VLIS+S  +PSSFQTRQALVSYL+HNNGGIT++GFALDRGLLHTLFAFEFSLV
Subjt:  VSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSG-VLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLV

Query:  MWILSKVVVLS
        MWI+SKVVVLS
Subjt:  MWILSKVVVLS

SwissProt top hits | e value | % identity | Alignment
No hits found
Arabidopsis top hits | e value | % identity | Alignment
AT1G50630.1 Protein of unknown function (DUF3537) | 1.8e-67 | 37.41
Query:  RSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPI---SFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLFLD
        R  S   DEL +FR  L+W  +DHSS     +S+  F++F +VVPA++   +     D      ++ +VQ+  S +A + F  L+RF  +YGLR+ LF D
Subjt:  RSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPI---SFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLFLD

Query:  GLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTT--VKISFPGGFPM-NSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGVQ
         L ++S  V+R Y  +L+ S   ++Y + P F    A+KI ++ +   +I F G   + +++  ++ L +W+YRT V  LVC+LFRL C LQILRLQ   
Subjt:  GLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTT--VKISFPGGFPM-NSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGVQ

Query:  KLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIV
        KLF+  S  GS    I  EH+RI++ L + SHRYR FI+  L+++T SQF +LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ + 
Subjt:  KLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIV

Query:  SIASRWHMLVTSLA---SAAGSDQT-KPPT--------------------PHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGIT
         +A++WH+  T  +   +    DQT + PT                       D  G + DD D++    +I V A    SFQ RQALVSY  +N+ GIT
Subjt:  SIASRWHMLVTSLA---SAAGSDQT-KPPT--------------------PHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGIT

Query:  VYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS
        VYGF LDRG LHT+F  E SLV+W+L K + +S
Subjt:  VYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS

AT1G50630.2 Protein of unknown function (DUF3537) | 1.2e-50 | 34.79
Query:  RSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPI---SFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLFLD
        R  S   DEL +FR  L+W  +DHSS     +S+  F++F +VVPA++   +     D      ++ +VQ+  S +A + F  L+RF  +YGLR+ LF D
Subjt:  RSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPI---SFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLFLD

Query:  GLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTT--VKISFPGGFPM-NSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGVQ
         L ++S  V+R Y  +L+ S   ++Y + P F    A+KI ++ +   +I F G   + +++  ++ L +W+YRT V  LVC+LFRL C LQILRLQ   
Subjt:  GLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTT--VKISFPGGFPM-NSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGVQ

Query:  KLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIV
        KLF+  S  GS    I  EH+RI++ L + SHRYR FI+  L+++T SQF +LL+   + T+ N + +G++ +CS   +    + L  A++ITH+AQ + 
Subjt:  KLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIV

Query:  SIASRWHMLVTSLA---SAAGSDQT-KPPT--------------------PHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQAL
         +A++WH+  T  +   +    DQT + PT                       D  G + DD D++    +I V A    SFQ RQAL
Subjt:  SIASRWHMLVTSLA---SAAGSDQT-KPPT--------------------PHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQAL

AT2G21080.1 unknown protein | 2.8e-124 | 60.33
Query:  SLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVP------IDDPISFNKLVQVPESGLAAIGFFTLSRFFR
        +L + P   SKSL SD+LRNFR+ LKWCALDHSS+ GK VSY  FV+F ++VP ++ + +  P      + D  SFN LVQ PESGLA IGF TL  FFR
Subjt:  SLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVP------IDDPISFNKLVQVPESGLAAIGFFTLSRFFR

Query:  RYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFP----GGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLT
         Y L +LLFLD    DS  V+ GY+RELDK+ RYLAYIL PSF VEL HK  FF + ++SFP        +N +MF LVL +WVYRTGVFLLVCILFRLT
Subjt:  RYGLRQLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFP----GGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLT

Query:  CELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFG
        CELQILR +G+ KLF+       E   +  EHVRIKKQL  TSHRYRFFII   +VI+ SQFVALLLVLASK++K+F +SGD+VVCSAVQL GFFLCL G
Subjt:  CELQILRLQGVQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFG

Query:  AARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLH
        AARITHRAQG+V IA+RWHM +T  + A                  + D   SD+  + +S S    S FQ RQALV YLRHNN GIT+YG+ALDRGLLH
Subjt:  AARITHRAQGIVSIASRWHMLVTSLASAAGSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLH

Query:  TLFAFEFSLVMWILSKVVVLS
        TLFAFEFSLVMWILSKVVVLS
Subjt:  TLFAFEFSLVMWILSKVVVLS

AT3G20300.1 Protein of unknown function (DUF3537) | 2.6e-66 | 36.85
Query:  YCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVA---IVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLF
        + RS S   DEL +FR  L+W  +D SS     +S+  FV+F +VVPA +   +            ++ +VQ+  S  AA+ F  LSRF  +YGLR+ LF
Subjt:  YCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVA---IVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLF

Query:  LDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTT--VKISFPGGFPM-NSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQG
         D L ++S  V+ GY  +L++S + L+Y + P F    ++KI ++ +   +I F G   + +++  ++ L +W+YRT V  LVC+LFRL C LQILRLQ 
Subjt:  LDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTT--VKISFPGGFPM-NSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQG

Query:  VQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQG
          ++F+  S  GS    I  EH+RI++ L + SHRYR FI+  L+++T SQF +LL+   +  + N + +G++ +CS   +    + L  A++ITH+AQ 
Subjt:  VQKLFEGSSGSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQG

Query:  IVSIASRWHMLVTSLASAAGSDQTKP------------PTPHPDGSGGD---GDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALD
        +  +A++WH+  T + S    D   P            PT   +G       GD+ D   +  LI   A+   SFQ RQALV+Y  +N  GITV+GF LD
Subjt:  IVSIASRWHMLVTSLASAAGSDQTKP------------PTPHPDGSGGD---GDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALD

Query:  RGLLHTLFAFEFSLVMWILSKVVVLS
        R  LHT+F  E SLV+W+L K + +S
Subjt:  RGLLHTLFAFEFSLVMWILSKVVVLS

AT4G22270.1 Protein of unknown function (DUF3537) | 1.9e-48 | 32.24
Query:  DHSSTAGKFVSYFAFVLFAVVVPAVA---IVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFR
        D S+     +S+  F L  V+VP ++   +V           ++ +VQ+  S  A I F +LS + R++G+R+ LFLD L + S  V+  Y  E+ +S +
Subjt:  DHSSTAGKFVSYFAFVLFAVVVPAVA---IVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLRQLLFLDGLQEDSVYVQRGYARELDKSFR

Query:  YLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIM-----FVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGVQKLFEGSSGSGSEAAVIFDEH
         L   + PS  +E  ++I ++ +     P  + +N I+       L L +W+YR  +F++VCIL+++TC LQ LRL    + F   +   ++      EH
Subjt:  YLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIM-----FVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGVQKLFEGSSGSGSEAAVIFDEH

Query:  VRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIVSIASRWHMLVT--SLASAAG
         +I++ L + SHR+R FI+  L+++T +QF+ALL    +    N +  G++ +CS   + G F+CL  A +ITH+AQ + S+A++W++  T  S     G
Subjt:  VRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIVSIASRWHMLVT--SLASAAG

Query:  SDQTKPPTPHPDGSGGDGDDTDSDSSGV----LISVSAH----QPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVV
           T           G+  +T  D  G     L +   H       S+Q RQALV+YL +N  GITVYGF +DR  L+T+F  E +L++W+L+K +V
Subjt:  SDQTKPPTPHPDGSGGDGDDTDSDSSGV----LISVSAH----QPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVV


Sequences
CDS sequence
ATGACCTCCGCCGATCACTCAAACGTAACCGTCTCTCTCCTCCATCCCCCCTACTGCAGATCCAAATCCCTAACCTCCGACGAGCTCCGCAACTTCCGAATCAGCCTCAA
ATGGTGCGCCCTAGACCACTCCTCCACCGCCGGGAAGTTCGTCTCCTACTTCGCCTTCGTCCTCTTTGCCGTCGTCGTCCCCGCCGTCGCGATCGTCGCCGTCCATGTCC
CGATCGACGATCCGATCTCGTTCAACAAACTGGTTCAGGTTCCGGAATCCGGCCTGGCCGCGATCGGATTCTTCACGCTGTCGCGATTCTTCCGGCGGTACGGCCTCCGG
CAGCTGCTGTTCCTGGACGGATTGCAGGAGGATTCCGTGTACGTACAGCGGGGGTACGCGCGTGAGCTCGACAAGTCGTTCCGTTATTTGGCGTACATTCTGTTTCCGTC
GTTCTTCGTGGAGCTGGCCCACAAGATCTCCTTCTTCACCACCGTCAAGATCTCCTTCCCCGGCGGGTTTCCAATGAACTCGATAATGTTCGTGCTCGTGTTGGGGGCGT
GGGTGTACCGGACGGGGGTGTTCTTGTTGGTGTGCATATTGTTTCGGCTCACGTGCGAGTTGCAAATTTTGCGACTCCAAGGAGTGCAGAAGTTGTTCGAAGGGAGCAGC
GGATCGGGATCGGAGGCGGCGGTGATCTTCGACGAACACGTGAGGATCAAAAAACAGTTGTGGGTTACGAGCCACCGGTACCGGTTCTTCATCATAGGGTGTTTGTTGGT
CATCACTGTCAGCCAGTTCGTGGCGCTGCTACTTGTTTTGGCCTCCAAAACTCAGAAGAATTTCTTCAACTCCGGCGACGTCGTGGTTTGCTCGGCGGTGCAGTTATGCG
GATTCTTCTTGTGCCTATTTGGAGCGGCAAGAATAACTCATAGAGCCCAAGGGATTGTTTCCATAGCTTCCAGATGGCACATGCTCGTTACTTCCCTCGCCTCCGCCGCT
GGATCGGACCAGACCAAACCCCCGACGCCGCACCCTGACGGCAGCGGCGGCGACGGCGACGACACCGATTCCGACTCGTCGGGCGTTTTAATTTCGGTGAGCGCGCATCA
ACCTTCATCCTTCCAGACCAGACAAGCGCTAGTGTCGTATTTGCGGCACAACAACGGCGGAATTACGGTGTACGGATTCGCGCTGGATCGGGGATTGCTCCATACGCTCT
TTGCGTTTGAATTCTCCCTGGTGATGTGGATTTTGAGTAAGGTGGTTGTATTGTCTTAG
mRNA sequence
ATGACCTCCGCCGATCACTCAAACGTAACCGTCTCTCTCCTCCATCCCCCCTACTGCAGATCCAAATCCCTAACCTCCGACGAGCTCCGCAACTTCCGAATCAGCCTCAA
ATGGTGCGCCCTAGACCACTCCTCCACCGCCGGGAAGTTCGTCTCCTACTTCGCCTTCGTCCTCTTTGCCGTCGTCGTCCCCGCCGTCGCGATCGTCGCCGTCCATGTCC
CGATCGACGATCCGATCTCGTTCAACAAACTGGTTCAGGTTCCGGAATCCGGCCTGGCCGCGATCGGATTCTTCACGCTGTCGCGATTCTTCCGGCGGTACGGCCTCCGG
CAGCTGCTGTTCCTGGACGGATTGCAGGAGGATTCCGTGTACGTACAGCGGGGGTACGCGCGTGAGCTCGACAAGTCGTTCCGTTATTTGGCGTACATTCTGTTTCCGTC
GTTCTTCGTGGAGCTGGCCCACAAGATCTCCTTCTTCACCACCGTCAAGATCTCCTTCCCCGGCGGGTTTCCAATGAACTCGATAATGTTCGTGCTCGTGTTGGGGGCGT
GGGTGTACCGGACGGGGGTGTTCTTGTTGGTGTGCATATTGTTTCGGCTCACGTGCGAGTTGCAAATTTTGCGACTCCAAGGAGTGCAGAAGTTGTTCGAAGGGAGCAGC
GGATCGGGATCGGAGGCGGCGGTGATCTTCGACGAACACGTGAGGATCAAAAAACAGTTGTGGGTTACGAGCCACCGGTACCGGTTCTTCATCATAGGGTGTTTGTTGGT
CATCACTGTCAGCCAGTTCGTGGCGCTGCTACTTGTTTTGGCCTCCAAAACTCAGAAGAATTTCTTCAACTCCGGCGACGTCGTGGTTTGCTCGGCGGTGCAGTTATGCG
GATTCTTCTTGTGCCTATTTGGAGCGGCAAGAATAACTCATAGAGCCCAAGGGATTGTTTCCATAGCTTCCAGATGGCACATGCTCGTTACTTCCCTCGCCTCCGCCGCT
GGATCGGACCAGACCAAACCCCCGACGCCGCACCCTGACGGCAGCGGCGGCGACGGCGACGACACCGATTCCGACTCGTCGGGCGTTTTAATTTCGGTGAGCGCGCATCA
ACCTTCATCCTTCCAGACCAGACAAGCGCTAGTGTCGTATTTGCGGCACAACAACGGCGGAATTACGGTGTACGGATTCGCGCTGGATCGGGGATTGCTCCATACGCTCT
TTGCGTTTGAATTCTCCCTGGTGATGTGGATTTTGAGTAAGGTGGTTGTATTGTCTTAG
Protein sequence
MTSADHSNVTVSLLHPPYCRSKSLTSDELRNFRISLKWCALDHSSTAGKFVSYFAFVLFAVVVPAVAIVAVHVPIDDPISFNKLVQVPESGLAAIGFFTLSRFFRRYGLR
QLLFLDGLQEDSVYVQRGYARELDKSFRYLAYILFPSFFVELAHKISFFTTVKISFPGGFPMNSIMFVLVLGAWVYRTGVFLLVCILFRLTCELQILRLQGVQKLFEGSS
GSGSEAAVIFDEHVRIKKQLWVTSHRYRFFIIGCLLVITVSQFVALLLVLASKTQKNFFNSGDVVVCSAVQLCGFFLCLFGAARITHRAQGIVSIASRWHMLVTSLASAA
GSDQTKPPTPHPDGSGGDGDDTDSDSSGVLISVSAHQPSSFQTRQALVSYLRHNNGGITVYGFALDRGLLHTLFAFEFSLVMWILSKVVVLS