; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003753 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003753
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF1068)
Genome locationscaffold4:47167354..47170847
RNA-Seq ExpressionSpg003753
SyntenySpg003753
Gene Ontology termsNA
InterPro domainsIPR010471 - Protein of unknown function DUF1068


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004139277.1 uncharacterized protein LOC101212944 [Cucumis sativus]8.9e-7671.82Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLKQEMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERK+TSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

XP_008457359.1 PREDICTED: uncharacterized protein LOC103497063 [Cucumis melo]4.0e-7672.27Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLKQEMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERKLTSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

XP_022143173.1 uncharacterized protein LOC111013108 [Momordica charantia]8.9e-7672.27Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKA QLGDSK SC PCICDCPPPLSLLKIAPG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLK EMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERKLTSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

XP_022938183.1 uncharacterized protein LOC111444344 [Cucurbita moschata]5.7e-7571.23Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLV FAVVSALAVCGPAL+WRFKKAL LGDSKTSCAPCICDCPPPLSLLKIAPG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CGGNDPDLK+EMEKQFVDLLTEELKLQEAVAGEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEG
        ERKLTSLWERRARQMGW+G
Subjt:  ERKLTSLWERRARQMGWEG

XP_038890751.1 uncharacterized protein LOC120080238 [Benincasa hispida]1.0e-7673.18Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PG+                        +SI+      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLKQEMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERKLTSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

TrEMBL top hitse value%identityAlignment
A0A0A0LG38 Uncharacterized protein4.3e-7671.82Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLKQEMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERK+TSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

A0A1S3C6L6 uncharacterized protein LOC1034970631.9e-7672.27Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLKQEMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERKLTSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

A0A5A7VC79 DUF1068 domain-containing protein1.9e-7672.27Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLV FAVVSALAVCGPALYWRFKKALQLGDSK SC PCICDCPPPLSLLKI+PG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLKQEMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERKLTSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

A0A6J1CQ21 uncharacterized protein LOC1110131084.3e-7672.27Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKA QLGDSK SC PCICDCPPPLSLLKIAPG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CG NDPDLK EMEKQFVDLLTEELKLQEAV+GEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEGE
        ERKLTSLWERRARQMGWEGE
Subjt:  ERKLTSLWERRARQMGWEGE

A0A6J1FI57 uncharacterized protein LOC1114443442.8e-7571.23Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET
        MSRRSGACLRCCLV FAVVSALAVCGPAL+WRFKKAL LGDSKTSCAPCICDCPPPLSLLKIAPG+                        +S++      
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLET

Query:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK
                            +CGGNDPDLK+EMEKQFVDLLTEELKLQEAVAGEHTRHMNIT+FEAKRAASQYQREAEKCI+ATETCEEARERAEAL IK
Subjt:  GVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIK

Query:  ERKLTSLWERRARQMGWEG
        ERKLTSLWERRARQMGW+G
Subjt:  ERKLTSLWERRARQMGWEG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05070.1 Protein of unknown function (DUF1068)2.1e-2232.24Show/hide
Query:  RSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLETGVS
        R  A L+  L +  +  A  + GP LYW   +AL    S +SC  C C+C                                      + S +++   +S
Subjt:  RSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLETGVS

Query:  ACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK
            +            +C  +DP++ ++ EK + +LLTEELKL+EA + E  +  ++ L EAK+  S YQ+EA+KC +  ETCEEARE+AE  + +++K
Subjt:  ACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERK

Query:  LTSLWERRARQMGW
        LTS WE RARQ GW
Subjt:  LTSLWERRARQMGW

AT2G24290.1 Protein of unknown function (DUF1068)2.6e-5755.61Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS---CAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMS
        M+RRSG C+R CLVIF+VVSAL VCGPALYW+  K   +G ++++   C PC+CD PPPLSLL+IAPG+                        +SI+   
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS---CAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMS

Query:  LETGVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEAL
                                CG +DP+LK+EMEK FVDLLTEELKLQEAVA EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERA+AL
Subjt:  LETGVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEAL

Query:  MIKERKLTSLWERRARQMGWEGE
        ++KERK+T LWERRARQ+GWEGE
Subjt:  MIKERKLTSLWERRARQMGWEGE

AT2G32580.1 Protein of unknown function (DUF1068)1.5e-2533.49Show/hide
Query:  ACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLETGVSACV
        A L+  L + A+     + GP LYW   +AL +  S TSC+ C+CDC                                      S+ L+++ TG+S   
Subjt:  ACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLETGVSACV

Query:  MSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTS
                   +  +C   DP++ ++ EK + +LLTEELK +EA + E  + ++  L EAK+  S YQ+EA+KC +  ETCEEARE+AE  +++++KLTS
Subjt:  MSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTS

Query:  LWERRARQMGWE
        +WE+RARQ G++
Subjt:  LWERRARQMGWE

AT2G32580.2 Protein of unknown function (DUF1068)1.0e-2148.98Show/hide
Query:  NCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE
        NC   DP++ ++ EK + +LLTEELK +EA + E  + ++  L EAK+  S YQ+EA+KC +  ETCEEARE+AE  +++++KLTS+WE+RARQ G++
Subjt:  NCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWERRARQMGWE

AT4G30996.1 Protein of unknown function (DUF1068)2.9e-6159.01Show/hide
Query:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS--CAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSL
        M RRSG C+R CLVIFAVVSAL VCGPALYW+F K   +G ++ +  C PC+CDCPPPLSLL+IAPG+                        +SI+    
Subjt:  MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTS--CAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSL

Query:  ETGVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALM
                              +CG +DP+LKQEMEKQFVDLLTEELKLQEAVA EH+RHMN+TL EAKR ASQYQ+EAEKC AATE CE ARERAEAL+
Subjt:  ETGVSACVMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALM

Query:  IKERKLTSLWERRARQMGWEGE
        IKERK+TSLWE+RARQ GWEGE
Subjt:  IKERKLTSLWERRARQMGWEGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCACGCCGATCTGGGGCTTGCCTGCGGTGTTGTCTCGTGATTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAG
GCTTTGCAATTGGGTGATTCCAAAACCTCCTGTGCTCCTTGCATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTGCCCCTGGGGTGATCCTGGTAGCA
AGGGATTCTGATATTTGGGATGAGTTCCCTAAGGGTCTTGGGTTCAAGCCTCCAGGGGGAATCTCAATATCCTTGATGTCTCTTGAAACTGGTGTTAGCGCATGC
GTTATGTCTGGCCAACCTCTCCGTCACAGAGTTCCCAACTCAAGAAACTGTGGAGGTAATGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGACCTT
TTGACAGAGGAATTGAAACTTCAAGAAGCAGTTGCTGGCGAACATACTCGCCATATGAACATCACTTTATTCGAGGCAAAAAGGGCAGCTTCTCAGTATCAGAGG
GAGGCTGAAAAGTGCATTGCTGCCACAGAAACTTGTGAAGAGGCCCGAGAACGCGCCGAGGCATTGATGATCAAGGAGAGGAAGCTTACATCATTGTGGGAGCGA
CGAGCCCGCCAAATGGGTTGGGAAGGGGAATAA
mRNA sequenceShow/hide mRNA sequence
ATGTCACGCCGATCTGGGGCTTGCCTGCGGTGTTGTCTCGTGATTTTTGCTGTAGTTTCTGCTTTGGCTGTTTGTGGACCGGCTTTGTATTGGAGATTCAAGAAG
GCTTTGCAATTGGGTGATTCCAAAACCTCCTGTGCTCCTTGCATCTGCGATTGTCCACCCCCATTATCCCTTTTGAAGATTGCCCCTGGGGTGATCCTGGTAGCA
AGGGATTCTGATATTTGGGATGAGTTCCCTAAGGGTCTTGGGTTCAAGCCTCCAGGGGGAATCTCAATATCCTTGATGTCTCTTGAAACTGGTGTTAGCGCATGC
GTTATGTCTGGCCAACCTCTCCGTCACAGAGTTCCCAACTCAAGAAACTGTGGAGGTAATGACCCAGATCTCAAGCAGGAGATGGAAAAACAATTTGTGGACCTT
TTGACAGAGGAATTGAAACTTCAAGAAGCAGTTGCTGGCGAACATACTCGCCATATGAACATCACTTTATTCGAGGCAAAAAGGGCAGCTTCTCAGTATCAGAGG
GAGGCTGAAAAGTGCATTGCTGCCACAGAAACTTGTGAAGAGGCCCGAGAACGCGCCGAGGCATTGATGATCAAGGAGAGGAAGCTTACATCATTGTGGGAGCGA
CGAGCCCGCCAAATGGGTTGGGAAGGGGAATAA
Protein sequenceShow/hide protein sequence
MSRRSGACLRCCLVIFAVVSALAVCGPALYWRFKKALQLGDSKTSCAPCICDCPPPLSLLKIAPGVILVARDSDIWDEFPKGLGFKPPGGISISLMSLETGVSAC
VMSGQPLRHRVPNSRNCGGNDPDLKQEMEKQFVDLLTEELKLQEAVAGEHTRHMNITLFEAKRAASQYQREAEKCIAATETCEEARERAEALMIKERKLTSLWER
RARQMGWEGE