; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0034209 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0034209
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionDUF4228 domain-containing protein
Genome locationchr3:5280202..5281048
RNA-Seq ExpressionLag0034209
SyntenyLag0034209
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6587486.1 hypothetical protein SDJN03_16051, partial [Cucurbita argyrosperma subsp. sororia]1.9e-8285.12Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE S                NE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT

Query:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS
        RIKLLRPAD LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD   AAA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARS
Subjt:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

XP_022933525.1 uncharacterized protein LOC111440926 isoform X1 [Cucurbita moschata]1.1e-8285.12Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE S                NE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT

Query:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS
        RIKLLRPAD LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD   AAA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARS
Subjt:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

XP_022933526.1 uncharacterized protein LOC111440926 isoform X2 [Cucurbita moschata]7.9e-8489.27Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS------NETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE S      NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS------NETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI
        LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD   AAA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

XP_022965769.1 uncharacterized protein LOC111465553 isoform X2 [Cucurbita maxima]8.7e-8388.29Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENET------SNETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE        NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENET------SNETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDA---AAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI
        LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD+   AA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDA---AAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

XP_023529445.1 uncharacterized protein LOC111792303 [Cucurbita pepo subsp. pepo]1.3e-8184.19Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE S                NE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT

Query:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAA---AVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS
        RIKLLRPAD LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD+A   A GRSVSEDP QATKHEKNNRPRTSTSTTSA  RS
Subjt:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAA---AVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

TrEMBL top hitse value%identityAlignment
A0A0A0LRI4 Uncharacterized protein1.5e-8086.87Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV
        MGNCQAIDAATLVIQHPSGK DKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNN          N+TSNETSSNSVRLTRIKLLRPADMLVLGQV
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV

Query:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD--AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS
        YRLIT+QEVM+GLSAKKQAKVKQSQLEAA+KP RRK+RT R SD  AAA GRSVSED +QA KHEKNNRPRTSTSTTSA ARSRTWQPSLHSISEAGS
Subjt:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD--AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS

A0A6J1EZA8 uncharacterized protein LOC111440926 isoform X23.8e-8489.27Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS------NETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE S      NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS------NETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI
        LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD   AAA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

A0A6J1F544 uncharacterized protein LOC111440926 isoform X15.5e-8385.12Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE S                NE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETS----------------NETSSNSVRLT

Query:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS
        RIKLLRPAD LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD   AAA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARS
Subjt:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSD---AAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

A0A6J1HMJ8 uncharacterized protein LOC111465553 isoform X16.1e-8284.19Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENET----------------SNETSSNSVRLT
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE                  NE SSNSVRLT
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENET----------------SNETSSNSVRLT

Query:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDA---AAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS
        RIKLLRPAD LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD+   AA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARS
Subjt:  RIKLLRPADMLVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDA---AAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARS

Query:  RTWQPSLHSISEAGS
        RTWQPSLHSISEAGS
Subjt:  RTWQPSLHSISEAGS

A0A6J1HRW5 uncharacterized protein LOC111465553 isoform X24.2e-8388.29Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENET------SNETSSNSVRLTRIKLLRPADM
        MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTM TPNES+NNNETSKVEKKENE        NE SSNSVRLTRIKLLRPAD 
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENET------SNETSSNSVRLTRIKLLRPADM

Query:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDA---AAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI
        LVLGQVYRLITS+EVM GLSAKKQAKVKQSQLEAAEK  RRKER ARGSD+   AA GRSVSEDP QATKHEKNNRPRTSTSTTSA ARSRTWQPSLHSI
Subjt:  LVLGQVYRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDA---AAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSI

Query:  SEAGS
        SEAGS
Subjt:  SEAGS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G10530.1 unknown protein1.8e-2543.65Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV
        MGNCQA++AA LV+QHP G +D+ Y  V+  E+M M PGHYV+L+I     +  E  N   T K + K+          +VR TR++LLRP + LVLG  
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV

Query:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRP-RTSTSTTSAAARSRTWQPSLHSISEAGS
        YRLITSQEVM+ L  KK AK K+ Q+E        K  TA         +  S+  V   K  K  R  R STS      +S+TW+PSL SISEA S
Subjt:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRP-RTSTSTTSAAARSRTWQPSLHSISEAGS

AT1G60010.1 unknown protein2.2e-3145.41Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV
        MGNCQA+DAA LV+QHP GK+D+ Y PV+  EIM+M PGHYV+L+I      P ++     T+  +K E +         VR TR+KLLRP + LVLG  
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV

Query:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS
        YRLITSQEVM+ L AKK AK K+ Q E +      KE+    S+     + + E+  +    E  +  + S  T SA++RS+TW+PSL SISEA S
Subjt:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS

AT5G50090.1 unknown protein7.0e-3044.9Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV
        MGNCQA+D A +VIQHP+GK +KL  PV+A  +MKMNPGH V+LLISTT  +   S +                      +RLTRIKLLRP D LVLG V
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV

Query:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS
        YRLIT++EVM+GL AKK +K+K+    + +K +  K   +   D        +ED +Q  K EK             +  SR+WQPSL SISE GS
Subjt:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS

AT5G50090.2 unknown protein1.7e-2842.86Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV
        MGNCQA+D A +VIQHP+GK +KL  PV+A  +MKMNPGH V+LLISTT  +   S +                      +RLTRIKLLRP D LVLG V
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV

Query:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS
        YRLIT++EVM+GL AKK +K+K               + ++GSD          D ++  K   + +          +  SR+WQPSL SISE GS
Subjt:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS

AT5G62900.1 unknown protein8.8e-2540.82Show/hide
Query:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV
        MGNCQA +AAT VIQ P GK  + Y  V A E++K +PGH+VALL+S+ +                             S+R+TRIKLLRP+D L+LG V
Subjt:  MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQV

Query:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS
        YRLI+S+EVM+G+ AKK  K+K+   E +   +     T R         S S+   Q   HEK    R   +T  A  + R WQPSL SISE+ S
Subjt:  YRLITSQEVMRGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGAAATTGCCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCAAGTGGGAAAGTAGACAAATTGTATTGGCCTGTGACTGCTAGAGAGATCATGAAGATGAA
TCCTGGTCACTATGTTGCTCTTCTCATCTCCACCACCATGTTTACACCAAATGAAAGCAATAACAACAATGAAACCAGCAAGGTTGAGAAAAAGGAGAATGAAACCAGCA
ATGAAACCAGTAGTAATTCGGTTCGTTTAACTCGAATCAAGCTTCTCCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTACAGGCTCATCACTTCTCAAGAGGTTATG
AGAGGTTTATCAGCAAAGAAACAAGCAAAGGTTAAACAAAGCCAGTTAGAAGCAGCAGAGAAACCACAGAGGAGGAAAGAACGTACAGCCAGAGGCTCAGATGCAGCAGC
AGTTGGAAGATCTGTATCTGAAGACCCTGTTCAGGCGACCAAACACGAGAAGAACAACAGACCAAGGACAAGTACATCGACAACCTCGGCCGCAGCCAGGTCAAGAACAT
GGCAACCTTCATTACATAGCATCTCAGAAGCTGGAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGAAATTGCCAAGCCATTGATGCAGCAACACTTGTGATACAACACCCAAGTGGGAAAGTAGACAAATTGTATTGGCCTGTGACTGCTAGAGAGATCATGAAGATGAA
TCCTGGTCACTATGTTGCTCTTCTCATCTCCACCACCATGTTTACACCAAATGAAAGCAATAACAACAATGAAACCAGCAAGGTTGAGAAAAAGGAGAATGAAACCAGCA
ATGAAACCAGTAGTAATTCGGTTCGTTTAACTCGAATCAAGCTTCTCCGCCCAGCTGACATGCTTGTTCTTGGCCAAGTTTACAGGCTCATCACTTCTCAAGAGGTTATG
AGAGGTTTATCAGCAAAGAAACAAGCAAAGGTTAAACAAAGCCAGTTAGAAGCAGCAGAGAAACCACAGAGGAGGAAAGAACGTACAGCCAGAGGCTCAGATGCAGCAGC
AGTTGGAAGATCTGTATCTGAAGACCCTGTTCAGGCGACCAAACACGAGAAGAACAACAGACCAAGGACAAGTACATCGACAACCTCGGCCGCAGCCAGGTCAAGAACAT
GGCAACCTTCATTACATAGCATCTCAGAAGCTGGAAGCTAA
Protein sequenceShow/hide protein sequence
MGNCQAIDAATLVIQHPSGKVDKLYWPVTAREIMKMNPGHYVALLISTTMFTPNESNNNNETSKVEKKENETSNETSSNSVRLTRIKLLRPADMLVLGQVYRLITSQEVM
RGLSAKKQAKVKQSQLEAAEKPQRRKERTARGSDAAAVGRSVSEDPVQATKHEKNNRPRTSTSTTSAAARSRTWQPSLHSISEAGS