CuGenDBv2

Lag0019028 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0019028
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Unknown protein
Genome location: chr5:37770806..37774283
RNA-Seq Expression: Lag0019028
Synteny: Lag0019028
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: NA
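
The genome location field follows a `chrom:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the field can be split and the span length computed as below. `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re

def parse_location(loc):
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    m = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    return m.group(1), int(m.group(2)), int(m.group(3))

chrom, start, end = parse_location("chr5:37770806..37774283")
# Assumption: 1-based inclusive coordinates, hence the +1.
span = end - start + 1
print(chrom, start, end, span)
```

Under that assumption the gene spans 3478 bp on chr5, which is longer than the 801 nt CDS listed under Sequences.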


Homology
GenBank top hits | e value | % identity | Alignment
XP_022136930.1 uncharacterized protein LOC111008504 [Momordica charantia] | 2.2e-121 | 88.64
Query:  MAKPTAEEDSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLE
        MA+ T   DSIS+F AMAVN CQ A VSNGVHVQEKLAKV  LDEAERHCSLEILPILFE+ASFPFQSSS RDSSG  S EEFDNSPDCDPHLAFLS LE
Subjt:  MAKPTAEEDSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLE

Query:  VTHPTKSRMSLESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKF
        VTHPTKSRMSLE+SDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSK C+G FE+LKT+NTLVRIEKVLQRQSSLK+GVKLV YLLDHGLMLLKF
Subjt:  VTHPTKSRMSLESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKF

Query:  SSKEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSV
        S+KEK  TERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ GDGSV
Subjt:  SSKEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSV

XP_022941601.1 uncharacterized protein LOC111446909 [Cucurbita moschata] | 1.7e-118 | 90.8
Query:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT
        MAVNHCQAAT+SNGVHVQEKLAKVT  DEAERH SLEILP LFE+ASFP Q S ARDSSGF STEEFDNSPDCDPHLAFLSFLEVTHPT S+MSL +SD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN
         LTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLL+FSSKEKS TERVHD PN
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN

Query:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
        NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM

XP_022991750.1 uncharacterized protein LOC111488280 [Cucurbita maxima] | 1.9e-117 | 90
Query:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT
        MAVNHCQAA +SNGVHVQEKLAKVT  DE ERH SLEILP LF++ASFP Q S ARDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT S+MSL +SD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN
         LTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKS TERVHD PN
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN

Query:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
        NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM

XP_023531596.1 uncharacterized protein LOC111793785 [Cucurbita pepo subsp. pepo] | 7.3e-117 | 90
Query:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT
        MAVNHCQAAT+SNGVHVQEKLAKV   DE ERH SLEILP LFE+ASFP Q S ARDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT S+MSL +SD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN
         LTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTF +LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKS TERVHD PN
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN

Query:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
        NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM

XP_038903131.1 uncharacterized protein LOC120089804 isoform X1 [Benincasa hispida] | 2.3e-118 | 85.34
Query:  MAKPTAEEDSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLE
        MA+  A +   STFA MAVN CQ A VSNGV VQEKLAKV+ L+E+ERHCSLEILP LFE+ASFPFQ+SSARDSSGFLSTEEFDNSP+CDPHLAFLSFLE
Subjt:  MAKPTAEEDSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLE

Query:  VTHPTKSRMSLESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKF
        VTH TKSRMSLE+SD RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKL+ SKSCEG+ E++KTENTL+RIEKVLQRQSSLKMG KL  YLLDHGLMLLKF
Subjt:  VTHPTKSRMSLESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKF

Query:  SSKEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
        SSKEK  TER  DMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ GDGSVAM
Subjt:  SSKEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
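
In the alignment blocks above, the middle line follows the usual BLAST convention: a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below counts identities for a single block. `midline_identity` is a hypothetical helper, and the result covers only that block, so it is not expected to reproduce the % identity column, which the database computes over the whole alignment by a method the page does not describe.

```python
def midline_identity(midline, aligned_length):
    """Count identical positions in a BLAST-style midline.

    Letters mark identical residues; '+' marks a conservative
    substitution; a space marks a mismatch or gap.
    The aligned length is passed separately because trailing
    spaces in a midline are easily lost when text is copied.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return identical, 100.0 * identical / aligned_length

# Last alignment block of the XP_022941601.1 hit above.
query   = "NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM"
midline = "NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM"
identical, pct = midline_identity(midline, len(query))
print(identical, len(query), round(pct, 2))
```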

TrEMBL top hits | e value | % identity | Alignment
A0A6J1C5A8 uncharacterized protein LOC111008504 | 1.1e-121 | 88.64
Query:  MAKPTAEEDSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLE
        MA+ T   DSIS+F AMAVN CQ A VSNGVHVQEKLAKV  LDEAERHCSLEILPILFE+ASFPFQSSS RDSSG  S EEFDNSPDCDPHLAFLS LE
Subjt:  MAKPTAEEDSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLE

Query:  VTHPTKSRMSLESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKF
        VTHPTKSRMSLE+SDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSK C+G FE+LKT+NTLVRIEKVLQRQSSLK+GVKLV YLLDHGLMLLKF
Subjt:  VTHPTKSRMSLESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKF

Query:  SSKEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSV
        S+KEK  TERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQ GDGSV
Subjt:  SSKEKSATERVHDMPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSV

A0A6J1ENF8 uncharacterized protein LOC111436086 | 1.4e-110 | 84.58
Query:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT
        MAVN CQ A VSNGV VQEKLAKVTGLDE ERHCSLEILPILFE+ SFPFQ+S A DSS FLSTE FDNSP+CDPHLAFLSFLEVTHPTK+RMSLE+SDT
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHD
        RLTCQNVIDIHVN GDA SSCIVNIDIDK   DKLKTSKS EG+FE+L+TE+TLVRIEKVLQRQSSLKMG KL+ YLLDHGLMLLKFSSKEKS  ER  D
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHD

Query:  MPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
          NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRV+QQ GDG VAM
Subjt:  MPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM

A0A6J1FLJ9 uncharacterized protein LOC111446909 | 8.5e-119 | 90.8
Query:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT
        MAVNHCQAAT+SNGVHVQEKLAKVT  DEAERH SLEILP LFE+ASFP Q S ARDSSGF STEEFDNSPDCDPHLAFLSFLEVTHPT S+MSL +SD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN
         LTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLL+FSSKEKS TERVHD PN
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN

Query:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
        NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM

A0A6J1JCA6 uncharacterized protein LOC111483162 | 6.9e-113 | 86.17
Query:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT
        MAVN CQ A VSNGV VQEKLAKVT LDE ERHCSLEILPILFE+ SFPFQ+S ARDSS FLSTE FDNSP+CDPHLAFLSFLEVTHPTK+RMSLE+SDT
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHD
        RLTCQNVIDIHVN GDA SSCIVNIDIDK   DKLKTSKSCEGTFE+L+TE+TLVRIEKVLQRQSSLKMG KLV YLLDHGLMLLKFSSKEKS  E+  D
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDK---DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHD

Query:  MPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
          NNRWRKYKR+ASFDSRKIV+LFSVLSSLGTLILIYLTLRVRQQ GDGSVAM
Subjt:  MPNNRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM

A0A6J1JTU6 uncharacterized protein LOC111488280 | 9.3e-118 | 90
Query:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT
        MAVNHCQAA +SNGVHVQEKLAKVT  DE ERH SLEILP LF++ASFP Q S ARDSSGFLSTEE DNSPDCDPHLAFLSFLEVTHPT S+MSL +SD 
Subjt:  MAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDT

Query:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN
         LTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKT KSCEGTFE+LKTENTL+RIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKS TERVHD PN
Subjt:  RLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPN

Query:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
        NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQS DGSVAM
Subjt:  NRWRKYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
AT4G39900.1 unknown protein | 4.4e-35 | 38.98
Query:  KLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGF-------LSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQNVIDIHV
        KL K T   +  +H ++++ P+L ++A+FP +     D+S         +  +E +  P C  H   LSF++   P+K++M +++       QN I++ +
Subjt:  KLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGF-------LSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMSLESSDTRLTCQNVIDIHV

Query:  NGGDAYSSCIVNIDIDK-DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPNNRWRKYKRAAS
         G D+Y SC+V+I+++K +  +T  S +    ++K+E+  V ++KVLQRQ+SL                     S +K+ +ER HD P NRWR+YKRAAS
Subjt:  NGGDAYSSCIVNIDIDK-DKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPNNRWRKYKRAAS

Query:  FDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGS
        FDSRKIVILFS+LSS+GTLILIYLTLRV+ Q+GD +
Subjt:  FDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGS


Sequences
CDS sequence
ATGGCCAAGCCCACAGCTGAGGAGGACTCAATCTCTACCTTCGCAGCAATGGCTGTTAATCACTGTCAGGCTGCAACTGTGTCGAATGGAGTTCACGTTCAAGAAAAACT
AGCGAAAGTTACTGGACTTGACGAGGCGGAGCGCCATTGTTCTTTAGAAATTCTGCCAATTCTCTTTGAGCAGGCGTCGTTCCCCTTTCAAAGTTCTTCGGCCCGTGATT
CCTCTGGCTTTTTAAGTACCGAGGAATTCGACAACAGTCCAGATTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAGGATGTCA
TTGGAAAGTTCAGATACCCGCTTGACTTGCCAGAACGTAATTGACATACATGTGAATGGTGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAA
GCTCAAAACATCTAAATCCTGTGAAGGAACTTTTGAAAATTTGAAAACTGAGAATACATTGGTGCGCATAGAAAAGGTATTGCAGAGACAGTCCAGCCTTAAAATGGGGG
TGAAACTTGTGCATTACTTGTTGGACCATGGACTAATGTTACTGAAGTTCTCGTCTAAAGAAAAATCAGCGACTGAGAGGGTTCACGATATGCCAAACAACAGGTGGAGA
AAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGATTGTTATTCTCTTCTCAGTATTGTCAAGCTTGGGAACCTTGATATTGATATATTTGACTCTGAGAGTAAGGCA
GCAGAGTGGAGATGGATCTGTTGCTATGTGA
mRNA sequence
ATGGCCAAGCCCACAGCTGAGGAGGACTCAATCTCTACCTTCGCAGCAATGGCTGTTAATCACTGTCAGGCTGCAACTGTGTCGAATGGAGTTCACGTTCAAGAAAAACT
AGCGAAAGTTACTGGACTTGACGAGGCGGAGCGCCATTGTTCTTTAGAAATTCTGCCAATTCTCTTTGAGCAGGCGTCGTTCCCCTTTCAAAGTTCTTCGGCCCGTGATT
CCTCTGGCTTTTTAAGTACCGAGGAATTCGACAACAGTCCAGATTGTGATCCACATTTAGCTTTCCTCAGCTTCCTGGAAGTTACCCATCCAACAAAAAGTAGGATGTCA
TTGGAAAGTTCAGATACCCGCTTGACTTGCCAGAACGTAATTGACATACATGTGAATGGTGGAGATGCTTATTCCTCGTGCATAGTAAATATTGATATTGACAAGGACAA
GCTCAAAACATCTAAATCCTGTGAAGGAACTTTTGAAAATTTGAAAACTGAGAATACATTGGTGCGCATAGAAAAGGTATTGCAGAGACAGTCCAGCCTTAAAATGGGGG
TGAAACTTGTGCATTACTTGTTGGACCATGGACTAATGTTACTGAAGTTCTCGTCTAAAGAAAAATCAGCGACTGAGAGGGTTCACGATATGCCAAACAACAGGTGGAGA
AAATATAAACGTGCTGCTTCATTTGATTCAAGAAAGATTGTTATTCTCTTCTCAGTATTGTCAAGCTTGGGAACCTTGATATTGATATATTTGACTCTGAGAGTAAGGCA
GCAGAGTGGAGATGGATCTGTTGCTATGTGA
Protein sequence
MAKPTAEEDSISTFAAMAVNHCQAATVSNGVHVQEKLAKVTGLDEAERHCSLEILPILFEQASFPFQSSSARDSSGFLSTEEFDNSPDCDPHLAFLSFLEVTHPTKSRMS
LESSDTRLTCQNVIDIHVNGGDAYSSCIVNIDIDKDKLKTSKSCEGTFENLKTENTLVRIEKVLQRQSSLKMGVKLVHYLLDHGLMLLKFSSKEKSATERVHDMPNNRWR
KYKRAASFDSRKIVILFSVLSSLGTLILIYLTLRVRQQSGDGSVAM
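
The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and uses only the first 27 and last 18 nucleotides of the CDS listed above. `translate` and `CODON_TABLE` are illustrative names, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases such as N
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

cds_start = "ATGGCCAAGCCCACAGCTGAGGAGGAC"  # first 9 codons of the CDS
cds_end = "GGATCTGTTGCTATGTGA"             # last 6 codons, including the stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MAKPTAEED and GSVAM, which match the first nine and last five residues of the protein sequence. Passing the full 801 nt CDS to the same function would check the whole protein.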