; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015688 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015688
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr12:20231432..20240244
RNA-Seq ExpressionLag0015688
SyntenyLag0015688
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]7.8e-6346.62Show/hide
Query:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS
        P+ ++DY +PV     S I+  PINANNFEL+  LI M Q   + GS ++DPN HL  FL+IC T                     DKAR WLQS+ PGS
Subjt:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS

Query:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLE
        I +W  + + FL KFF PAKT +LR++IG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQ+Q+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE
Subjt:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLE

Query:  EMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKL-----QREKKSTFGQMQASVETPALKRLDACIPYQNRR
        EMA+N+YQWP+ER+  KK+ AG+ E++ ++A  A+      ++     QR  +ST      S+  P+ +     + Y N R
Subjt:  EMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKL-----QREKKSTFGQMQASVETPALKRLDACIPYQNRR

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]7.8e-6346.62Show/hide
Query:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS
        P+ ++DY +PV     S I+  PINANNFEL+  LI M Q   + GS ++DPN HL  FL+IC T                     DKAR WLQS+ PGS
Subjt:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS

Query:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLE
        I +W  + + FL KFF PAKT +LR++IG F+Q   E L+EAWER+K+L+R+CPQHG PDWLQ+Q+FYNGL   T+TIVDAA+GGTL+SKT E A  LLE
Subjt:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLE

Query:  EMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKL-----QREKKSTFGQMQASVETPALKRLDACIPYQNRR
        EMA+N+YQWP+ER+  KK+ AG+ E++ ++A  A+      ++     QR  +ST      S+  P+ +     + Y N R
Subjt:  EMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKL-----QREKKSTFGQMQASVETPALKRLDACIPYQNRR

WP_217833153.1 retrotransposon gag domain-containing protein, partial [Synechococcus sp. PCC 7002]7.8e-7152.79Show/hide
Query:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS
        PK IRDYFQP     Q GI+  PIN NNFEL+ GLIQMA++ A+RG + EDP+ HL+SFL+ICGT                     D+A+DWL++IPP S
Subjt:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS

Query:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLE
        ITTW+ L Q FL K+F PAK+ +LRT+IGTF+Q  DEQL+EAWER+K+LLR+CPQHGYPDWLQIQLFYNGL  STK+I+DA AGG++ SK  + A T+LE
Subjt:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLE

Query:  EMATNSYQWPSERSGPK-KIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFGQMQASVETPALKRLDA
        ++AT SY WP ER+ P    AAG++E+D +++  A+       L    K T G  QA    P++  L A
Subjt:  EMATNSYQWPSERSGPK-KIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFGQMQASVETPALKRLDA

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]7.8e-6351.69Show/hide
Query:  EPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPG
        +P+ ++DY +P+  +  SGI    INANNFEL+  LI M Q   + GS ++DPN HL  FL+IC T                     DKAR WLQS+ PG
Subjt:  EPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPG

Query:  SITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLL
        SIT+W  + + FL KFF PAKT +LR++IG F+Q   E L+EAWER+K+L+R CPQHG PDWLQ+Q+FYNGL   T+TIVDAA+GGTL+SKT E A +LL
Subjt:  SITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLL

Query:  EEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAK
        EEMA+N+YQWP+ER+  KK+ AG+ E++  +A  A+
Subjt:  EEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAK

XP_023903214.1 uncharacterized protein LOC112015077 [Quercus suber]7.8e-6351.69Show/hide
Query:  EPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPG
        +P+ ++DY +P+  +  SGI    INANNFEL+  LI M Q   + GS ++DPN HL  FL+IC T                     DKAR WLQS+ PG
Subjt:  EPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPG

Query:  SITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLL
        SIT+W  + + FL KFF PAKT +LR++IG F+Q   E L+EAWER+K+L+R CPQHG PDWLQ+Q+FYNGL   T+TIVDAA+GGTL+SKT E A +LL
Subjt:  SITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLL

Query:  EEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAK
        EEMA+N+YQWP+ER+  KK+ AG+ E++  +A  A+
Subjt:  EEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAK

TrEMBL top hitse value%identityAlignment
A0A2I4F4C8 uncharacterized protein LOC1089953739.9e-5651.82Show/hide
Query:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS
        P+ ++DY +PV  +  SGI    INANNFEL+  LI M Q   +  S ++DPN HL  FL IC T                     DKAR WLQS+  GS
Subjt:  PKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSIPPGS

Query:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESART-LL
        IT+W  + + FL KFF PAKT +LR++I  F+Q   E L+EAWER+K L+R CPQHG P+WLQ+Q+FYNGL   T+TIVDAAAGGTL+SKT+E A T LL
Subjt:  ITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESART-LL

Query:  EEMATNSYQWPSERSGPKKI
        EEM +N+YQWP+E++  KK+
Subjt:  EEMATNSYQWPSERSGPKKI

A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129457.4e-5946.94Show/hide
Query:  PKEPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCA-YRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSI
        P+  + +RDY  P+ Q     I    INANNFE++   IQM Q    + G   +DPNSHL +FL+IC T                     DKA+ WL S+
Subjt:  PKEPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCA-YRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSI

Query:  PPGSITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESAR
        P GSITTW+ L Q FL KFF PAKT K+R  I +F Q   E L+EAWERFKELLR+CP HG PDWLQ+Q FYNGL  S KTI+DAAAGG L+SK    A 
Subjt:  PPGSITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESAR

Query:  TLLEEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFG--QMQASVETPAL----KRLDACIPYQNRRLKVLGEFN
         LLEEMA+N+YQWPSERSG +K A G +EID L     +     KKL      T G   +Q S+    +       D C PY +  ++ +G FN
Subjt:  TLLEEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFG--QMQASVETPAL----KRLDACIPYQNRRLKVLGEFN

A0A6J0ZYV0 uncharacterized protein LOC1104134139.6e-5946.94Show/hide
Query:  PKEPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCA-YRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSI
        P+  + +RDY  P+ Q     I    INANNFE++   IQM Q    + G   +DPNSHL +FL+IC T                     DKA+ WL S+
Subjt:  PKEPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCA-YRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSI

Query:  PPGSITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESAR
        P GSITTW+ L Q FL KFF PAKT K+R  I +F Q   E L+EAWERFKELLR+CP HG PDWLQ+Q FYNGL  S KTI+DAAAGG L+SK    A 
Subjt:  PPGSITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESAR

Query:  TLLEEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFG--QMQASVETPAL----KRLDACIPYQNRRLKVLGEFN
         LLEEMA+N+YQWPSERSG +K A G +EID L     +     KKL      T G   +Q S+    +       D C PY +  ++ +G FN
Subjt:  TLLEEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFG--QMQASVETPAL----KRLDACIPYQNRRLKVLGEFN

A0A6J1DU19 uncharacterized protein LOC1110243612.0e-5656.48Show/hide
Query:  IRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGTDKARDWLQS-----IPPGSITTWDALVQTFLKKFFLPA
        IRDY QP F     GI+  PINANN EL+ GLIQM ++  +RG++ EDPN+HL  FLD+CGT K    +       + P S+   + +VQ FL  FF PA
Subjt:  IRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGTDKARDWLQS-----IPPGSITTWDALVQTFLKKFFLPA

Query:  KTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLEEMATNSYQWPSERSGPKKI
        KT +LRT+I +F++   EQLFE WER+KELLRKCPQHG  +WLQIQ+FYNGL   T+TI+DAAAGGTLLS+T E+A  LL++MA NS+QWPSERS  KK+
Subjt:  KTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLEEMATNSYQWPSERSGPKKI

Query:  AAGVFEIDNLSARDAK
         AG++EID LS+  A+
Subjt:  AAGVFEIDNLSARDAK

A0A6P6XAQ1 Reverse transcriptase5.6e-5949.03Show/hide
Query:  NPKEPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSI
        N    + +RD+  P  Q  Q+ IV   +NANNFE++  LIQM Q   Y G++ EDPNSHL +FL+IC T                     DKA+ WLQS 
Subjt:  NPKEPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGT---------------------DKARDWLQSI

Query:  PPGSITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESAR
        PP + TTWD L + FL KFF P KT KLR  I +F QQ  E L+EAWER++EL R+CP HG PDWL +Q FYNGLT  TKT VDAAAGG L+ KT E A+
Subjt:  PPGSITTWDALVQTFLKKFFLPAKTIKLRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESAR

Query:  TLLEEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFGQ
         L+EEMA N+YQW +ER G  +  AG+ E+D L+   AK     K L R+  S+  Q
Subjt:  TLLEEMATNSYQWPSERSGPKKIAAGVFEIDNLSARDAKRCKGKKKLQREKKSTFGQ

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCAATCCTAATCCTAAGGAGCCTAAGCCTATTAGGGACTATTTTCAACCAGTCTTTCAGGAGCAACAGTCTGGGATCGTCTATGGCCCGATCAATGCTAACAATTT
TGAGCTGGAAACAGGTCTCATTCAGATGGCTCAAGATTGTGCTTACAGAGGATCATCCATGGAGGATCCAAATTCTCATTTGAAATCCTTCCTAGACATCTGTGGGACGG
ACAAAGCGAGGGACTGGTTACAATCTATTCCACCTGGGAGCATTACCACTTGGGATGCTTTGGTCCAGACATTCTTAAAGAAGTTCTTTCTTCCTGCCAAGACGATCAAG
CTGAGAACCAAAATTGGGACATTCCAGCAACAATTCGATGAGCAACTGTTCGAGGCTTGGGAGAGATTTAAAGAGCTTCTAAGGAAGTGCCCTCAGCATGGCTACCCCGA
CTGGCTCCAGATTCAGTTATTTTATAATGGTTTAACTCCTAGCACAAAAACTATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTTGAGAGTGCTAGGA
CTTTGTTAGAGGAGATGGCCACCAATAGCTATCAGTGGCCATCAGAGCGATCGGGACCCAAAAAGATTGCTGCTGGAGTGTTTGAGATCGATAACTTAAGTGCTCGGGAC
GCGAAAAGATGCAAAGGAAAGAAAAAGCTTCAAAGAGAAAAAAAGTCAACATTTGGTCAAATGCAGGCTAGCGTCGAGACGCCAGCCCTTAAGCGTCTCGACGCTTGCAT
TCCATATCAGAATAGGCGCCTAAAGGTTTTGGGGGAGTTCAATTTGGGACGTTTTGGAGCCGACAATAGGAGAAAAACAGAGGCTTTGGAGGTAGAAACAAAGGGAGCAA
GTCCAAGGGATGTCGAGACGGTGAAAAGACCAAAGCGGAGCACTCAAGACTCGAAAGCAACGTCAAAAAGATCAAGAGATGAAGCAGCCAAACAAGGAAGACTCGAGAGA
TGGCATCGAGACACCACCTCACGACGTCTCGACACTTCAGATTCAAGGGACCGCCTGAATCATTGTCAGCGTCAAGACACTGGCAACGACGTCTCGACAACGCCTCGTAG
TTTAGGTTGGTTTCTCGAGCCTTGGGCTTGGAATAGACTTGTCTATGGTCGTGTAAGCACCATTCACTGTAGTTGGAACCTTGAGATTGTTGGAGCTCGTGGTGGCTACA
ACGTTCGACATTGGAGCGGCAGACAACACATTCGACTGCAAGTGGCATTTGGGATTTTTGATCGTGGGCTGTATTCGTGTGGGTCTCATTCAACGCCGGATTCAAGTACT
GTTCACGGTGGTCTCCAATCTCTCTCCTCTATTCGGCTCCCAAAGACTCTCGATCCATTGTCCGCTGAGATAGAAAGGGCCGAGCGTGCAGTGCAAGTTCTCTTTTCAGC
TCCTAAATATCCACCACCTCCATGCATTATAAATCACGAAGAAGGAATAGGTGCCTACTTTCTTGTATCAGAGTTCTTCTCGAGCCGTTCCAAAAAGAAACCCCCGGAGG
TGCCTGAAGATCAATTCAAGTTGTGGGTCTTTTCATACTCACTGCAAGGAAAAGCTAGGAAGTGGATTTTCTCGTTGCCTCCAAGGTCGGTATCTTCGTGGAGCGAGATG
AGAAGTATTTTCCTAAAGAAGTACCACCCCATTGCGCTGCTAATAGAAGCCACCAGAAGAATTGTCAACGTGGCGCAAGCGCCTGAATAG
mRNA sequenceShow/hide mRNA sequence
ATGGCCAATCCTAATCCTAAGGAGCCTAAGCCTATTAGGGACTATTTTCAACCAGTCTTTCAGGAGCAACAGTCTGGGATCGTCTATGGCCCGATCAATGCTAACAATTT
TGAGCTGGAAACAGGTCTCATTCAGATGGCTCAAGATTGTGCTTACAGAGGATCATCCATGGAGGATCCAAATTCTCATTTGAAATCCTTCCTAGACATCTGTGGGACGG
ACAAAGCGAGGGACTGGTTACAATCTATTCCACCTGGGAGCATTACCACTTGGGATGCTTTGGTCCAGACATTCTTAAAGAAGTTCTTTCTTCCTGCCAAGACGATCAAG
CTGAGAACCAAAATTGGGACATTCCAGCAACAATTCGATGAGCAACTGTTCGAGGCTTGGGAGAGATTTAAAGAGCTTCTAAGGAAGTGCCCTCAGCATGGCTACCCCGA
CTGGCTCCAGATTCAGTTATTTTATAATGGTTTAACTCCTAGCACAAAAACTATTGTTGATGCAGCTGCAGGTGGGACTCTGTTGTCCAAGACTGTTGAGAGTGCTAGGA
CTTTGTTAGAGGAGATGGCCACCAATAGCTATCAGTGGCCATCAGAGCGATCGGGACCCAAAAAGATTGCTGCTGGAGTGTTTGAGATCGATAACTTAAGTGCTCGGGAC
GCGAAAAGATGCAAAGGAAAGAAAAAGCTTCAAAGAGAAAAAAAGTCAACATTTGGTCAAATGCAGGCTAGCGTCGAGACGCCAGCCCTTAAGCGTCTCGACGCTTGCAT
TCCATATCAGAATAGGCGCCTAAAGGTTTTGGGGGAGTTCAATTTGGGACGTTTTGGAGCCGACAATAGGAGAAAAACAGAGGCTTTGGAGGTAGAAACAAAGGGAGCAA
GTCCAAGGGATGTCGAGACGGTGAAAAGACCAAAGCGGAGCACTCAAGACTCGAAAGCAACGTCAAAAAGATCAAGAGATGAAGCAGCCAAACAAGGAAGACTCGAGAGA
TGGCATCGAGACACCACCTCACGACGTCTCGACACTTCAGATTCAAGGGACCGCCTGAATCATTGTCAGCGTCAAGACACTGGCAACGACGTCTCGACAACGCCTCGTAG
TTTAGGTTGGTTTCTCGAGCCTTGGGCTTGGAATAGACTTGTCTATGGTCGTGTAAGCACCATTCACTGTAGTTGGAACCTTGAGATTGTTGGAGCTCGTGGTGGCTACA
ACGTTCGACATTGGAGCGGCAGACAACACATTCGACTGCAAGTGGCATTTGGGATTTTTGATCGTGGGCTGTATTCGTGTGGGTCTCATTCAACGCCGGATTCAAGTACT
GTTCACGGTGGTCTCCAATCTCTCTCCTCTATTCGGCTCCCAAAGACTCTCGATCCATTGTCCGCTGAGATAGAAAGGGCCGAGCGTGCAGTGCAAGTTCTCTTTTCAGC
TCCTAAATATCCACCACCTCCATGCATTATAAATCACGAAGAAGGAATAGGTGCCTACTTTCTTGTATCAGAGTTCTTCTCGAGCCGTTCCAAAAAGAAACCCCCGGAGG
TGCCTGAAGATCAATTCAAGTTGTGGGTCTTTTCATACTCACTGCAAGGAAAAGCTAGGAAGTGGATTTTCTCGTTGCCTCCAAGGTCGGTATCTTCGTGGAGCGAGATG
AGAAGTATTTTCCTAAAGAAGTACCACCCCATTGCGCTGCTAATAGAAGCCACCAGAAGAATTGTCAACGTGGCGCAAGCGCCTGAATAG
Protein sequenceShow/hide protein sequence
MANPNPKEPKPIRDYFQPVFQEQQSGIVYGPINANNFELETGLIQMAQDCAYRGSSMEDPNSHLKSFLDICGTDKARDWLQSIPPGSITTWDALVQTFLKKFFLPAKTIK
LRTKIGTFQQQFDEQLFEAWERFKELLRKCPQHGYPDWLQIQLFYNGLTPSTKTIVDAAAGGTLLSKTVESARTLLEEMATNSYQWPSERSGPKKIAAGVFEIDNLSARD
AKRCKGKKKLQREKKSTFGQMQASVETPALKRLDACIPYQNRRLKVLGEFNLGRFGADNRRKTEALEVETKGASPRDVETVKRPKRSTQDSKATSKRSRDEAAKQGRLER
WHRDTTSRRLDTSDSRDRLNHCQRQDTGNDVSTTPRSLGWFLEPWAWNRLVYGRVSTIHCSWNLEIVGARGGYNVRHWSGRQHIRLQVAFGIFDRGLYSCGSHSTPDSST
VHGGLQSLSSIRLPKTLDPLSAEIERAERAVQVLFSAPKYPPPPCIINHEEGIGAYFLVSEFFSSRSKKKPPEVPEDQFKLWVFSYSLQGKARKWIFSLPPRSVSSWSEM
RSIFLKKYHPIALLIEATRRIVNVAQAPE