; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0042183 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0042183
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotran_gag_3 domain-containing protein
Genome locationchr13:38141971..38142696
RNA-Seq ExpressionLag0042183
SyntenyLag0042183
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR029472 - Retrotransposon Copia-like, N-terminal


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0049700.1 T4.5 [Cucumis melo var. makuwa]1.7e-7264.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

KAE8645659.1 hypothetical protein Csa_020439 [Cucumis sativus]2.9e-7264.56Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVP---NPLYE
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKLF FVDG+NPCP             +SPS+ T +VP   NPLYE
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVP---NPLYE

Query:  DWIAKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILN
        DWIAKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LN
Subjt:  DWIAKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILN

Query:  GLPSDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        GLP++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  GLPSDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

XP_008448007.1 PREDICTED: uncharacterized protein LOC103490319 isoform X2 [Cucumis melo]1.7e-7264.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

XP_008448008.1 PREDICTED: uncharacterized protein LOC103490319 isoform X3 [Cucumis melo]1.7e-7264.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

XP_016900446.1 PREDICTED: uncharacterized protein LOC103490319 isoform X1 [Cucumis melo]1.7e-7264.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

TrEMBL top hitse value%identityAlignment
A0A1S3BI58 uncharacterized protein LOC103490319 isoform X28.3e-7364.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

A0A1S3BIR3 uncharacterized protein LOC103490319 isoform X38.3e-7364.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

A0A1S4DWT9 uncharacterized protein LOC103490319 isoform X18.3e-7364.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

A0A5D3CLI6 T4.58.3e-7364.1Show/hide
Query:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI
        MSS+T + S  S A+ D+ +PIFLLSNICNLIS+RLDS+NFVLWKFQ T++LKAHKL+ F+DG+NPCP  T  SSST        S      NP YEDWI
Subjt:  MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWI

Query:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP
        AKD ALMT+INATLS EALAY+VG+ SS+QVW+ L K YSS SR+N+VNLK++LQ++ K   E+IDAYIKRIKEIKD+LANVS  +NEEDLLIY LNGLP
Subjt:  AKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLP

Query:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA
        ++YNTFRTSMRTRSQ ++F ELH +  +     A
Subjt:  SDYNTFRTSMRTRSQSISFGELHAVATSSRRGAA

A0A6J1D9L6 uncharacterized protein LOC1110188928.6e-7062.56Show/hide
Query:  SSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWIA
        SS   M+S+ +  K D  +PIFLLSNICNL+SIRLDS++F+LWKFQ T++LKAHKLF F+DGS   P+  + SSS T +S   ++ +  V NP +EDWIA
Subjt:  SSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWIA

Query:  KDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLPS
        KD ALMTLINATLS+EALAY+V + +S+QVWE LEKHYSSNSR N+VNLK++LQS+ K + E+IDAY+KRIKEIKD+ ANVS+ +N+E LLIY LNGL +
Subjt:  KDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLPS

Query:  DYNTFRTSMRTRSQSISFGELHAVATS
        +YNT  TSMRTR+QS+SF ELH    S
Subjt:  DYNTFRTSMRTRSQSISFGELHAVATS

SwissProt top hitse value%identityAlignment
P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-945.3e-0828.95Show/hide
Query:  EDWIAKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYIL
        EDW   D    + I   LS + +  I+   +++ +W  LE  Y S +  N + LK +L ++  + G    +++     +  +LAN+ V + EED  I +L
Subjt:  EDWIAKDHALMTLINATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYIL

Query:  NGLPSDYNTFRTSM
        N LPS Y+   T++
Subjt:  NGLPSDYNTFRTSM

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.8e-0925.13Show/hide
Query:  RLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVP--NPLYEDWIAKDHALMTLINATLSSEALAYIVGTKSSQQVW
        +L S+N+++W  Q  +L   ++L  F+DGS P P ATI               T +VP  NP Y  W  +D  + + I   +S      +    ++ Q+W
Subjt:  RLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVP--NPLYEDWIAKDHALMTLINATLSSEALAYIVGTKSSQQVW

Query:  ETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLPSDYNTFRTSMRTRSQSISFGELH
        ETL K Y++ S  ++    T+L+ +T+                 D+LA +   ++ ++ +  +L  LP DY      +  +    S  E+H
Subjt:  ETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLPSDYNTFRTSMRTRSQSISFGELH

Arabidopsis top hitse value%identityAlignment
AT1G34070.1 CONTAINS InterPro DOMAIN/s: Retrotransposon gag protein (InterPro:IPR005162)1.3e-0623.04Show/hide
Query:  IFLLSNICNLISIRLD--SSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWIAKDHALMTLINATLSSEAL
        I+ +SNI + I + LD   SN+  W+  F +   +  +   +DG                        T    N    +W  +D  +   +  TL+ +  
Subjt:  IFLLSNICNLISIRLD--SSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWIAKDHALMTLINATLSSEAL

Query:  -AYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGE-TIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLPSDYNTFRTSMRTRSQSI
            V + +S+ +W  ++  + +N  A  + L +EL+  TK+ G+  +  Y +++K++ D L NV V V + +L++Y+LNGL   ++     ++ R    
Subjt:  -AYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGE-TIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLPSDYNTFRTSMRTRSQSI

Query:  SFGE
        SF +
Subjt:  SFGE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTTCTACTACAATGATGTCTTCAAACCCTTCCCCTGCTAAGGACGACACTCAGACTCCGATTTTCCTTCTCTCCAATATTTGCAACTTGATTTCTATTCGGCTTGA
TTCATCCAATTTTGTACTCTGGAAATTTCAATTCACTTCTCTCTTGAAGGCTCACAAACTATTTAGTTTTGTTGATGGCTCTAATCCGTGTCCTGCTGCCACAATTCGAT
CGTCCTCTACCACTGGTGATTCTTCATCTCCTTCCTCTGTTACTCAATCAGTTCCTAATCCTCTTTATGAAGACTGGATTGCGAAGGATCATGCTCTCATGACACTGATC
AATGCTACACTATCGTCAGAAGCTCTTGCATACATTGTCGGCACCAAGTCTTCTCAACAGGTTTGGGAGACTCTTGAAAAACATTATTCTTCTAATTCTCGTGCCAATAT
TGTTAATTTAAAGACTGAGTTGCAATCTGTTACTAAAAATTCTGGGGAGACTATTGATGCTTATATCAAAAGGATTAAGGAAATCAAGGATCGATTGGCTAATGTTTCTG
TTGTTGTTAATGAGGAAGATTTGCTCATCTACATCTTGAATGGCTTACCTAGTGATTATAATACTTTTCGGACCTCTATGAGGACTCGCTCACAATCTATCTCTTTTGGA
GAATTACATGCTGTCGCAACGTCATCGCGACGCGGAGCGGCCGACATCCTTTCTTGGGTTTTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAGTTCTACTACAATGATGTCTTCAAACCCTTCCCCTGCTAAGGACGACACTCAGACTCCGATTTTCCTTCTCTCCAATATTTGCAACTTGATTTCTATTCGGCTTGA
TTCATCCAATTTTGTACTCTGGAAATTTCAATTCACTTCTCTCTTGAAGGCTCACAAACTATTTAGTTTTGTTGATGGCTCTAATCCGTGTCCTGCTGCCACAATTCGAT
CGTCCTCTACCACTGGTGATTCTTCATCTCCTTCCTCTGTTACTCAATCAGTTCCTAATCCTCTTTATGAAGACTGGATTGCGAAGGATCATGCTCTCATGACACTGATC
AATGCTACACTATCGTCAGAAGCTCTTGCATACATTGTCGGCACCAAGTCTTCTCAACAGGTTTGGGAGACTCTTGAAAAACATTATTCTTCTAATTCTCGTGCCAATAT
TGTTAATTTAAAGACTGAGTTGCAATCTGTTACTAAAAATTCTGGGGAGACTATTGATGCTTATATCAAAAGGATTAAGGAAATCAAGGATCGATTGGCTAATGTTTCTG
TTGTTGTTAATGAGGAAGATTTGCTCATCTACATCTTGAATGGCTTACCTAGTGATTATAATACTTTTCGGACCTCTATGAGGACTCGCTCACAATCTATCTCTTTTGGA
GAATTACATGCTGTCGCAACGTCATCGCGACGCGGAGCGGCCGACATCCTTTCTTGGGTTTTTTAG
Protein sequenceShow/hide protein sequence
MSSTTMMSSNPSPAKDDTQTPIFLLSNICNLISIRLDSSNFVLWKFQFTSLLKAHKLFSFVDGSNPCPAATIRSSSTTGDSSSPSSVTQSVPNPLYEDWIAKDHALMTLI
NATLSSEALAYIVGTKSSQQVWETLEKHYSSNSRANIVNLKTELQSVTKNSGETIDAYIKRIKEIKDRLANVSVVVNEEDLLIYILNGLPSDYNTFRTSMRTRSQSISFG
ELHAVATSSRRGAADILSWVF