; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0015255 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0015255
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotransposon gag protein
Genome locationchr12:9246380..9250891
RNA-Seq ExpressionLag0015255
SyntenyLag0015255
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050736.1 retrotransposon gag protein [Cucumis melo var. makuwa]3.9e-3739.87Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLATI---------------------------------------------------------------------
        FVLK+LILKLA+E KI+LD+DEVAQ+N   +                                                                     
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLATI---------------------------------------------------------------------

Query:  -----------KG----KSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS
                   KG    K K +R K   K +P +++ +KF QPR+ +T+ E F ++F +   E  +      T+  ++V       EEVDN    +QRTS
Subjt:  -----------KG----KSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS

Query:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK
        VFDRIKP TT  SVFQR+SMA  +EENQC MST TR SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M +L+ K F E N D K++SR+PSR+KRK
Subjt:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK

Query:  FSVLINTEGSL
         S+ INTEGSL
Subjt:  FSVLINTEGSL

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.6e-3639.55Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------
        FVLK+LILKLA+E KIELD+DEVAQ+N                                                                         
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------

Query:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS
                      I  K K +R K   K +P + + + F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTS
Subjt:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS

Query:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK
        VFDRIKP TT  SVFQR+SMA  +E+NQC  ST  R SAF+RLS+STSKK R ST  FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+PSRMKRK
Subjt:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK

Query:  FSVLINTEGSL
         SV INTEGSL
Subjt:  FSVLINTEGSL

KAA0065608.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.1e-3651.87Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFVTSYCIDVEEVDNPERGEQ
        FVLK+LILKLA + KIELD+DEVAQ+N A +   S       P  L       + F +      ++     T    + +N   SY    EEVDN    +Q
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFVTSYCIDVEEVDNPERGEQ

Query:  RTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRM
        RT VFDRIKP TT  SVFQR+SMA  +EE QC  ST TR S F+RLS+STSKK R ST  FDRLK+TNDQ +++M +L+ K F E N D K+HSR+PSR 
Subjt:  RTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRM

Query:  KRKFSVLINTEGSL
        KRK SV INTEGSL
Subjt:  KRKFSVLINTEGSL

KAA0065984.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]9.5e-3648.9Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-----
        FVLK+LILKLA+E KI+LD+DE        IKGK                   + F QPR+ +T+ E   ++F +   E  +      T+  ++V     
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-----

Query:  --EEVDNPERGEQRTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVN
          EEVDN    +QRTSVFDRIKP TT   VFQR+SMA  +EENQC  ST TR SAF+RLS+STSKK R ST  FDRLK+ NDQ +R+M +L+ K F E N
Subjt:  --EEVDNPERGEQRTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVN

Query:  SDKKLHSRIPSRMKRKFSVLINTEGSL
         D K+HS IPS   RK SV IN EGSL
Subjt:  SDKKLHSRIPSRMKRKFSVLINTEGSL

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.6e-3639.55Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------
        FVLK+LILKLA+E KIELD+DEVAQ+N                                                                         
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------

Query:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS
                      I  K K +R K   K +P + + + F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTS
Subjt:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS

Query:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK
        VFDRIKP TT  SVFQR+SMA  +E+NQC  ST  R SAF+RLS+STSKK R ST  FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+PSRMKRK
Subjt:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK

Query:  FSVLINTEGSL
         SV INTEGSL
Subjt:  FSVLINTEGSL

TrEMBL top hitse value%identityAlignment
A0A5A7U974 Retrotransposon gag protein1.9e-3739.87Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLATI---------------------------------------------------------------------
        FVLK+LILKLA+E KI+LD+DEVAQ+N   +                                                                     
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLATI---------------------------------------------------------------------

Query:  -----------KG----KSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS
                   KG    K K +R K   K +P +++ +KF QPR+ +T+ E F ++F +   E  +      T+  ++V       EEVDN    +QRTS
Subjt:  -----------KG----KSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS

Query:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK
        VFDRIKP TT  SVFQR+SMA  +EENQC MST TR SAF+RLS+S SKK R ST  FDRLK+TNDQ +R+M +L+ K F E N D K++SR+PSR+KRK
Subjt:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK

Query:  FSVLINTEGSL
         S+ INTEGSL
Subjt:  FSVLINTEGSL

A0A5A7UI09 Retrotransposon gag protein2.7e-3639.55Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------
        FVLK+LILKLA+E KIELD+DEVAQ+N                                                                         
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------

Query:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS
                      I  K K +R K   K +P + + + F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTS
Subjt:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS

Query:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK
        VFDRIKP TT  SVFQR+SMA  +E+NQC  ST  R SAF+RLS+STSKK R ST  FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+PSRMKRK
Subjt:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK

Query:  FSVLINTEGSL
         SV INTEGSL
Subjt:  FSVLINTEGSL

A0A5A7VFA5 Ty3-gypsy retrotransposon protein4.6e-3648.9Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-----
        FVLK+LILKLA+E KI+LD+DE        IKGK                   + F QPR+ +T+ E   ++F +   E  +      T+  ++V     
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-----

Query:  --EEVDNPERGEQRTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVN
          EEVDN    +QRTSVFDRIKP TT   VFQR+SMA  +EENQC  ST TR SAF+RLS+STSKK R ST  FDRLK+ NDQ +R+M +L+ K F E N
Subjt:  --EEVDNPERGEQRTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVN

Query:  SDKKLHSRIPSRMKRKFSVLINTEGSL
         D K+HS IPS   RK SV IN EGSL
Subjt:  SDKKLHSRIPSRMKRKFSVLINTEGSL

A0A5D3CA53 Retrotransposon gag protein5.4e-3751.87Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFVTSYCIDVEEVDNPERGEQ
        FVLK+LILKLA + KIELD+DEVAQ+N A +   S       P  L       + F +      ++     T    + +N   SY    EEVDN    +Q
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFVTSYCIDVEEVDNPERGEQ

Query:  RTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRM
        RT VFDRIKP TT  SVFQR+SMA  +EE QC  ST TR S F+RLS+STSKK R ST  FDRLK+TNDQ +++M +L+ K F E N D K+HSR+PSR 
Subjt:  RTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRM

Query:  KRKFSVLINTEGSL
        KRK SV INTEGSL
Subjt:  KRKFSVLINTEGSL

A0A5D3CCI8 Retrotransposon gag protein2.7e-3639.55Show/hide
Query:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------
        FVLK+LILKLA+E KIELD+DEVAQ+N                                                                         
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSN-------------------------------------------------------------------------

Query:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS
                      I  K K +R K   K +P + + + F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTS
Subjt:  -----------LATIKGKSKYQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTS

Query:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK
        VFDRIKP TT  SVFQR+SMA  +E+NQC  ST  R SAF+RLS+STSKK R ST  FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+PSRMKRK
Subjt:  VFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRK

Query:  FSVLINTEGSL
         SV INTEGSL
Subjt:  FSVLINTEGSL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGGGAGGAAGTCCCTCCCTCCCTCCCCGTTACATAGAAGGACCTCCTCCCTCAATATATTTTCCTCCCTTATTCGTTTTTGGCGTCGAAACCACCACCATTGACTT
CCGGAACCAAAACGTCGTCGCCGCAAGAAGCCAAGATCAGGCGTTCGTGAAACGCTGCAAGAAGGAACCACCGTTCGTGAAACGTCGTTGCTGCCGTTTCGTCCTAAAGG
ACTTAATTTTAAAGCTGGCTAAGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATTAAAGGAAAGAGCAAGTATCAAAGAAAGAAG
GATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACTGTGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGAAAAA
AGAGAACTTTGTGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATCCTGAGAGGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCTTC
CTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCA
AAGAAAAGTCGATCTTCAACACCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAG
TGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCATTCTCTCCAAGTTCGAGGGTCCTT
ACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTAC
GTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTTCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTTTCC
AATGTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACCCAAGTTCGAAGGTTCACACGCGCGTCGCCACAGTTCCTTCCTCCAA
GTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCATC
GCCACGAAGGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTTCGTTGTTGCATTGTTCCCT
CTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTTTCCAATGTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCCCTTCCTCCAAGTTCGAAGGT
TCTCACCCAAGTTCGAAGGTTCACACGCGCGTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTC
ACGCGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCATCGCCACGAAGGTTC
CTTCTTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAATTCGAAG
GTTCTCACGCGCATTCCAAATTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAATTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCA
AATTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCCTCCAAGTTCAAAGGAAGGTTCTGCATCTCAAAGGTTCACACTCCTTCCTCCAAGTTTGAAAGACCCAA
GACCAATAAAGCCCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGTTCTCCTCCATTTCGGGGGTTCAGAAATTC
TACACTCTCACAAAGACAAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCTGAAGACCGAAGACTCTCTGCAATCCACA
AGTCCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAATACTCTTCAGGACATCAACACTTCTTGAAGACCAAACGCTC
TTCAAGACCTCAACACTCCTTGAAGATCAAAAACTCTTCAGGACATCAACATTTCTTGAAGACCAAACACTCTTCAAGATCTCAACACTCCTTGAAGATCAAAAACTCTT
CAGGACATCAACATTTCTTGAAGACCAAACACTCTTCAAAACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGACTGAAGACTCCTTG
AAGACTAGAAGGCTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAGGGAGGAAGTCCCTCCCTCCCTCCCCGTTACATAGAAGGACCTCCTCCCTCAATATATTTTCCTCCCTTATTCGTTTTTGGCGTCGAAACCACCACCATTGACTT
CCGGAACCAAAACGTCGTCGCCGCAAGAAGCCAAGATCAGGCGTTCGTGAAACGCTGCAAGAAGGAACCACCGTTCGTGAAACGTCGTTGCTGCCGTTTCGTCCTAAAGG
ACTTAATTTTAAAGCTGGCTAAGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATTAAAGGAAAGAGCAAGTATCAAAGAAAGAAG
GATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACTGTGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGAAAAA
AGAGAACTTTGTGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATCCTGAGAGGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCTTC
CTTCAGTATTCCAAAGAATGAGTATGGCCGCGACAAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCA
AAGAAAAGTCGATCTTCAACACCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAG
TGACAAGAAGCTTCATAGTAGAATCCCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGTTCATTCTCTCCAAGTTCGAGGGTCCTT
ACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCGTTGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTAC
GTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTTCGTTGTTGCATTGTTCCCTCTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTTTCC
AATGTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCCCTTCCTCCAAGTTCGAAGGTTCTCACCCAAGTTCGAAGGTTCACACGCGCGTCGCCACAGTTCCTTCCTCCAA
GTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCATC
GCCACGAAGGTTCCTTCTCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTCCTCCAAGTGCGAAGGATCTTATGTGGTTCGTTGTTGCATTGTTCCCT
CTTCTCTCAAGTTCGATGGTTCTCACGCAGCTTTGCTGGAGTTTCTTCTTTCCAATGTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCCCTTCCTCCAAGTTCGAAGGT
TCTCACCCAAGTTCGAAGGTTCACACGCGCGTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTC
ACGCGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCATCGCCACGAAGGTTC
CTTCTTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAATTCGAAG
GTTCTCACGCGCATTCCAAATTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCAAATTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCTTCCA
AATTCGAAGGTTCTCACGCGCATCGCCACGAAGGTTCCTTCCTCCAAGTTCAAAGGAAGGTTCTGCATCTCAAAGGTTCACACTCCTTCCTCCAAGTTTGAAAGACCCAA
GACCAATAAAGCCCAAGCCCAAGTTGTTAGGCCCAAAAGTCACCAGGGCCCATCCAGCGAGAACTCTATAAATAGAGGGGTTCTCCTCCATTTCGGGGGTTCAGAAATTC
TACACTCTCACAAAGACAAGAGTTCAGAGTTTTCAAAGCTCTCAAGCAGAACCAGAGAATTCAGAGAGACTCCACCAAGTCTGAAGACCGAAGACTCTCTGCAATCCACA
AGTCCAAGTGTTGAACACTTCTTGAAGACCAAACACTCTTCAAGACTTCAACACTCCTTGAAGATCAAATACTCTTCAGGACATCAACACTTCTTGAAGACCAAACGCTC
TTCAAGACCTCAACACTCCTTGAAGATCAAAAACTCTTCAGGACATCAACATTTCTTGAAGACCAAACACTCTTCAAGATCTCAACACTCCTTGAAGATCAAAAACTCTT
CAGGACATCAACATTTCTTGAAGACCAAACACTCTTCAAAACTTCAACACTCCTTGAAGATCAAAGACTCTTCAGGACATCAACACTTCTTGAAGACTGAAGACTCCTTG
AAGACTAGAAGGCTTTAA
Protein sequenceShow/hide protein sequence
MEGGSPSLPPRYIEGPPPSIYFPPLFVFGVETTTIDFRNQNVVAARSQDQAFVKRCKKEPPFVKRRCCRFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKYQRKK
DPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFVTSYCIDVEEVDNPERGEQRTSVFDRIKPPTTLPSVFQRMSMAATKEENQCSMSTSTRPSAFQRLSVSTS
KKSRSSTPVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRIPSRMKRKFSVLINTEGSLKFILSKFEGPYTVRYCVVPSPSSKVLRCILLRCSFSKFEGSQLYNCY
VVPPPSAKDLMWFVVALFPLLSSSMVLTQLCWSFFFPMSKVLTRFAAVPLPPSSKVLTQVRRFTRASPQFLPPSSKVHTRIATVPSSKFEGSHAHRHEGSFFQVRRFSRI
ATKVPSPSSRVLSCTTATLFLLQVRRILCGSLLHCSLFSQVRWFSRSFAGVSSFQCRRFSRASLQFPFLQVRRFSPKFEGSHARRHSSFLQVRRFTRASPQFLPPSSKVL
TRIATKVPSSKFEGSHASPRRFLLPSSKVLTHRHEGSFFQVRRFSRASPRRFLLPSSKVLTRIATKVPSSKFEGSHAHSKFEGSHAHRHEGSFFQIRRFSRASPRRFLLP
NSKVLTRIATKVPSSKFKGRFCISKVHTPSSKFERPKTNKAQAQVVRPKSHQGPSSENSINRGVLLHFGGSEILHSHKDKSSEFSKLSSRTREFRETPPSLKTEDSLQST
SPSVEHFLKTKHSSRLQHSLKIKYSSGHQHFLKTKRSSRPQHSLKIKNSSGHQHFLKTKHSSRSQHSLKIKNSSGHQHFLKTKHSSKLQHSLKIKDSSGHQHFLKTEDSL
KTRRL