; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0040747 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0040747
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy1-copia retrotransposon protein
Genome locationchr13:7880139..7883659
RNA-Seq ExpressionLag0040747
SyntenyLag0040747
Gene Ontology termsGO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0052404.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]1.9e-7546.49Show/hide
Query:  KSREPDGAQANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPS
        +  E    QAN+VE      A +V ANL++NK+DWILDT AS  FC+NR L H+++DT DG+C++M NSAT GV+ K KV+LKLTSGK LSL+++LYV S
Subjt:  KSREPDGAQANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPS

Query:  LRRNLVSGSLLNRAWLRVVFEADKVVITKNSDFV------------------------------------------------------------------
        L RNLVS SLLNRA L++V E DKVV+TKN +FV                                                                  
Subjt:  LRRNLVSGSLLNRAWLRVVFEADKVVITKNSDFV------------------------------------------------------------------

Query:  -----------------------------------------GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQ
                                                 GGV SWKSAKQTCIARST+ESEFIAL+LA QEAEW+++LL DVPLWG SVPVS+ CDSQ
Subjt:  -----------------------------------------GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQ

Query:  AAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF
        AAI IAK++VYNGK RHI LRH VVKQLLK G ISLE++R  +NL + LTKGLTR+V+L+SS     + F
Subjt:  AAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF

KAF3628742.1 hypothetical protein FXO38_28084 [Capsicum annuum]6.3e-4747.16Show/hide
Query:  NVVEAVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWL
        N++ A+VV+ NLV NK++W+LDTG   +FC N+ LFH+F+++ + +CIYMGN  TA VL                                         
Subjt:  NVVEAVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWL

Query:  RVVFEADKVVITKNSDFVGGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-PVSLHCDSQAAIGIAKSSVYNGKRRHIHLRH
                           G  SWKS+KQTCIA ST++SEFI ++L GQEAEWLR+LL DVPLWG  V PVSLHCDSQA I IAK+SVYN K+RHI +R+
Subjt:  RVVFEADKVVITKNSDFVGGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-PVSLHCDSQAAIGIAKSSVYNGKRRHIHLRH

Query:  GVVKQLLKSGIISLEYMRSGRNLTDSLTK
         +VKQLLK G+ISLEY+RS RNL D + K
Subjt:  GVVKQLLKSGIISLEYMRSGRNLTDSLTK

PHU13801.1 hypothetical protein BC332_15006 [Capsicum chinense]1.0e-5246.97Show/hide
Query:  QANVVE---------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNL
        QAN++E         A+  E NL  N  +W +D GA+ + C+N+ LF  F   +  K IYM NSA A V    KV LK+TSGK+L+LN++LYVP LRRNL
Subjt:  QANVVE---------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNL

Query:  VSGSLLNRAWLRVVFEADKVVI----------------TKNSDFV----GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-
        +S SLL++   + V  ++K++I                T  S +V    GG  SWKS+KQTCIA+ST+ESEFIAL+ AG+EAEWL++ L+D+  W   V 
Subjt:  VSGSLLNRAWLRVVFEADKVVI----------------TKNSDFV----GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-

Query:  PVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTR
        PV +HCDSQAAI  A S +YNGK  HI  RH  V++LL S II++++++S  N+ D LTKGL+R
Subjt:  PVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTR

TYK05191.1 ty1-copia retrotransposon protein [Cucumis melo var. makuwa]5.3e-6244.57Show/hide
Query:  QANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSG
        QAN+ E      A +VEANL++NK+DWILDT AS +FC+NR L H+++DT D +C++MGNSATAGV+GK KV+LKLTSGK LSL+++LYVPSLRRNLVSG
Subjt:  QANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSG

Query:  SLLNRAWLRVVFEADKVVITKNSDFVG------GVTSWKSAKQTCIARST---------------------------------------------IESEF
        SLLNRA L++V E DKVV+TKN DFVG      G+    +      A S+                                             +ES+F
Subjt:  SLLNRAWLRVVFEADKVVITKNSDFVG------GVTSWKSAKQTCIARST---------------------------------------------IESEF

Query:  -------------IALELAGQEAEWLRS-------------------------------LLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHL
                       LEL   +    R+                                ++DVPLWG SVPVS+ CDSQAAI  AK+SVYNGK RHI L
Subjt:  -------------IALELAGQEAEWLRS-------------------------------LLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHL

Query:  RHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF
        RH VVKQLLK G ISLE++RS +NL D LTKGLTR+++L+SS     + F
Subjt:  RHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF

XP_009784430.1 PREDICTED: uncharacterized protein LOC104232850, partial [Nicotiana sylvestris]1.5e-4839.6Show/hide
Query:  LVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWLRVVFEADKVVI
        +V NK++W+LD GAS + C+N+ LFH+F+++ DG+C+YM NS T  V+GK K+LLKLTS K L+LN++LYVPSLRRNLVS +LLN+A L+++FE DKVVI
Subjt:  LVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWLRVVFEADKVVI

Query:  TKNSDFVG--------------------------------------------------------------------------------------------
        ++  DFVG                                                                                            
Subjt:  TKNSDFVG--------------------------------------------------------------------------------------------

Query:  ----------------------GVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-PVSLHCDSQAAIGIAKSSVYNGKRRHIH
                              G  SWKS+KQTCIA ST++ EFIALEL GQEAEWLR+ L DVPLW     PVSL+CDSQ A GIA++S+YN KRR+I 
Subjt:  ----------------------GVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-PVSLHCDSQAAIGIAKSSVYNGKRRHIH

Query:  LRH
        +RH
Subjt:  LRH

TrEMBL top hitse value%identityAlignment
A0A1U7XD95 uncharacterized protein LOC1042328507.2e-4939.6Show/hide
Query:  LVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWLRVVFEADKVVI
        +V NK++W+LD GAS + C+N+ LFH+F+++ DG+C+YM NS T  V+GK K+LLKLTS K L+LN++LYVPSLRRNLVS +LLN+A L+++FE DKVVI
Subjt:  LVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWLRVVFEADKVVI

Query:  TKNSDFVG--------------------------------------------------------------------------------------------
        ++  DFVG                                                                                            
Subjt:  TKNSDFVG--------------------------------------------------------------------------------------------

Query:  ----------------------GVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-PVSLHCDSQAAIGIAKSSVYNGKRRHIH
                              G  SWKS+KQTCIA ST++ EFIALEL GQEAEWLR+ L DVPLW     PVSL+CDSQ A GIA++S+YN KRR+I 
Subjt:  ----------------------GVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-PVSLHCDSQAAIGIAKSSVYNGKRRHIH

Query:  LRH
        +RH
Subjt:  LRH

A0A2G3C502 Uncharacterized protein4.8e-5346.97Show/hide
Query:  QANVVE---------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNL
        QAN++E         A+  E NL  N  +W +D GA+ + C+N+ LF  F   +  K IYM NSA A V    KV LK+TSGK+L+LN++LYVP LRRNL
Subjt:  QANVVE---------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNL

Query:  VSGSLLNRAWLRVVFEADKVVI----------------TKNSDFV----GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-
        +S SLL++   + V  ++K++I                T  S +V    GG  SWKS+KQTCIA+ST+ESEFIAL+ AG+EAEWL++ L+D+  W   V 
Subjt:  VSGSLLNRAWLRVVFEADKVVI----------------TKNSDFV----GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASV-

Query:  PVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTR
        PV +HCDSQAAI  A S +YNGK  HI  RH  V++LL S II++++++S  N+ D LTKGL+R
Subjt:  PVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTR

A0A2N9FSN7 Uncharacterized protein6.8e-4734.58Show/hide
Query:  QANVVEAVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRA
        ++ V+ AVV + NLV N ++W++DTGA+ + CS++ LF  +++  DG+ +Y+G++ T  V GK KV LKLTSGK L+L+ +L+VP +RRNLVSGSLLN+A
Subjt:  QANVVEAVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRA

Query:  WLRVVFEADKVVITKNSDFV--------------------------------------------------------------------------------
         +++VF+ADK+V+T+N DFV                                                                                
Subjt:  WLRVVFEADKVVITKNSDFV--------------------------------------------------------------------------------

Query:  --------------------------------------------------------------GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRS
                                                                      GG  SWKS+KQTC ARST+ESEF+ALE+AG EAEWLR+
Subjt:  --------------------------------------------------------------GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRS

Query:  LLQDVPLW-GASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVV
        LL D+PLW   +  +SLHCDSQAAIG AK+ +YNGK+RH+ LRH +V
Subjt:  LLQDVPLW-GASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVV

A0A5A7UAK1 Ty1-copia retrotransposon protein9.1e-7646.49Show/hide
Query:  KSREPDGAQANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPS
        +  E    QAN+VE      A +V ANL++NK+DWILDT AS  FC+NR L H+++DT DG+C++M NSAT GV+ K KV+LKLTSGK LSL+++LYV S
Subjt:  KSREPDGAQANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPS

Query:  LRRNLVSGSLLNRAWLRVVFEADKVVITKNSDFV------------------------------------------------------------------
        L RNLVS SLLNRA L++V E DKVV+TKN +FV                                                                  
Subjt:  LRRNLVSGSLLNRAWLRVVFEADKVVITKNSDFV------------------------------------------------------------------

Query:  -----------------------------------------GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQ
                                                 GGV SWKSAKQTCIARST+ESEFIAL+LA QEAEW+++LL DVPLWG SVPVS+ CDSQ
Subjt:  -----------------------------------------GGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQ

Query:  AAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF
        AAI IAK++VYNGK RHI LRH VVKQLLK G ISLE++R  +NL + LTKGLTR+V+L+SS     + F
Subjt:  AAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF

A0A5D3BZU0 Ty1-copia retrotransposon protein2.6e-6244.57Show/hide
Query:  QANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSG
        QAN+ E      A +VEANL++NK+DWILDT AS +FC+NR L H+++DT D +C++MGNSATAGV+GK KV+LKLTSGK LSL+++LYVPSLRRNLVSG
Subjt:  QANVVE------AVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSG

Query:  SLLNRAWLRVVFEADKVVITKNSDFVG------GVTSWKSAKQTCIARST---------------------------------------------IESEF
        SLLNRA L++V E DKVV+TKN DFVG      G+    +      A S+                                             +ES+F
Subjt:  SLLNRAWLRVVFEADKVVITKNSDFVG------GVTSWKSAKQTCIARST---------------------------------------------IESEF

Query:  -------------IALELAGQEAEWLRS-------------------------------LLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHL
                       LEL   +    R+                                ++DVPLWG SVPVS+ CDSQAAI  AK+SVYNGK RHI L
Subjt:  -------------IALELAGQEAEWLRS-------------------------------LLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHL

Query:  RHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF
        RH VVKQLLK G ISLE++RS +NL D LTKGLTR+++L+SS     + F
Subjt:  RHGVVKQLLKSGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWF

SwissProt top hitse value%identityAlignment
P04146 Copia protein1.6e-0829.91Show/hide
Query:  WKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLT
        W + +Q  +A S+ E+E++AL  A +EA WL+ LL  + +     P+ ++ D+Q  I IA +   + + +HI +++   ++ +++ +I LEY+ +   L 
Subjt:  WKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYMRSGRNLT

Query:  DSLTKGL
        D  TK L
Subjt:  DSLTKGL

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-947.3e-1435.34Show/hide
Query:  FVGGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYM
        F GG  SW+S  Q C+A ST E+E+IA    G+E  WL+  LQ++ L      V  +CDSQ+AI ++K+S+Y+ + +HI +R+  +++++    + +  +
Subjt:  FVGGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLKSGIISLEYM

Query:  RSGRNLTDSLTKGLTR
         +  N  D LTK + R
Subjt:  RSGRNLTDSLTKGLTR

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-942.5e-0629.63Show/hide
Query:  KSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWLRVVFEADKVVITKNS
        +S+W++DT AS +    R LF  +    D   + MGN++ + + G   + +K   G  L L D+ +VP LR NL+SG  L+R      F   K  +TK S
Subjt:  KSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNRAWLRVVFEADKVVITKNS

Query:  DFVG-GV---TSWKSAKQTCIARSTIESEFIALEL
          +  GV   T +++  + C        + I+++L
Subjt:  DFVG-GV---TSWKSAKQTCIARSTIESEFIALEL

Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACCATCAAATCCAGGGAGCCTGATGGTGCACAAGCCAACGTTGTTGAAGCAGTGGTGGTGGAAGCCAACTTGGTGAAGAATAAATCAGACTGGATTCTTGACACCGG
TGCCTCAGGAAACTTCTGCTCAAATCGAAGTTTGTTCCACGAGTTCCAAGACACTATCGATGGCAAATGCATATACATGGGAAACTCAGCCACTGCTGGAGTACTTGGGA
AATGGAAGGTTCTCTTGAAACTTACTTCTGGAAAAATTTTATCTTTAAATGATATGTTATATGTTCCTTCATTGCGTAGGAATCTGGTGTCTGGAAGTCTGTTGAACAGA
GCATGGCTTAGAGTTGTGTTTGAAGCTGACAAGGTGGTCATCACTAAAAATAGTGACTTTGTCGGAGGAGTTACATCATGGAAATCTGCAAAGCAGACGTGCATTGCGCG
ATCCACCATAGAGTCAGAGTTTATAGCTCTTGAGCTTGCAGGGCAAGAGGCGGAGTGGTTGAGAAGTTTACTTCAAGACGTACCACTGTGGGGGGCGTCTGTTCCAGTCT
CCTTGCACTGTGATTCACAAGCAGCCATAGGAATTGCCAAAAGTAGTGTATATAATGGCAAGAGAAGGCATATACACCTGAGACATGGAGTTGTGAAACAGTTGTTGAAA
AGTGGAATTATCTCCTTGGAGTATATGAGGTCTGGGAGGAATTTGACAGATTCTCTCACCAAAGGCCTAACAAGGAGAGTAATTCTAGAGTCCTCAACAACTCAAGATTT
TGAGTGGTTTACTGCGGTAAACTTGATGAAGCATGACGTGATAGTGAAGGTGTGGCCGCCTTCTATGAAAGAGTGTAATGGATCTCTTTCTAGAGCTTTCACGATAACCC
AAGCGTCTACGTCCACGTGTCAATCCGGCGTCCACACACTCCAACAGTTGACCAGTTCGTTTATTGACCGAGTTTGGTGCCTTTTTGCAAAGGCAGAGGAGTTGTTATGC
GTGGACTCCTCCATGGCTGACACGTGGCAGTTGGACCCTGTTTTGTGGAAGACGGGGGAGAAACGACCGAGCATGCAAGGGATGGTTGTCGACGCGATGGATGAAGGTCG
TTGGCACGACGGAGATGGTTCGTCGGCGTGGTATGAGAGACTCCGTTTACGTCATTTCCGAGGACAGAGATGTGTTTTGGATGGTGAACGAATTTCAAAACGGTACATAC
AGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGGTATACACCCCGTGGGGACAGTTTCAAAGGGATCGTCGCGTAGGGGGAGAAACGACCGAGCATGCAAG
AGATGGTTGTCGACGCGGCGGCTGAAGGTCGTTGACACGGCGGAGATGGTTCGTCGGCGTGGTGTGAGAGGTAGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGACCATCAAATCCAGGGAGCCTGATGGTGCACAAGCCAACGTTGTTGAAGCAGTGGTGGTGGAAGCCAACTTGGTGAAGAATAAATCAGACTGGATTCTTGACACCGG
TGCCTCAGGAAACTTCTGCTCAAATCGAAGTTTGTTCCACGAGTTCCAAGACACTATCGATGGCAAATGCATATACATGGGAAACTCAGCCACTGCTGGAGTACTTGGGA
AATGGAAGGTTCTCTTGAAACTTACTTCTGGAAAAATTTTATCTTTAAATGATATGTTATATGTTCCTTCATTGCGTAGGAATCTGGTGTCTGGAAGTCTGTTGAACAGA
GCATGGCTTAGAGTTGTGTTTGAAGCTGACAAGGTGGTCATCACTAAAAATAGTGACTTTGTCGGAGGAGTTACATCATGGAAATCTGCAAAGCAGACGTGCATTGCGCG
ATCCACCATAGAGTCAGAGTTTATAGCTCTTGAGCTTGCAGGGCAAGAGGCGGAGTGGTTGAGAAGTTTACTTCAAGACGTACCACTGTGGGGGGCGTCTGTTCCAGTCT
CCTTGCACTGTGATTCACAAGCAGCCATAGGAATTGCCAAAAGTAGTGTATATAATGGCAAGAGAAGGCATATACACCTGAGACATGGAGTTGTGAAACAGTTGTTGAAA
AGTGGAATTATCTCCTTGGAGTATATGAGGTCTGGGAGGAATTTGACAGATTCTCTCACCAAAGGCCTAACAAGGAGAGTAATTCTAGAGTCCTCAACAACTCAAGATTT
TGAGTGGTTTACTGCGGTAAACTTGATGAAGCATGACGTGATAGTGAAGGTGTGGCCGCCTTCTATGAAAGAGTGTAATGGATCTCTTTCTAGAGCTTTCACGATAACCC
AAGCGTCTACGTCCACGTGTCAATCCGGCGTCCACACACTCCAACAGTTGACCAGTTCGTTTATTGACCGAGTTTGGTGCCTTTTTGCAAAGGCAGAGGAGTTGTTATGC
GTGGACTCCTCCATGGCTGACACGTGGCAGTTGGACCCTGTTTTGTGGAAGACGGGGGAGAAACGACCGAGCATGCAAGGGATGGTTGTCGACGCGATGGATGAAGGTCG
TTGGCACGACGGAGATGGTTCGTCGGCGTGGTATGAGAGACTCCGTTTACGTCATTTCCGAGGACAGAGATGTGTTTTGGATGGTGAACGAATTTCAAAACGGTACATAC
AGGGGTGCTGTGTAGTCGTTGACATCCCGAGTGTCAATACTGGTATACACCCCGTGGGGACAGTTTCAAAGGGATCGTCGCGTAGGGGGAGAAACGACCGAGCATGCAAG
AGATGGTTGTCGACGCGGCGGCTGAAGGTCGTTGACACGGCGGAGATGGTTCGTCGGCGTGGTGTGAGAGGTAGGTAG
Protein sequenceShow/hide protein sequence
MTIKSREPDGAQANVVEAVVVEANLVKNKSDWILDTGASGNFCSNRSLFHEFQDTIDGKCIYMGNSATAGVLGKWKVLLKLTSGKILSLNDMLYVPSLRRNLVSGSLLNR
AWLRVVFEADKVVITKNSDFVGGVTSWKSAKQTCIARSTIESEFIALELAGQEAEWLRSLLQDVPLWGASVPVSLHCDSQAAIGIAKSSVYNGKRRHIHLRHGVVKQLLK
SGIISLEYMRSGRNLTDSLTKGLTRRVILESSTTQDFEWFTAVNLMKHDVIVKVWPPSMKECNGSLSRAFTITQASTSTCQSGVHTLQQLTSSFIDRVWCLFAKAEELLC
VDSSMADTWQLDPVLWKTGEKRPSMQGMVVDAMDEGRWHDGDGSSAWYERLRLRHFRGQRCVLDGERISKRYIQGCCVVVDIPSVNTGIHPVGTVSKGSSRRGRNDRACK
RWLSTRRLKVVDTAEMVRRRGVRGR