; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0020035 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0020035
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr5:47542773..47547524
RNA-Seq ExpressionLag0020035
SyntenyLag0020035
Gene Ontology termsGO:0006310 - DNA recombination (biological process)
GO:0015074 - DNA integration (biological process)
GO:0071897 - DNA biosynthetic process (biological process)
GO:0090502 - RNA phosphodiester bond hydrolysis, endonucleolytic (biological process)
GO:0003723 - RNA binding (molecular function)
GO:0003887 - DNA-directed DNA polymerase activity (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]6.5e-15365.37Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T+  T E+RMAE+++ +N LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK+I+Q+  PQ S S+ASLS+QQLQ+MI N I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FLNRFYSTRR
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +RKE +  +
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGH
         T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YCKYHRVI H
Subjt:  ET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGH

Query:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        PVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]1.1e-0635.06Show/hide
Query:  KEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRMKRKFSILINTEGSLKVKPNL
        K+ NQ S  +STR SAFQRL+ S  K +S S T     S F RL V+  + ++K +  +      V  D+++ S+ PSRMKRK  + +NTEGSLKVK + 
Subjt:  KEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRMKRKFSILINTEGSLKVKPNL

Query:  IILTNPANEGSDQDHDKMRAFKCKSSLSQEPKLHD------APSPHELKSGGNT
        ++ T P +   ++  D+     C     +E   HD        +P  L+ GG +
Subjt:  IILTNPANEGSDQDHDKMRAFKCKSSLSQEPKLHD------APSPHELKSGGNT

KAA0047477.1 uncharacterized protein E6C27_scaffold498G00940 [Cucumis melo var. makuwa]2.7e-15164.46Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS   +VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPES+D+WE+LER+FLNRFYSTRR
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI
Subjt:  ET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.6e-17255.83Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR 
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI
Subjt:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQTKRKRSKKFSQPRK-----PDLRLRSHQASNYSSFSIPKNEYGRDK
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R  +     PR      P+  +    A + +S     N YG  K
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQTKRKRSKKFSQPRK-----PDLRLRSHQASNYSSFSIPKNEYGRDK

Query:  -------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDK
                                             EENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M +   K F E N D 
Subjt:  -------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDK

Query:  KLHSSIPSRMKRKFSILINTEGSLKVKPNLIILTNPANEGSDQ
        K+HS +PSRMKRK  + INTEGSL VKP  II TNP NEG +Q
Subjt:  KLHSSIPSRMKRKFSILINTEGSLKVKPNLIILTNPANEGSDQ

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.7e-15748.95Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI
Subjt:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------P
         HPVE+CFVLK+LILKLA+E KIEL++DEVAQ+N                                   + TI  ++K    KD               P
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------P

Query:  KKLQ-----------------TKRKRSKKFSQPRK---------------------PDLRLRSH-------QASNYSSFSIPKNEYGRDK----------
          +Q                  K +R+KK   P+                      P   L  H          + +S     N YG  K          
Subjt:  KKLQ-----------------TKRKRSKKFSQPRK---------------------PDLRLRSH-------QASNYSSFSIPKNEYGRDK----------

Query:  ---------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRM
                                   EENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L  K F E N D K+HS +PSRM
Subjt:  ---------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRM

Query:  KRKFSILINTE
        KRK  + INT+
Subjt:  KRKFSILINTE

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus]6.1e-15166.82Show/hide
Query:  MEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSASVASLSIQQLQDMITNCIRAQ------------
        +E  MAEM+  IN LMK ++E+D +I  LK Q++ +  AESSQT VVK  DKGK++VQ++ P Q S SVASLS+QQLQDMITN IRAQ            
Subjt:  MEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-QCSASVASLSIQQLQDMITNCIRAQ------------

Query:  -----------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFE
                    P+GYQPPKFQQFDGKGNPKQH+AHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPES++SWE+LE+EFLNRFYSTRRTVSM E
Subjt:  -----------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFE

Query:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------D
        LTNTKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIASR  +D L+P ++K+ +         
Subjt:  LTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRN-------D

Query:  EETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERC
        + T +ESMVVNTT L  S  KE R   + +G+    LTLKERQ+K+YPFPD+DI DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+C
Subjt:  EETIEESMVVNTT-LPKSSSKEKR---QTNGA--HHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERC

Query:  FVLKDLILKLAKEGKIELDLDEVAQSNLATI
        FVLK+LIL+LA+E +IELDL+EVAQ+N A +
Subjt:  FVLKDLILKLAKEGKIELDLDEVAQSNLATI

TrEMBL top hitse value%identityAlignment
A0A5A7TZU9 Ribonuclease H3.2e-15365.37Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T+  T E+RMAE+++ +N LMKA+EE+D +I  LK+ IE++  AESS T  +KN +KGK+I+Q+  PQ S S+ASLS+QQLQ+MI N I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWYTDLEPES+DSWE+LER+FLNRFYSTRR
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELT TKQRKGE V++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +RKE +  +
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGH
         T        +E+MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+DP YCKYHRVI H
Subjt:  ET-------IEESMVVNTT----LPKSSSKEKRQTNG-AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGH

Query:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI
        PVE+CFVLK+LILKLA + KIEL+LD+VAQ+N A +
Subjt:  PVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI

A0A5A7TZU9 Ribonuclease H5.3e-0735.06Show/hide
Query:  KEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRMKRKFSILINTEGSLKVKPNL
        K+ NQ S  +STR SAFQRL+ S  K +S S T     S F RL V+  + ++K +  +      V  D+++ S+ PSRMKRK  + +NTEGSLKVK + 
Subjt:  KEENQCSMSTSTRPSAFQRLSVSTSK-KSRSST-----SVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRMKRKFSILINTEGSLKVKPNL

Query:  IILTNPANEGSDQDHDKMRAFKCKSSLSQEPKLHD------APSPHELKSGGNT
        ++ T P +   ++  D+     C     +E   HD        +P  L+ GG +
Subjt:  IILTNPANEGSDQDHDKMRAFKCKSSLSQEPKLHD------APSPHELKSGGNT

A0A5A7TZU9 Ribonuclease H1.3e-15164.46Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS   +VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPES+D+WE+LER+FLNRFYSTRR
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+Q+KGELV+NYINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I+ESMVV+ T  KS SK K       H        TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+ EKVDDP YCKYHRVI
Subjt:  ET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

A0A5A7URH1 Ty3-gypsy retrotransposon protein8.0e-17355.83Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE+LER+FLNRFYSTR 
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI
Subjt:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQTKRKRSKKFSQPRK-----PDLRLRSHQASNYSSFSIPKNEYGRDK
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A I+  S   + KD   LQ +R  +     PR      P+  +    A + +S     N YG  K
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQTKRKRSKKFSQPRK-----PDLRLRSHQASNYSSFSIPKNEYGRDK

Query:  -------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDK
                                             EENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M +   K F E N D 
Subjt:  -------------------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDK

Query:  KLHSSIPSRMKRKFSILINTEGSLKVKPNLIILTNPANEGSDQ
        K+HS +PSRMKRK  + INTEGSL VKP  II TNP NEG +Q
Subjt:  KLHSSIPSRMKRKFSILINTEGSLKVKPNLIILTNPANEGSDQ

A0A5D3BX77 Retrotransposon gag protein2.8e-15748.95Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T   T E RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFD Y DLEPES+D+WE+LER+FLNRFYSTRR
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  R +     
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP YCKYHRVI
Subjt:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------P
         HPVE+CFVLK+LILKLA+E KIEL++DEVAQ+N                                   + TI  ++K    KD               P
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN-----------------------------------LATIKGKSKHQRKKD---------------P

Query:  KKLQ-----------------TKRKRSKKFSQPRK---------------------PDLRLRSH-------QASNYSSFSIPKNEYGRDK----------
          +Q                  K +R+KK   P+                      P   L  H          + +S     N YG  K          
Subjt:  KKLQ-----------------TKRKRSKKFSQPRK---------------------PDLRLRSH-------QASNYSSFSIPKNEYGRDK----------

Query:  ---------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRM
                                   EENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L  K F E N D K+HS +PSRM
Subjt:  ---------------------------EENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRM

Query:  KRKFSILINTE
        KRK  + INT+
Subjt:  KRKFSILINTE

A0A5D3D4X3 Ty3-gypsy retrotransposon protein1.2e-14964.01Show/hide
Query:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------
        M+T+    E+RMAE+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ Q      
Subjt:  MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQ------

Query:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR
                          P GYQPPKFQQFDGKGNPKQH+AHF+ETCE AGTRGDLLVKQFVRTLKGNAFDWY DLEPES+D+WE+LER+FLNRFYSTRR
Subjt:  -----------------TPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRR

Query:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE
         VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  R +    +
Subjt:  TVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDE

Query:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI
        +T       I+ESMVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI
Subjt:  ET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVI

Query:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK
         HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N   I+
Subjt:  GHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGACTGAGACAAGAACTATGGAAGAAAGAATGGCTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAA
GAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGTCTATAGTGCAAGATGATCATCCACAATGTTCTGCTT
CGGTCGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAAACGCCAATTGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAG
GGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGATCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTT
TGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGC
TCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAG
ATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGC
TAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGT
CTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAA
CTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGT
GGAAAGATGTTTCGTCCTAAAGGACCTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTTGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAA
AGAGCAAGCATCAAAGAAAGAAGGATCCTAAGAAACTTCAAACCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGAAAACCGGACCTTCGTCTTCGATCGCATCAA
GCCTCCAACTACTCATCCTTCAGTATTCCAAAGAATGAGTATGGCCGTGACAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCT
AAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGATGTTGAAACTTT
TCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTATTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAAT
CTCATTATCTTGACCAATCCTGCAAATGAAGGATCTGATCAAGACCATGACAAAATGAGAGCTTTTAAATGTAAAAGCTCCTTGTCGCAAGAGCCTAAACTGCATGATGC
TCCTAGCCCACACGAGCTTAAAAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCG
AAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAA
TTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCGTCCCCCAAGTTTGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCTCCAAGTTCAAAGGTT
CTCGCGCGCTTCGATGCAGTTCCTTCCTCCAAGTTTGAAGGTTTTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTGCCACGCGCTTCGCTATGCAATTC
CTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGTTCCTTCCTCTAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCT
TCCCCCAAGTTCGAAGGTTCTCACACGCTTCATGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGTTGCATTTCCTTCCCCCCAAAGTCGAAGGTTCTCAC
ACGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTTACGCGCTTCGTGCAGTTCCTTCCT
CCAAATTCGAAGGTTCTCCCGCATTTCGTTGCATTTCCTTCCCCCTAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGC
GCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCAAGGGTCCTCATGCTATGCTCGGCTACATT
GCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGGATTCGTGCTGAAACGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCC
CAACTCAAGGAACATGTCCGTGCACTCGGGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGG
CGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTC
AAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCATGGCG
GCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGCCCTTGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACTCAAGGAACATGTCCGTGCACTCGT
GCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGACGACACAAGTCCAAGGAACAT
GTCCCAACTCAAGGAACATGCGCGACGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTC
CAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCA
CTCGTGCTGAAAGGCGTGACGGCGACACAAGTCCAAGGACACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAG
GAACATGTCCCAACTCAAGGAACATGACCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGT
GCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTTCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATG
TCCCAACTCAAGGAACATGTCCTTATACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGATGACTGAGACAAGAACTATGGAAGAAAGAATGGCTGAGATGCAAGAACACATCAACAACTTGATGAAGGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAA
GAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAAAGTCTATAGTGCAAGATGATCATCCACAATGTTCTGCTT
CGGTCGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAAACGCCAATTGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAG
GGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGATCTACTAGTCAAACAGTTCGTTCGAACACTTAAAGGAAATGCTTT
TGACTGGTACACTGATCTAGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACAGCACTAGACGAACCGTTAGCATGTTCGAGC
TCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGCTGGAGAGCTATGAGTCTAGATTGCAAAGATCGCCTCACTGAACTCTCTTCCGTTGAG
ATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGC
TAGTCGAGAAAACCAAGACCTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCATGGTTGTAAACACAACCCTTCCCAAGTCGT
CTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGACATCCCTGATATGTTGGAACAA
CTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGT
GGAAAGATGTTTCGTCCTAAAGGACCTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTTGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAA
AGAGCAAGCATCAAAGAAAGAAGGATCCTAAGAAACTTCAAACCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGAAAACCGGACCTTCGTCTTCGATCGCATCAA
GCCTCCAACTACTCATCCTTCAGTATTCCAAAGAATGAGTATGGCCGTGACAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCT
AAGTGTCTCCACATCAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGATGTTGAAACTTT
TCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCCGTCACGTATGAAGAGGAAGTTTTCTATTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAAT
CTCATTATCTTGACCAATCCTGCAAATGAAGGATCTGATCAAGACCATGACAAAATGAGAGCTTTTAAATGTAAAAGCTCCTTGTCGCAAGAGCCTAAACTGCATGATGC
TCCTAGCCCACACGAGCTTAAAAGTGGGGGCAACACAGCGAATGGAAGTTGCTTCCTCCAAGTTCGAAGGTTCCCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAATTCG
AAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAA
TTCCTTCCTCCAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCGTCCCCCAAGTTTGAAGGTTCTCACGCGCTTCACTGCAGTTCCTTCCTCCAAGTTCAAAGGTT
CTCGCGCGCTTCGATGCAGTTCCTTCCTCCAAGTTTGAAGGTTTTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTCGAAGGTTGCCACGCGCTTCGCTATGCAATTC
CTTCCCCCAAGTTCGAAGGTTCTCACGCGCTTCGTGCAGTTCCTTCCTCCAAATTCGAAGTTCCTTCCTCTAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCT
TCCCCCAAGTTCGAAGGTTCTCACACGCTTCATGCAGTTCCTTCCTCCAAATTCGAAGGTTCTCACGCACTTCGTTGCATTTCCTTCCCCCCAAAGTCGAAGGTTCTCAC
ACGCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTCTGCAATTCCTTCCCCCAAGTTCGAAGGTTCTTACGCGCTTCGTGCAGTTCCTTCCT
CCAAATTCGAAGGTTCTCCCGCATTTCGTTGCATTTCCTTCCCCCTAAATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGC
GCTTCGCTGCAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCAAAGTTCAAAGGTTCAAGGGTCCTCATGCTATGCTCGGCTACATT
GCTGCGCTACTTCCTAAAGTCCAAAGACGTCAATTGTCCCTGGATTCGTGCTGAAACGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCC
CAACTCAAGGAACATGTCCGTGCACTCGGGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGG
CGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTC
AAGGAACATGTCCGTGCACTCGTGCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCATGGCG
GCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGCCCTTGCGTGGCGGCGACACAAGTCCAAGGAACATGTCTCAACTCAAGGAACATGTCCGTGCACTCGT
GCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGACGACACAAGTCCAAGGAACAT
GTCCCAACTCAAGGAACATGCGCGACGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTC
CAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCA
CTCGTGCTGAAAGGCGTGACGGCGACACAAGTCCAAGGACACATGTCCCAACTCAAGGAACACGTCCTTGCAACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAG
GAACATGTCCCAACTCAAGGAACATGACCGTGCACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTCCGTGCACTCGT
GCTGGAAGGCGCGGCGGCGGCACAAGTCCAAGGAACATGTCCCAACTCAAGGAACATGTTCGTGCACTCGTGCTGAAAGGCGTGGCGGCGGCACAAGTCCAAGGAACATG
TCCCAACTCAAGGAACATGTCCTTATACTCGTGCTGAAAGGCGTGGCGGCGACACAAGTCCAAGGAACATGTCCCAACTCAAGGAACACGTCCTTGCACTCGTGCTGA
Protein sequenceShow/hide protein sequence
MMTETRTMEERMAEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSASVASLSIQQLQDMITNCIRAQTPIGYQPPKFQQFDGK
GNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVE
MCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKERQKKIYPFPDADIPDMLEQ
LLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQTKRKRSKKFSQPRKPDLRLRSHQ
ASNYSSFSIPKNEYGRDKEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLMLKLFDEVNSDKKLHSSIPSRMKRKFSILINTEGSLKVKPN
LIILTNPANEGSDQDHDKMRAFKCKSSLSQEPKLHDAPSPHELKSGGNTANGSCFLQVRRFPRASLQFLPPKFEGSHALRCSSFLTVRRFSRASLQFLPPSSKVLTRFVQ
FLPPNSKVLTRFAAVPSPKFEGSHALHCSSFLQVQRFSRASMQFLPPSLKVFTRFVAVPSSKFEGCHALRYAIPSPKFEGSHALRAVPSSKFEVPSSKFEGSHALRSAIP
SPKFEGSHTLHAVPSSKFEGSHALRCISFPPKSKVLTRFAAVPSSKFEGSHALRSAIPSPKFEGSYALRAVPSSKFEGSPAFRCISFPLNSKVLTRFAAVPSPQVRRFSR
ASLQFLPPSSKVLTRFAAVPSSKFKGSRVLMLCSATLLRYFLKSKDVNCPWIRAETRGGDTSPRNMSQLKEHVPTQGTCPCTRAERRGGDTSPRNMSQLKEHVLATRAER
RGGDTSPRNMSQLKEHVLALVLKGVAATQVQGTCPNSRNMSVHSCWKARRRHKSKEHVPTQGTCPCTRAERHGGGTSPRNMSQLKEHALAWRRHKSKEHVSTQGTCPCTR
AERRGGDTSPRNMSQLKEHVLATRAERRGDDTSPRNMSQLKEHARRRHKSKEHVPTQGTCPCTRAERRGGGTSPRNMSQLKEHVLATRAERRGGDTSPRNMSQLKEHVRA
LVLKGVTATQVQGHMSQLKEHVLATRAERRGGDTSPRNMSQLKEHDRALVLKGVAATQVQGTCPNSRNMSVHSCWKARRRHKSKEHVPTQGTCSCTRAERRGGGTSPRNM
SQLKEHVLILVLKGVAATQVQGTCPNSRNTSLHSC