; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035683 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035683
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr3:27056070..27059577
RNA-Seq ExpressionLag0035683
SyntenyLag0035683
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]7.8e-16954.35Show/hide
Query:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q
        +++ NPLF            E   +VV+VMM +  T E  M EM+  IN LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK++V+++ P Q
Subjt:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q

Query:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL----
         SVSVASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENA        ++L R+F+    
Subjt:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL----

Query:  -NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPN
         N F  TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELAT AHDM+LSIA+R  +DFL+  
Subjt:  -NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPN

Query:  MRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDP
         R +     +T       + ESM+V  T  KS SK K   +  +H        TL+E QKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP
Subjt:  MRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDP

Query:  KYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ--------
         YCKYHRVI H VE+CFVLK LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++        
Subjt:  KYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ--------

Query:  --LVMLNKSFSKTFHKKKKENLA--TSYSINV-------EEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVST
          +  L +SF +   ++  E  A  T+  + V       EE+DNS + +QRTSVFD IKPLTTR SVF R+SM   +EENQC   T  + SAF+RLS+S 
Subjt:  --LVMLNKSFSKTFHKKKKENLA--TSYSINV-------EEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVST

Query:  SKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS
        SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HS + SRMKRK SV INTEGSL VK   II TNP NEG ++  D++KS
Subjt:  SKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS

KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]2.1e-16148.73Show/hide
Query:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q
        +++ NPLF            E   +VV+VMM +  T E  M EM+  IN LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK++VQ++ P Q
Subjt:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q

Query:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFY
         SVSVASLS+QQLQDMI N IRAQYGGP Q S +YSKSYTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENA        ++L R+F+    
Subjt:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFY

Query:  STRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEG
              + FE     + +GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+T AHDMELSIA+   +DFL+   ++  
Subjt:  STRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEG

Query:  RND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK
        +N+           + ESM+V  T  KS SK K   +  ++        TL+E QKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCK
Subjt:  RND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK

Query:  YHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-------------------------------------------------------------
        YHRVI HPVE+CFVLK LILKLA E KIELD+DEVAQ+N                                                             
Subjt:  YHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-------------------------------------------------------------

Query:  -----------------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINV----------EE
                                  I  K K +R K   K +P + + K F QP++ + L +   ++F   H +K   +   ++ ++          EE
Subjt:  -----------------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINV----------EE

Query:  IDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK
        +DNS + +QRT VF RIKPLT R SVF R+SM   EEENQC  ST  R SAF+RLS+ST KK R STS FDRLK+ NDQ +R+M +L+ K F E N D K
Subjt:  IDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK

Query:  LHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS
        +HS + SRMKRK SV INTEGSL VK   II TNP NEG ++  D++KS
Subjt:  LHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS

KAA0045440.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.4e-15448.71Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE+   AESS    VKN DKGK+++Q+  P+   S+ASLS+QQLQ+MIT+ I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELV
         QYGG  Q   LYSK YTKRIDNLR P GYQPPKFQQ+  K      +    +  +   PES+D+WE+LER+FLNRFYSTRR +SM ELTNT+Q+KGELV
Subjt:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELV

Query:  VNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNT
        ++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELAT AHDMEL++A++  +DFL+P +R +    ++T       I+ESMVV+ 
Subjt:  VNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNT

Query:  TLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLA
        T  KS SK K     R+ +G      TLKE Q+K+YPFP++D+ DMLEQLLE QLI+L +CKR E+  KVDDP YCKYHRVI HP+E+CFVLK LILKLA
Subjt:  TLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLA

Query:  MERKIELDLDEVAQSN---------------------------------------LATIKEKSK---------------HQRKKDPKKLQ----------
         E+KIELD+DEVAQ+N                                       + TI  ++K               HQ+ + P  +Q          
Subjt:  MERKIELDLDEVAQSN---------------------------------------LATIKEKSK---------------HQRKKDPKKLQ----------

Query:  -------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL--------ATSYSIN-----VEEIDNSKKSEQRTSVFDRIKPLTTR
                K +R+KK             F Q ++ + L +   ++F +   E +        A+   +N      +E++NS +  QRTSVFDRIKP TTR
Subjt:  -------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL--------ATSYSIN-----VEEIDNSKKSEQRTSVFDRIKPLTTR

Query:  PSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEG
         SVF R+SM   EE+NQC     TR S F+RLS+STSKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS I SRMKRK  V INTEG
Subjt:  PSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEG

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]4.1e-17853.29Show/hide
Query:  KKINKSADRSSKKINKSVDRSSNRSSKK--INKGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQI
        ++I  S D    ++ K++    ++  K   + K  +  D+      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ I
Subjt:  KKINKSADRSSKKINKSVDRSSNRSSKK--INKGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQI

Query:  ENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHI
        E++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ QYGGP Q   LY K YTKRIDNLR P GYQPPKFQQFDGKGNPKQH+
Subjt:  ENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHI

Query:  AHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELS
        AHF++TCE A                          EPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS
Subjt:  AHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELS

Query:  SVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--
        +VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSIA+R  +DFL+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G  
Subjt:  SVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--

Query:  AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKE
            TLKE Q+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK LILKLA E+KIELD+DEVAQ+N A I+ 
Subjt:  AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKE

Query:  KSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYSINVEEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVA
         S   + KD   LQ +R         RS     P++++ +        H      +  +Y  + +E++NS +  QRTSVFDRIKP TTR SVF R+S+  
Subjt:  KSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYSINVEEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVA

Query:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTN
         EEENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS + SRMKRK  V INTEGSL VK   II TN
Subjt:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTN

Query:  PANEGSDQDHDKDKS
        P NEG +Q   ++KS
Subjt:  PANEGSDQDHDKDKS

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.7e-16850.49Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL
         QYGGP Q   LYSK YTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNM
        NRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSI +R  +DFL+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKE Q+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------
        YCKYHRVI HPVE+CFVLK LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD        
Subjt:  YCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------

Query:  -------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINVEEIDNSKKS----
               P  +Q                  K +R+KK             F Q ++ + L +   ++F   H ++   + T ++ ++ E++N+  S    
Subjt:  -------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINVEEIDNSKKS----

Query:  ------EQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL
               QRTSVFDRIKP TTR SVF R+SM   EEENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+
Subjt:  ------EQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL

Query:  HSSILSRMKRKFSVLINTE
        HS + SRMKRK  V INT+
Subjt:  HSSILSRMKRKFSVLINTE

TrEMBL top hitse value%identityAlignment
A0A5A7SRE2 Ty3-gypsy retrotransposon protein3.8e-16954.35Show/hide
Query:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q
        +++ NPLF            E   +VV+VMM +  T E  M EM+  IN LMK +EE+D +I  LK Q++    +ESSQT VVK  DKGK++V+++ P Q
Subjt:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q

Query:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL----
         SVSVASLS+QQLQDMI N IRAQYGGP Q S +YSK YTKRIDNLR P+GYQP KFQQFDGKGNPKQHI HFVETCENA        ++L R+F+    
Subjt:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFL----

Query:  -NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPN
         N F  TRR VSM ELTNT QRKGE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEELAT AHDM+LSIA+R  +DFL+  
Subjt:  -NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPN

Query:  MRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDP
         R +     +T       + ESM+V  T  KS SK K   +  +H        TL+E QKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDDP
Subjt:  MRKEGRNDEET-------IEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDP

Query:  KYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ--------
         YCKYHRVI H VE+CFVLK LI KLA E KIELD+DEVAQ+N   +   S          QRK        +P  ++ ++K     SQ ++        
Subjt:  KYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKEKSK--------HQRKK-------DPKKLQPKRKRSKKFSQPQQ--------

Query:  --LVMLNKSFSKTFHKKKKENLA--TSYSINV-------EEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVST
          +  L +SF +   ++  E  A  T+  + V       EE+DNS + +QRTSVFD IKPLTTR SVF R+SM   +EENQC   T  + SAF+RLS+S 
Subjt:  --LVMLNKSFSKTFHKKKKENLA--TSYSINV-------EEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVST

Query:  SKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS
        SKK R ST  FDRLK+TNDQ +R+M  L+ K F E N D K+HS + SRMKRK SV INTEGSL VK   II TNP NEG ++  D++KS
Subjt:  SKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS

A0A5A7SUW1 Retrotransposon gag protein1.0e-16148.73Show/hide
Query:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q
        +++ NPLF            E   +VV+VMM +  T E  M EM+  IN LMK  EE+D +I  LK Q++     ESSQT VVK  DKGK++VQ++ P Q
Subjt:  LVKYNPLF------------EPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHP-Q

Query:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFY
         SVSVASLS+QQLQDMI N IRAQYGGP Q S +YSKSYTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENA        ++L R+F+    
Subjt:  CSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFY

Query:  STRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEG
              + FE     + +GE V++YINRWRA+SLDCKD+LTELS+VEMC QGMHWELLYIL+GIKPRTFEEL+T AHDMELSIA+   +DFL+   ++  
Subjt:  STRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEG

Query:  RND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK
        +N+           + ESM+V  T  KS SK K   +  ++        TL+E QKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCK
Subjt:  RND--------EETIEESMVVNTTLPKSSSKEKRQTNGAHH-------LTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCK

Query:  YHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-------------------------------------------------------------
        YHRVI HPVE+CFVLK LILKLA E KIELD+DEVAQ+N                                                             
Subjt:  YHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-------------------------------------------------------------

Query:  -----------------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINV----------EE
                                  I  K K +R K   K +P + + K F QP++ + L +   ++F   H +K   +   ++ ++          EE
Subjt:  -----------------------LATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINV----------EE

Query:  IDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK
        +DNS + +QRT VF RIKPLT R SVF R+SM   EEENQC  ST  R SAF+RLS+ST KK R STS FDRLK+ NDQ +R+M +L+ K F E N D K
Subjt:  IDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKK

Query:  LHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS
        +HS + SRMKRK SV INTEGSL VK   II TNP NEG ++  D++KS
Subjt:  LHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKS

A0A5A7TQK6 Retrotransposon gag protein2.6e-15448.71Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE+   AESS    VKN DKGK+++Q+  P+   S+ASLS+QQLQ+MIT+ I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELV
         QYGG  Q   LYSK YTKRIDNLR P GYQPPKFQQ+  K      +    +  +   PES+D+WE+LER+FLNRFYSTRR +SM ELTNT+Q+KGELV
Subjt:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELV

Query:  VNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNT
        ++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTF+ELAT AHDMEL++A++  +DFL+P +R +    ++T       I+ESMVV+ 
Subjt:  VNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNT

Query:  TLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLA
        T  KS SK K     R+ +G      TLKE Q+K+YPFP++D+ DMLEQLLE QLI+L +CKR E+  KVDDP YCKYHRVI HP+E+CFVLK LILKLA
Subjt:  TLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLA

Query:  MERKIELDLDEVAQSN---------------------------------------LATIKEKSK---------------HQRKKDPKKLQ----------
         E+KIELD+DEVAQ+N                                       + TI  ++K               HQ+ + P  +Q          
Subjt:  MERKIELDLDEVAQSN---------------------------------------LATIKEKSK---------------HQRKKDPKKLQ----------

Query:  -------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL--------ATSYSIN-----VEEIDNSKKSEQRTSVFDRIKPLTTR
                K +R+KK             F Q ++ + L +   ++F +   E +        A+   +N      +E++NS +  QRTSVFDRIKP TTR
Subjt:  -------PKRKRSKK-------------FSQPQQLVMLNKSFSKTFHKKKKENL--------ATSYSIN-----VEEIDNSKKSEQRTSVFDRIKPLTTR

Query:  PSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEG
         SVF R+SM   EE+NQC     TR S F+RLS+STSKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+HS I SRMKRK  V INTEG
Subjt:  PSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEG

A0A5A7URH1 Ty3-gypsy retrotransposon protein2.0e-17853.29Show/hide
Query:  KKINKSADRSSKKINKSVDRSSNRSSKK--INKGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQI
        ++I  S D    ++ K++    ++  K   + K  +  D+      P  E  P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ I
Subjt:  KKINKSADRSSKKINKSVDRSSNRSSKK--INKGAYTNDKFLVKYNPLFE--PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQI

Query:  ENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHI
        E++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+ QYGGP Q   LY K YTKRIDNLR P GYQPPKFQQFDGKGNPKQH+
Subjt:  ENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHI

Query:  AHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELS
        AHF++TCE A                          EPES+D+WE+LER+FLNRFYSTR  VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS
Subjt:  AHFVETCENA--------------------------EPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELS

Query:  SVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--
        +VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSIA+R  +DFL+P  R +    ++T       I+ESMVV+ T  KS SK K     R+ +G  
Subjt:  SVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--

Query:  AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKE
            TLKE Q+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK LILKLA E+KIELD+DEVAQ+N A I+ 
Subjt:  AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKE

Query:  KSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYSINVEEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVA
         S   + KD   LQ +R         RS     P++++ +        H      +  +Y  + +E++NS +  QRTSVFDRIKP TTR SVF R+S+  
Subjt:  KSKHQRKKDPKKLQPKRK--------RSKKFSQPQQLVMLNKSFSKTFHKKKKENLATSYSINVEEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVA

Query:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTN
         EEENQC     TR S  +RLS+ST KK R STS FDRLK+TNDQ +R+M + + K F E N D K+HS + SRMKRK  V INTEGSL VK   II TN
Subjt:  TEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTN

Query:  PANEGSDQDHDKDKS
        P NEG +Q   ++KS
Subjt:  PANEGSDQDHDKDKS

A0A5D3BX77 Retrotransposon gag protein8.4e-16950.49Show/hide
Query:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR
        P  ++++VM+T   T E RM E+++ +N LMK +EE+D +I  LK+ IE++  AESS    VKN DKGK+++Q+  PQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIR

Query:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL
         QYGGP Q   LYSK YTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPES+D+WE+LER+FL
Subjt:  AQYGGPTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESVDSWEELEREFL

Query:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNM
        NRFYSTRR VSM ELTNT+Q+KGELV++YINRWRA+SLDCKDRLTELS+VEMC QGMHW LLYIL+GIKPRTFEELAT AHDMELSI +R  +DFL+P  
Subjt:  NRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCKDRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNM

Query:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK
        R +     +T       I+ESMVV+ T  KS SK K     R+ +G      TLKE Q+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDDP 
Subjt:  RKEGRNDEET-------IEESMVVNTTLPKSSSKEK-----RQTNG--AHHLTLKEIQKKIYPFPDADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPK

Query:  YCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------
        YCKYHRVI HPVE+CFVLK LILKLA E+KIEL++DEVAQ+N                                   + TI  ++K    KD        
Subjt:  YCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSN-----------------------------------LATIKEKSKHQRKKD--------

Query:  -------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINVEEIDNSKKS----
               P  +Q                  K +R+KK             F Q ++ + L +   ++F   H ++   + T ++ ++ E++N+  S    
Subjt:  -------PKKLQ-----------------PKRKRSKK-------------FSQPQQLVMLNKSFSKTF---HKKKKENLATSYSINVEEIDNSKKS----

Query:  ------EQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL
               QRTSVFDRIKP TTR SVF R+SM   EEENQC     TR S F+RLS+S SKK+R STS FDRLK+TNDQ +R+M +L+ K F E N D K+
Subjt:  ------EQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKL

Query:  HSSILSRMKRKFSVLINTE
        HS + SRMKRK  V INT+
Subjt:  HSSILSRMKRKFSVLINTE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAATTGCAGCAACATTTGACATCTCATTTGAAGGGGGGATTCAGATTCGGAGATTCAGATCAGAGAACTCAGAGTCGAGAGAATTCTGTCAGAGTCCAGAGTCATCA
GAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCTAGATTCAGAATTCAACCAACTCAAGACTCGGAAGATCATCAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAG
CAGACCGATCATCCAAGAAGATCAACAAGTCAGTAGACCGATCATCAAACCGATCATCCAAGAAGATCAACAAGGGGGCATATACCAATGACAAGTTTCTTGTTAAGTAT
AACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAA
GGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAA
AGTCTATAGTGCAAGATGATCATCCACAATGTTCTGTTTCGGTTGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAATTGTATCAGAGCTCAGTACGGTGGA
CCTACTCAAGATTCCCTCTTGTACTCCAAATCTTATACTAAGAGGATTGATAATTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGG
CAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACA
GCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGTTGGAGAGCTATGAGTCTAGATTGCAAA
GATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAAC
TTGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACTTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCA
TGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAATACAGAAGAAAATCTACCCTTTCCCT
GATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTG
CAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGTACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAG
CTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAA
CAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCGACTTCCTACTCCATCAACGTAGAAGAAATTGACAATTCTAAGAA
GAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCTAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGTCGCGACAGAGGAAGAAAATCAATGTTCGA
TGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAA
CCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCTGTCACGTATGAAGAGGAAGTTTTCTGTTCT
CATAAATACGGAAGGTTCCTTGAAGGTGAAGTCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAA
mRNA sequenceShow/hide mRNA sequence
ATGAAATTGCAGCAACATTTGACATCTCATTTGAAGGGGGGATTCAGATTCGGAGATTCAGATCAGAGAACTCAGAGTCGAGAGAATTCTGTCAGAGTCCAGAGTCATCA
GAAGTCAGAGAGTCTAGAGAATTCAGAAGATCCTAGATTCAGAATTCAACCAACTCAAGACTCGGAAGATCATCAAGCCAATCGACCGATCAAGAAGATCAACAAGTCAG
CAGACCGATCATCCAAGAAGATCAACAAGTCAGTAGACCGATCATCAAACCGATCATCCAAGAAGATCAACAAGGGGGCATATACCAATGACAAGTTTCTTGTTAAGTAT
AACCCTCTGTTTGAACCTGATTCTGACGTAGTGACTGTCATGATGACTGAGACAAGAACTATGGAAGAAAGAATGACTGAGATGCAAGAACACATCAACAACTTGATGAA
GGCGATTGAAGAAAAAGATTCTCAAATTGAGCAACTAAAGAGCCAAATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCGTAAAAAATCACGACAAAGGAA
AGTCTATAGTGCAAGATGATCATCCACAATGTTCTGTTTCGGTTGCTTCACTATCCATCCAACAGCTCCAAGATATGATCACAAATTGTATCAGAGCTCAGTACGGTGGA
CCTACTCAAGATTCCCTCTTGTACTCCAAATCTTATACTAAGAGGATTGATAATTTGAGAACGCCAATCGGGTATCAGCCACCGAAATTTCAACAATTTGATGGAAAGGG
CAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGAACCTGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTCTACA
GCACTAGACGAACCGTTAGCATGTTCGAGCTCACAAACACTAAACAACGAAAAGGTGAACTCGTTGTTAACTATATAAATCGTTGGAGAGCTATGAGTCTAGATTGCAAA
GATCGCCTCACTGAACTCTCTTCCGTTGAGATGTGCATTCAAGGCATGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAAC
TTGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACTTTCTCCTCCCTAACATGAGAAAAGAAGGAAGGAACGATGAAGAGACTATAGAAGAATCCA
TGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCGACAAACTAATGGAGCGCATCACTTAACTTTAAAGGAAATACAGAAGAAAATCTACCCTTTCCCT
GATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATAAAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTG
CAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAAGTACTTAATTCTAAAGCTGGCTATGGAAAGAAAAATTGAGCTCGACCTTGATGAAGTAG
CTCAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAACATCAAAGAAAGAAGGATCCTAAGAAACTTCAACCCAAGAGAAAGAGAAGTAAAAAGTTCTCTCAACCTCAA
CAACTGGTGATGTTGAATAAATCATTCTCCAAAACTTTCCACAAAAAGAAAAAAGAGAACCTTGCGACTTCCTACTCCATCAACGTAGAAGAAATTGACAATTCTAAGAA
GAGTGAACAAAGGACTTCCGTCTTTGATCGCATCAAGCCTCTAACTACTCGTCCTTCGGTATTCCATAGAATGAGTATGGTCGCGACAGAGGAAGAAAATCAATGTTCGA
TGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTCCACATCGAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAA
CCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAACAGTGACAAGAAGCTTCATAGTAGCATCCTGTCACGTATGAAGAGGAAGTTTTCTGTTCT
CATAAATACGGAAGGTTCCTTGAAGGTGAAGTCAAATCTCATTATCTTGACCAATCCTGCAAATGAAGGATCTGATCAAGACCATGACAAAGATAAGAGCTTTTAA
Protein sequenceShow/hide protein sequence
MKLQQHLTSHLKGGFRFGDSDQRTQSRENSVRVQSHQKSESLENSEDPRFRIQPTQDSEDHQANRPIKKINKSADRSSKKINKSVDRSSNRSSKKINKGAYTNDKFLVKY
NPLFEPDSDVVTVMMTETRTMEERMTEMQEHINNLMKAIEEKDSQIEQLKSQIENQHIAESSQTQVVKNHDKGKSIVQDDHPQCSVSVASLSIQQLQDMITNCIRAQYGG
PTQDSLLYSKSYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESVDSWEELEREFLNRFYSTRRTVSMFELTNTKQRKGELVVNYINRWRAMSLDCK
DRLTELSSVEMCIQGMHWELLYILKGIKPRTFEELATCAHDMELSIASRENQDFLLPNMRKEGRNDEETIEESMVVNTTLPKSSSKEKRQTNGAHHLTLKEIQKKIYPFP
DADIPDMLEQLLEAQLIKLPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKYLILKLAMERKIELDLDEVAQSNLATIKEKSKHQRKKDPKKLQPKRKRSKKFSQPQ
QLVMLNKSFSKTFHKKKKENLATSYSINVEEIDNSKKSEQRTSVFDRIKPLTTRPSVFHRMSMVATEEENQCSMSTSTRPSAFQRLSVSTSKKSRSSTSVFDRLKVTNDQ
PKRKMNNLELKLFDEVNSDKKLHSSILSRMKRKFSVLINTEGSLKVKSNLIILTNPANEGSDQDHDKDKSF