CuGenDBv2

Lag0032456 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0032456
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Ty3-gypsy retrotransposon protein
Genome location: chr11:32854081..32858070
RNA-Seq Expression: Lag0032456
Synteny: Lag0032456
Gene Ontology terms: NA
InterPro domains: IPR005162 - Retrotransposon gag domain
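
The genome location field can be split into a chromosome name and a coordinate pair for downstream use. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption; the record does not state the convention). The function names `parse_location` and `span_length` are hypothetical and not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return chrom, start, end


def span_length(start: int, end: int) -> int:
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:32854081..32858070")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the interval for this gene spans 3990 positions on chr11.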


Homology
GenBank top hits (e value, % identity, alignment)
KAA0032121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 3.8e-148; % identity: 49.71)
Query:  KTSSMVTVMNKSYMSRTARCCFNELTLQEDKTSIVVGQETTLQGAYINDK--FLVKYNPLFE---PDSN---------VVTVMMTETKTMEERMDEMQEH
        K +S  + ++ SY+    +      ++QE +   V+ ++ +L+    + K   +++ NPLF    P SN         VV+VMM +  T E  M EM+  
Subjt:  KTSSMVTVMNKSYMSRTARCCFNELTLQEDKTSIVVGQETTLQGAYINDK--FLVKYNPLFE---PDSN---------VVTVMMTETKTMEERMDEMQEH

Query:  INTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-QCSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLR
        IN LMK ++E+D +IA LK Q++    +ESSQT V+K  DKGK  V+++QP Q S SVASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR
Subjt:  INTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-QCSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLR

Query:  MSIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESIDSWEELEKEFL-----NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTE
        M +GYQP KFQQFDGKGNPKQHI HFVETCENA        ++L ++F+     N F  TRR VSM ELTNT QRKGE V++YI RWRA+SLDCKD+ TE
Subjt:  MSIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESIDSWEELEKEFL-----NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTE

Query:  LSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IKEFMVVNTTLPKSSSKEKRQINGAYH--
        LSAVEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T       + E M+V  T  KS SK K   +   H  
Subjt:  LSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IKEFMVVNTTLPKSSSKEKRQINGAYH--

Query:  -----LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI
              TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDD  YCKYHRVI H V++CFVLK+LI KLA E KIELD+DEV ++N   +
Subjt:  -----LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI

Query:  KEKS--------------------------KHQRKKNLKKLQPKRQRS------------KKFSQPQQLVMLNKSLSKTFHKKEKENLATSYCIDVEEVD
           S                          + Q+K      Q K + S            + F +     +L  +   T    E +N   SY    EE+D
Subjt:  KEKS--------------------------KHQRKKNLKKLQPKRQRS------------KKFSQPQQLVMLNKSLSKTFHKKEKENLATSYCIDVEEVD

Query:  NSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF
        NS + +QRTSVFD IKP TTR SVFQR+SMA  +EENQC   T  + SAF+RLS+S SKK RPST+ FDRLK+T+ Q +R+M +L+ K F
Subjt:  NSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa] (e value: 4.1e-163; % identity: 55.16)
Query:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR
        P  N+++VM+T   T E RM E+++ +N LMK ++E+D +IA LK+ IE++  AESS    +KN DKGK  +Q+ QPQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL
         QYGG  Q   LY KPYTKRIDNLRM  GYQPPKFQQFDGKGNPKQH+AHF++TCE A                          EPESID+WE+LE++FL
Subjt:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL

Query:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFY+TR  VSM ELTNT+Q+KGELV++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  
Subjt:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK
        R +    ++T       IKE MVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDD  
Subjt:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK

Query:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTFHKKEKEN-LATS
        YCKYHRVI HPV++CFVLK+LILKLA E KIELD+DEV ++N A I+  S            P + + + F Q ++ + L + L ++F + + E  L  +
Subjt:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTFHKKEKEN-LATS

Query:  YC-----IDVE-------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSK
         C     ++V+       EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK RPST  FDRLK+T+ Q +
Subjt:  YC-----IDVE-------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSK

Query:  RKMDSLEMKLF
        R+M S + K F
Subjt:  RKMDSLEMKLF

TYK03695.1 retrotransposon gag protein [Cucumis melo var. makuwa] (e value: 1.6e-159; % identity: 49.49)
Query:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR
        P  N+++VM+T   T E RM E+++ +N LMK ++E+D +IA LK+ IE++  AESS    +KN DKGK  +Q+ QPQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL
         QYGG  Q   LYSKPYTKRIDNLRM  GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPESID+WE+LE++FL
Subjt:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL

Query:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFY+TRR VSM ELTNT+Q+KGELV++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  
Subjt:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK
        R +     +T       IKE MVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDD  
Subjt:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK

Query:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSN----------------------------------------------------------
        YCKYHRVI HPV++CFVLK+LILKLA E KIEL++DEV ++N                                                          
Subjt:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSN----------------------------------------------------------

Query:  ----------------------LATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTF---HKKEKENLATSYCIDVEEVDNSKKS----
                                +I  K K +R K +   +P + + + F Q ++ + L + L ++F   H +E   + T +   + EV+N+  S    
Subjt:  ----------------------LATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTF---HKKEKENLATSYCIDVEEVDNSKKS----

Query:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF
               QRTSVFDRIKP TTR SVFQR+SMA  EEENQC     TR S F+RLS+S SKK+RPST  FDRLK+T+ Q +R+M SL+ K F
Subjt:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF

XP_031735972.1 uncharacterized protein LOC116401693 [Cucumis sativus] (e value: 1.7e-145; % identity: 61.24)
Query:  LVKYNPLF------------EPDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-Q
        ++K NPL+            E   +V++VMM +   +E  M EM+  IN LMK + E+D +IA LK Q++ +  AESSQT V+K  DKGK  VQ++QP Q
Subjt:  LVKYNPLF------------EPDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-Q

Query:  CSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------
         S SVASLS+QQLQDMITN IRAQYGG +Q S +YSKPYTKRIDNLRM +GYQPPKFQQFDGKGNPKQH+AHFVETCENA                    
Subjt:  CSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------

Query:  ------EPESIDSWEELEKEFLNRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELAT
              EPESI+SWE+LEKEFLNRFY+TRRTVSM ELTNTKQRKGE V++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTFEELAT
Subjt:  ------EPESIDSWEELEKEFLNRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELAT

Query:  RAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIKEFMVVNTTLPKSSSKEKRQI------NGAYHLTLKERQKKIYPFPDADIPDMLEQLLEA
        RAHDMELSIASR  +D L+P ++K+ +         + T KE MVVNTT  K S  ++ ++      +    LTLKERQ+K+YPFPD+DI DMLEQLLE 
Subjt:  RAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIKEFMVVNTTLPKSSSKEKRQI------NGAYHLTLKERQKKIYPFPDADIPDMLEQLLEA

Query:  QLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI
        QLI+LP+CKRPE+  KVDD  YCKYHRVI HPV++CFVLK+LIL+LA E +IELDL+EV ++N A +
Subjt:  QLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI

XP_031742199.1 uncharacterized protein LOC105435721 [Cucumis sativus] (e value: 1.7e-145; % identity: 61.24)
Query:  LVKYNPLF------------EPDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-Q
        ++K NPL+            E   +V++VMM +   +E  M EM+  IN LMK + E+D +IA LK Q++ +  AESSQT V+K  DKGK  VQ++QP Q
Subjt:  LVKYNPLF------------EPDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-Q

Query:  CSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------
         S SVASLS+QQLQDMITN IRAQYGG +Q S +YSKPYTKRIDNLRM +GYQPPKFQQFDGKGNPKQH+AHFVETCENA                    
Subjt:  CSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------

Query:  ------EPESIDSWEELEKEFLNRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELAT
              EPESI+SWE+LEKEFLNRFY+TRRTVSM ELTNTKQRKGE V++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTFEELAT
Subjt:  ------EPESIDSWEELEKEFLNRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELAT

Query:  RAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIKEFMVVNTTLPKSSSKEKRQI------NGAYHLTLKERQKKIYPFPDADIPDMLEQLLEA
        RAHDMELSIASR  +D L+P ++K+ +         + T KE MVVNTT  K S  ++ ++      +    LTLKERQ+K+YPFPD+DI DMLEQLLE 
Subjt:  RAHDMELSIASRENQDLLLPNMRKEGRN-------DEETIKEFMVVNTTLPKSSSKEKRQI------NGAYHLTLKERQKKIYPFPDADIPDMLEQLLEA

Query:  QLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI
        QLI+LP+CKRPE+  KVDD  YCKYHRVI HPV++CFVLK+LIL+LA E +IELDL+EV ++N A +
Subjt:  QLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SRE2 Ty3-gypsy retrotransposon protein (e value: 1.8e-148; % identity: 49.71)
Query:  KTSSMVTVMNKSYMSRTARCCFNELTLQEDKTSIVVGQETTLQGAYINDK--FLVKYNPLFE---PDSN---------VVTVMMTETKTMEERMDEMQEH
        K +S  + ++ SY+    +      ++QE +   V+ ++ +L+    + K   +++ NPLF    P SN         VV+VMM +  T E  M EM+  
Subjt:  KTSSMVTVMNKSYMSRTARCCFNELTLQEDKTSIVVGQETTLQGAYINDK--FLVKYNPLFE---PDSN---------VVTVMMTETKTMEERMDEMQEH

Query:  INTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-QCSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLR
        IN LMK ++E+D +IA LK Q++    +ESSQT V+K  DKGK  V+++QP Q S SVASLS+QQLQDMI N IRAQYGG  Q S +YSKPYTKRIDNLR
Subjt:  INTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQP-QCSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLR

Query:  MSIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESIDSWEELEKEFL-----NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTE
        M +GYQP KFQQFDGKGNPKQHI HFVETCENA        ++L ++F+     N F  TRR VSM ELTNT QRKGE V++YI RWRA+SLDCKD+ TE
Subjt:  MSIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESIDSWEELEKEFL-----NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTE

Query:  LSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IKEFMVVNTTLPKSSSKEKRQINGAYH--
        LSAVEMC QGMHWELLYIL+GIKPRTFEELATRAHDM+LSIA+R  +D L+   R +     +T       + E M+V  T  KS SK K   +   H  
Subjt:  LSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IKEFMVVNTTLPKSSSKEKRQINGAYH--

Query:  -----LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI
              TL+ERQKK+YPFPD+D+ DMLEQL+E QLI+LP+CKRPE++ KVDD  YCKYHRVI H V++CFVLK+LI KLA E KIELD+DEV ++N   +
Subjt:  -----LTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI

Query:  KEKS--------------------------KHQRKKNLKKLQPKRQRS------------KKFSQPQQLVMLNKSLSKTFHKKEKENLATSYCIDVEEVD
           S                          + Q+K      Q K + S            + F +     +L  +   T    E +N   SY    EE+D
Subjt:  KEKS--------------------------KHQRKKNLKKLQPKRQRS------------KKFSQPQQLVMLNKSLSKTFHKKEKENLATSYCIDVEEVD

Query:  NSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF
        NS + +QRTSVFD IKP TTR SVFQR+SMA  +EENQC   T  + SAF+RLS+S SKK RPST+ FDRLK+T+ Q +R+M +L+ K F
Subjt:  NSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF

A0A5A7TZU9 Ribonuclease H (e value: 1.4e-145; % identity: 61.71)
Query:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR
        P  N+++VM+T+  T E+RM E+++ +N LMKA++E+D +IA LK+ IE++  AESS T  IKN +KGK  +Q+ QPQ S S+ASLS+QQLQ+MI N I+
Subjt:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL
         QYGG  Q   LYSKPYTKRIDN+RM  GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPESIDSWE+LE++FL
Subjt:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL

Query:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFY+TRR VSM ELT TKQRKGE V++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R N DLL+P +
Subjt:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IKEFMVVNTT----LPKSSSKEKRQINGAYHL-TLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYC
        RKE +  + T        KE MVV+TT    + K    EKRQ  G     TLKERQ+K+YPFPD+D+PDML+QLLE QLI+LP+CKRP EM +V+D  YC
Subjt:  RKEGRNDEET-------IKEFMVVNTT----LPKSSSKEKRQINGAYHL-TLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYC

Query:  KYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI
        KYHRVI HPV++CFVLK+LILKLA++ KIEL+LD+V ++N A +
Subjt:  KYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATI

A0A5A7TZU9 Ribonuclease H (e value: 3.7e+00; % identity: 33.14)
Query:  DEVTKSNLATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTFH--KKEKENLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTR
        D  T++ L ++K       +  L   Q K Q+ + +S P     +    S+      K K  +A +  I VEE  +S++ +    QR+SVFDRI     R
Subjt:  DEVTKSNLATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTFH--KKEKENLATSYCIDVEEVDNSKKSE----QRTSVFDRIKPPTTR

Query:  PSVFQRMSMAPTEEENQCLMSTSTRPSAFQRL--------SVSTSKKSRPSTFVFDRLKVTSGQSKRKM
        PSVFQR+S +  ++ NQ    +STR SAFQRL        S+S +  +R S F    + VT  Q K  M
Subjt:  PSVFQRMSMAPTEEENQCLMSTSTRPSAFQRL--------SVSTSKKSRPSTFVFDRLKVTSGQSKRKM

A0A5A7TZU9 Ribonuclease H (e value: 1.6e-144; % identity: 47.23)
Query:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR
        P  N+++VM+T   T E RM E+++ +N LMK ++E+D +IA LK+ IE+   AESS    +KN DKGK  +Q+ QP+   S+ASLS+QQLQ+MIT+ I+
Subjt:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESIDSWEELEKEFLNRFYNTRRTVSMFELTNTKQRKGELV
         QYGG  Q   LYSKPYTKRIDNLRM  GYQPPKFQQ+  K      +    +  +   PESID+WE+LE++FLNRFY+TRR +SM ELTNT+Q+KGELV
Subjt:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENAEPESIDSWEELEKEFLNRFYNTRRTVSMFELTNTKQRKGELV

Query:  VNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IKEFMVVNT
        ++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTF+ELATRAHDMEL++A++  +D L+P +R +    ++T       IKE MVV+ 
Subjt:  VNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNMRKEGRNDEET-------IKEFMVVNT

Query:  TLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLA
        T  KS SK K     R+ +G      TLKERQ+K+YPFP++D+ DMLEQLLE QLI+L +CKR E+  KVDD  YCKYHRVI HP+++CFVLK+LILKLA
Subjt:  TLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFVLKDLILKLA

Query:  MEGKIELDLDEVTKSNLATIK-------------------------------------------------------------------------------
         E KIELD+DEV ++N   IK                                                                               
Subjt:  MEGKIELDLDEVTKSNLATIK-------------------------------------------------------------------------------

Query:  -----EKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTF---HKKEKENLATSYCIDV----------EEVDNSKKSEQRTSVFDRIKPPTTR
              K K +R K +   +P + + + F Q ++ + L + L ++F   H +E   +   +   +          +EV+NS +  QRTSVFDRIKP TTR
Subjt:  -----EKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTF---HKKEKENLATSYCIDV----------EEVDNSKKSEQRTSVFDRIKPPTTR

Query:  PSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF
         SVFQR+SMA  EE+NQC     TR S F+RLS+STSKK+RPST  FDRLK+T+ Q +R+M SL+ K F
Subjt:  PSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF

A0A5A7URH1 Ty3-gypsy retrotransposon protein (e value: 2.0e-163; % identity: 55.16)
Query:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR
        P  N+++VM+T   T E RM E+++ +N LMK ++E+D +IA LK+ IE++  AESS    +KN DKGK  +Q+ QPQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL
         QYGG  Q   LY KPYTKRIDNLRM  GYQPPKFQQFDGKGNPKQH+AHF++TCE A                          EPESID+WE+LE++FL
Subjt:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL

Query:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFY+TR  VSM ELTNT+Q+KGELV++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSIA+R  +D L+P  
Subjt:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK
        R +    ++T       IKE MVV+ T  KS SK K     R+ +G      TLKERQ+K+YPFPD+D+ DMLEQLLE QLI+LP+CKRPE+  KVDD  
Subjt:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK

Query:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTFHKKEKEN-LATS
        YCKYHRVI HPV++CFVLK+LILKLA E KIELD+DEV ++N A I+  S            P + + + F Q ++ + L + L ++F + + E  L  +
Subjt:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSNLATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTFHKKEKEN-LATS

Query:  YC-----IDVE-------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSK
         C     ++V+       EV+NS +  QRTSVFDRIKP TTR SVFQR+S+A  EEENQC     TR S  +RLS+ST KK RPST  FDRLK+T+ Q +
Subjt:  YC-----IDVE-------EVDNSKKSEQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSK

Query:  RKMDSLEMKLF
        R+M S + K F
Subjt:  RKMDSLEMKLF

A0A5D3BX77 Retrotransposon gag protein (e value: 7.9e-160; % identity: 49.49)
Query:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR
        P  N+++VM+T   T E RM E+++ +N LMK ++E+D +IA LK+ IE++  AESS    +KN DKGK  +Q+ QPQ S S+ASLS+QQLQ+MI + I+
Subjt:  PDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQIENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIR

Query:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL
         QYGG  Q   LYSKPYTKRIDNLRM  GYQPPKFQQFDGKGNPKQH+AHF+ETCE A                          EPESID+WE+LE++FL
Subjt:  AQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCENA--------------------------EPESIDSWEELEKEFL

Query:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM
        NRFY+TRR VSM ELTNT+Q+KGELV++YI RWRA+SLDCKDR TELSAVEMC QGMHW LLYIL+GIKPRTFEELATRAHDMELSI +R  +D L+P  
Subjt:  NRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQDLLLPNM

Query:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK
        R +     +T       IKE MVV+ T  KS SK K     R+ +G      TLKERQ+K+YPF D+D+ DMLEQLLE QLI+LPKCKRP++ EKVDD  
Subjt:  RKEGRNDEET-------IKEFMVVNTTLPKSSSKEK-----RQING--AYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLK

Query:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSN----------------------------------------------------------
        YCKYHRVI HPV++CFVLK+LILKLA E KIEL++DEV ++N                                                          
Subjt:  YCKYHRVIGHPVKRCFVLKDLILKLAMEGKIELDLDEVTKSN----------------------------------------------------------

Query:  ----------------------LATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTF---HKKEKENLATSYCIDVEEVDNSKKS----
                                +I  K K +R K +   +P + + + F Q ++ + L + L ++F   H +E   + T +   + EV+N+  S    
Subjt:  ----------------------LATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTF---HKKEKENLATSYCIDVEEVDNSKKS----

Query:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF
               QRTSVFDRIKP TTR SVFQR+SMA  EEENQC     TR S F+RLS+S SKK+RPST  FDRLK+T+ Q +R+M SL+ K F
Subjt:  ------EQRTSVFDRIKPPTTRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLF

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGTCGTTCAAGACTTCTTCAATGGTCACTGTCATGAACAAGTCCTACATGAGTCGTACTGCTCGTTGTTGCTTCAATGAACTGACGTTGCAAGAAGATAAAACTTCTAT
CGTTGTAGGCCAAGAAACAACCTTGCAGGGGGCATATATTAATGACAAGTTTCTTGTTAAGTATAACCCTTTGTTTGAACCTGATTCTAACGTAGTGACTGTCATGATGA
CTGAGACAAAAACTATGGAAGAAAGAATGGATGAGATGCAGGAACACATCAACACCTTGATGAAGGCGATTAAAGAAAAAGATTCTCAAATCGCGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTACAGTGCAAGATGATCAGCCACAGTGTTCTGCTTCGGTCGC
TTCATTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACATACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAATGTCAATCGGGTATCAGCCACCGAAATTTCAACAGTTTGATGGAAAGGGAAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCAATAGACAGTTGGGAGGAACTCGAAAAAGAGTTTTTGAATCGCTTCTACAACACTAGAAGAACTGTTAGCATGTTTGAACTCACCAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTACATAAAACGCTGGAGAGCCATGAGTTTAGATTGCAAAGATCGCTTCACTGAACTCTCTGCCGTCGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAATCAA
GACCTTCTACTCCCTAATATGAGGAAAGAAGGAAGGAACGATGAAGAGACTATAAAAGAATTTATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCG
ACAAATAAATGGAGCGTATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGATATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCTCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGAAAAGATGTTTTGTC
TTAAAGGACTTAATTCTAAAGCTAGCTATGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAACTAAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAGCATCAAAG
AAAGAAGAATCTTAAGAAACTTCAACCCAAGAGGCAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCTTTGTCCAAAACTTTCCACAAAA
AGGAAAAGGAGAACCTAGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACT
ACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCCCAACAGAGGAAGAAAATCAATGTTTGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTC
CACATCGAAGAAAAGTCGACCTTCAACATTTGTTTTTGATCGCCTCAAAGTAACAAGCGGTCAATCTAAAAGAAAGATGGATAGCTTGGAGATGAAACTTTTTGATTCCT
TCTCATCAATTTCGAGGTTCTCCTTGCTGAGTTTCTTCCTCCAGTTTGAGTTCAGTTCTCCGTTAAGGTTGCCTTTGCAGTTCCTTCCTCCAAGTTCAAGGTTCTCACGC
GCTTCGTTGCAGCTTCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGTAGTTTTTTCTCTCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCAA
AATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTAAAAGGTTCTCCGCTTCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGCGCTCCGCTG
CAGTTTCTTCCTCCAAGTTTGAAGGTTCTTACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCTTTTTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAG
TTCCTTTTCCCAAATTCGAAGGTTCTCATGCGCTTTGCTGAGTTCCTTCCTCCAAGTTCGAGGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACG
TGTTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTTGAAGGTCCTCATGCTA
CGCTCGGCTGCGCTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCATGTCCTTGAACTCATGCTGAAAGACGTGGCGGCGGCAAAAGTCCAAGGAGCACGCCATGTCCTT
GTACTCATGCTGAAAGATGTGGCGGCGGCAAAAGTCCAAGGAACACGTCATGTCCTCGTACTCATGCTGAAAGACATGGCGGCAGCGGCAAAAGTCCAAGGAGCACGTCA
TGTCCTTGAACTCATGCTGAAAAACGTGGCGACGGCAAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAAGACGTGGCGACGACAAAAGTCCAAGGAGCAC
GTCATGTCCCTGTACTCATGCTGAAAGACGTGGTGGCGACAAAAGTCCAAGGAGCACGTCATGTCCTTGTACTCATGCTGAAAGATGTGGCGGCGGCAAAAGTCCAAGGA
GCACGTCTTGTCCTTGTAATCATGCTGAAAGAAGTGGCGGCGGCAAAAGTCCAAGGAACACGTCATGTCCTTGAACTCATGCTGAAAGAAGTGGCAGCGGCAAAAGTCCA
AGGAGCTTGTCATGTCCTTGAACTCATGTTGAAAGACGTGGCGGCAACAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGA
mRNA sequence
ATGTCGTTCAAGACTTCTTCAATGGTCACTGTCATGAACAAGTCCTACATGAGTCGTACTGCTCGTTGTTGCTTCAATGAACTGACGTTGCAAGAAGATAAAACTTCTAT
CGTTGTAGGCCAAGAAACAACCTTGCAGGGGGCATATATTAATGACAAGTTTCTTGTTAAGTATAACCCTTTGTTTGAACCTGATTCTAACGTAGTGACTGTCATGATGA
CTGAGACAAAAACTATGGAAGAAAGAATGGATGAGATGCAGGAACACATCAACACCTTGATGAAGGCGATTAAAGAAAAAGATTCTCAAATCGCGCAACTAAAGAGCCAA
ATTGAGAACCAACATATCGCCGAATCAAGTCAAACCCAAGTCATAAAAAATCATGACAAAGGAAAGACTACAGTGCAAGATGATCAGCCACAGTGTTCTGCTTCGGTCGC
TTCATTATCCATCCAACAGCTCCAAGATATGATCACAAACTGTATCAGAGCTCAGTACGGTGGACATACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGA
TTGATAACTTGAGAATGTCAATCGGGTATCAGCCACCGAAATTTCAACAGTTTGATGGAAAGGGAAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAAC
GCTGAACCTGAGTCAATAGACAGTTGGGAGGAACTCGAAAAAGAGTTTTTGAATCGCTTCTACAACACTAGAAGAACTGTTAGCATGTTTGAACTCACCAACACTAAACA
ACGAAAAGGTGAACTCGTTGTTAACTACATAAAACGCTGGAGAGCCATGAGTTTAGATTGCAAAGATCGCTTCACTGAACTCTCTGCCGTCGAGATGTGCATTCAAGGCA
TGCACTGGGAACTCCTCTACATCCTTAAAGGTATAAAGCCTCGCACCTTTGAGGAACTAGCAACTCGCGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAATCAA
GACCTTCTACTCCCTAATATGAGGAAAGAAGGAAGGAACGATGAAGAGACTATAAAAGAATTTATGGTTGTAAACACAACCCTTCCCAAGTCGTCTTCGAAAGAAAAGCG
ACAAATAAATGGAGCGTATCACTTAACTTTAAAGGAAAGACAGAAGAAAATCTATCCTTTCCCTGATGCCGATATCCCTGATATGTTGGAACAACTATTGGAAGCGCAAC
TGATAGAGCTTCCTAAGTGTAAACGACCAGAAGAGATGGAGAAAGTCGATGATCTCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGAAAAGATGTTTTGTC
TTAAAGGACTTAATTCTAAAGCTAGCTATGGAAGGAAAAATTGAGCTCGACCTTGATGAAGTAACTAAATCAAATCTTGCTACAATCAAAGAAAAGAGCAAGCATCAAAG
AAAGAAGAATCTTAAGAAACTTCAACCCAAGAGGCAGAGAAGTAAAAAGTTTTCTCAACCTCAACAACTGGTGATGTTGAATAAATCTTTGTCCAAAACTTTCCACAAAA
AGGAAAAGGAGAACCTAGCAACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATTCCAAGAAAAGTGAACAAAGGACTTCTGTCTTCGATCGCATCAAGCCTCCAACT
ACTCGTCCTTCGGTATTCCAAAGAATGAGTATGGCCCCAACAGAGGAAGAAAATCAATGTTTGATGTCCACCTCCACTCGACCTTCAGCTTTCCAAAGGCTAAGTGTCTC
CACATCGAAGAAAAGTCGACCTTCAACATTTGTTTTTGATCGCCTCAAAGTAACAAGCGGTCAATCTAAAAGAAAGATGGATAGCTTGGAGATGAAACTTTTTGATTCCT
TCTCATCAATTTCGAGGTTCTCCTTGCTGAGTTTCTTCCTCCAGTTTGAGTTCAGTTCTCCGTTAAGGTTGCCTTTGCAGTTCCTTCCTCCAAGTTCAAGGTTCTCACGC
GCTTCGTTGCAGCTTCTTCTCTCCAAGGTCGAAGGTTCTCACGCGCTGCGTTGTAGTTTTTTCTCTCCAAGTTCGAAGGTTCACGCACTTCGCTTCAGTTCCTTCTCCAA
AATTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTAAAAGGTTCTCCGCTTCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACGCGCTCCGCTG
CAGTTTCTTCCTCCAAGTTTGAAGGTTCTTACGTCGCTTCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCTTTTTCCCCAAGTTCGAAGGTTCACGCACTTCGCTTCAG
TTCCTTTTCCCAAATTCGAAGGTTCTCATGCGCTTTGCTGAGTTCCTTCCTCCAAGTTCGAGGGTTCTCCGCTGCTGCAGTTCATTCCTCCAAGTTCGAAGGTTCTCACG
TGTTCCGCTGCAGTTCCTTCCTCCAAGTTTGAAGTTCTTTCCTCCAAGTTCGAAGGTTCTCACGCGCTTCGTTGCAGTTCCTTCCTCCAAGTTTGAAGGTCCTCATGCTA
CGCTCGGCTGCGCTGCTGCGCTACTTCCTAAAGTCCAAAGACGTCATGTCCTTGAACTCATGCTGAAAGACGTGGCGGCGGCAAAAGTCCAAGGAGCACGCCATGTCCTT
GTACTCATGCTGAAAGATGTGGCGGCGGCAAAAGTCCAAGGAACACGTCATGTCCTCGTACTCATGCTGAAAGACATGGCGGCAGCGGCAAAAGTCCAAGGAGCACGTCA
TGTCCTTGAACTCATGCTGAAAAACGTGGCGACGGCAAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGAAAGACGTGGCGACGACAAAAGTCCAAGGAGCAC
GTCATGTCCCTGTACTCATGCTGAAAGACGTGGTGGCGACAAAAGTCCAAGGAGCACGTCATGTCCTTGTACTCATGCTGAAAGATGTGGCGGCGGCAAAAGTCCAAGGA
GCACGTCTTGTCCTTGTAATCATGCTGAAAGAAGTGGCGGCGGCAAAAGTCCAAGGAACACGTCATGTCCTTGAACTCATGCTGAAAGAAGTGGCAGCGGCAAAAGTCCA
AGGAGCTTGTCATGTCCTTGAACTCATGTTGAAAGACGTGGCGGCAACAAAGTCCAAGGAGCACGTCATGTCCTTGAACTCATGCTGA
Protein sequence
MSFKTSSMVTVMNKSYMSRTARCCFNELTLQEDKTSIVVGQETTLQGAYINDKFLVKYNPLFEPDSNVVTVMMTETKTMEERMDEMQEHINTLMKAIKEKDSQIAQLKSQ
IENQHIAESSQTQVIKNHDKGKTTVQDDQPQCSASVASLSIQQLQDMITNCIRAQYGGHTQDSLLYSKPYTKRIDNLRMSIGYQPPKFQQFDGKGNPKQHIAHFVETCEN
AEPESIDSWEELEKEFLNRFYNTRRTVSMFELTNTKQRKGELVVNYIKRWRAMSLDCKDRFTELSAVEMCIQGMHWELLYILKGIKPRTFEELATRAHDMELSIASRENQ
DLLLPNMRKEGRNDEETIKEFMVVNTTLPKSSSKEKRQINGAYHLTLKERQKKIYPFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDLKYCKYHRVIGHPVKRCFV
LKDLILKLAMEGKIELDLDEVTKSNLATIKEKSKHQRKKNLKKLQPKRQRSKKFSQPQQLVMLNKSLSKTFHKKEKENLATSYCIDVEEVDNSKKSEQRTSVFDRIKPPT
TRPSVFQRMSMAPTEEENQCLMSTSTRPSAFQRLSVSTSKKSRPSTFVFDRLKVTSGQSKRKMDSLEMKLFDSFSSISRFSLLSFFLQFEFSSPLRLPLQFLPPSSRFSR
ASLQLLLSKVEGSHALRCSFFSPSSKVHALRFSSFSKIRRFSRASLQFLPPSLKGSPLLQFIPPSSKVLTRSAAVSSSKFEGSYVASLQFLPPSLKFFFPKFEGSRTSLQ
FLFPNSKVLMRFAEFLPPSSRVLRCCSSFLQVRRFSRVPLQFLPPSLKFFPPSSKVLTRFVAVPSSKFEGPHATLGCAAALLPKVQRRHVLELMLKDVAAAKVQGARHVL
VLMLKDVAAAKVQGTRHVLVLMLKDMAAAAKVQGARHVLELMLKNVATAKVQGARHVLELMLKDVATTKVQGARHVPVLMLKDVVATKVQGARHVLVLMLKDVAAAKVQG
ARLVLVIMLKEVAAAKVQGTRHVLELMLKEVAAAKVQGACHVLELMLKDVAATKSKEHVMSLNSC
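
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch, assuming the standard nuclear genetic code (NCBI translation table 1) and a CDS that starts in frame at its first base. The names `CODON_TABLE` and `translate` are hypothetical helpers for illustration, not part of any CuGenDB tool.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon.

    Whitespace and line breaks are ignored; codons containing
    characters other than A, C, G, T are translated as 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First ten codons of the CDS sequence listed above.
print(translate("ATGTCGTTCAAGACTTCTTCAATGGTCACT"))  # MSFKTSSMVT
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the listed protein sequence is one way to confirm that the two records agree.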