; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032059 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032059
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionTy3-gypsy retrotransposon protein
Genome locationchr11:23379376..23385238
RNA-Seq ExpressionLag0032059
SyntenyLag0032059
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033746.1 retrotransposon gag protein [Cucumis melo var. makuwa]5.0e-12846.79Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPES-----
        MI N IRAQYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE      
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPES-----

Query:  -VDSWEELEREFLNRFYSTRR----TVSMY----------KACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNIT
         ++ W  L  +  ++          T  M+          K  TFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T
Subjt:  -VDSWEELEREFLNRFYSTRR----TVSMY----------KACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNIT

Query:  LPKLSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK
          K  SK K   +  ++        TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+
Subjt:  LPKLSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK

Query:  EGKIELDLDEVAQSN------------------------------------------------------------------------------------L
        E KIELD+DEVAQ+N                                                                                     
Subjt:  EGKIELDLDEVAQSN------------------------------------------------------------------------------------L

Query:  ATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRP
          I  K K +R K   K +P + + K F QPR+ + + E   ++F +   E  +      T+  ++V       EEVDN    +QRT VF RIKP T R 
Subjt:  ATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRP

Query:  SVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLK
        SVFQR+SMA  +EENQC  ST  R+SAF+RLS+S  KK R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HSR+ SRMKRK SV INTEGSL 
Subjt:  SVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLK

Query:  VKPNLIILTNPANEGSDQDHDKDK
        VKP  II TNP NEG ++  D++K
Subjt:  VKPNLIILTNPANEGSDQDHDKDK

KAA0040811.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.5e-13248.76Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI N IRAQYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE +    
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEKRQTNGAHH----
                            K  TFEELATRAHDMELSIA+R  +D L+   R +     +T       + ESM+V  T  K  SK K + +  +H    
Subjt:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEKRQTNGAHH----

Query:  ---LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN------
            TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFVLK+LILKL +E KIELD+DEVAQ+N      
Subjt:  ---LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN------

Query:  ------------------------------------------------------------------------------LATIKGKSKHQRKKDPKKLQPK
                                                                                         I  K K +R K   K +P 
Subjt:  ------------------------------------------------------------------------------LATIKGKSKHQRKKDPKKLQPK

Query:  RKRSKKFSQPRQPVTVKEIFSKTF---HKKKKENFVTSYCIDV----------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMST
        + + + F QPRQ +T+ E F ++F   H K+       +   +          EEVDN    +QRTSVFDRIKP TTR SVFQR+S+   +EENQC  ST
Subjt:  RKRSKKFSQPRQPVTVKEIFSKTF---HKKKKENFVTSYCIDV----------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMST

Query:  STRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDHD
         TR+SAF+ LS+S SKK R STS FDRLK+ NDQ +R+M +L++K F E N D K+HSR+ SRMKRK SV INTEGSL VKP  II TNP NEG ++  D
Subjt:  STRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDHD

Query:  KDK
        ++K
Subjt:  KDK

KAA0055462.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-12948.34Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI N IRAQY GP Q S +YSKPYTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDL+PE +    
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---
                            K  TFEELATRAHDMELSIA+R  +D L+   R  G+N+           + ESM+V  T  K  SK K   +  +H   
Subjt:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---

Query:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-
             TL+ERQKK+Y FPD+D+ DMLEQL++ QLI+L +CKRPE+  KVDDP YCKYH VI H VE+CFVLK+LILKLA+E KIELD+DEVAQ+N   + 
Subjt:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-

Query:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR
                                                                                       KG   H++K  ++ K  +PK 
Subjt:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR

Query:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS
         + K+  F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTSVFDRIKP TTR SVFQR+SMA  +E+NQC  S
Subjt:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS

Query:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH
        T  R+SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+ SRMKRK SV INTEGSL VKP  II TNP NEG ++  
Subjt:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH

Query:  DKDK
        D++K
Subjt:  DKDK

KAA0056121.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]2.8e-13952.64Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI + I+ QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSM-------------------------------------------------------YKACTFEELATRAHDMELSIASRENQ
        +LER+FLNRFYSTR  VSM                                                        K  TFEELATRAHDMELSIA+R  +
Subjt:  ELEREFLNRFYSTRRTVSM-------------------------------------------------------YKACTFEELATRAHDMELSIASRENQ

Query:  DILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEM
        D L+P  R +    ++T       I+ESMVV+ T  K  SK K     R+ +G      TLKERQ+K+Y FPD+D+ DMLEQLLE QLI+LP+CKRPE+ 
Subjt:  DILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEM

Query:  EKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKK
         KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A                  P + + + F Q R+ +T+ E   ++F +   
Subjt:  EKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKK

Query:  ENFV------TSYCIDVE-------EVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLK
        E  +       +  ++V+       EV+N     QRTSVFDRIKP TTR SVFQR+S+A  +EENQC     TR+S  +RLS+S  KK R STS FDRLK
Subjt:  ENFV------TSYCIDVE-------EVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLK

Query:  VTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQ
        +TNDQ +R+M + + K F E N D K+HS + SRMKRK  V INTEGSL VKP  II TNP NEG +Q
Subjt:  VTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQ

TYK08944.1 retrotransposon gag protein [Cucumis melo var. makuwa]1.6e-12948.34Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI N IRAQY GP Q S +YSKPYTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDL+PE +    
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---
                            K  TFEELATRAHDMELSIA+R  +D L+   R  G+N+           + ESM+V  T  K  SK K   +  +H   
Subjt:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---

Query:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-
             TL+ERQKK+Y FPD+D+ DMLEQL++ QLI+L +CKRPE+  KVDDP YCKYH VI H VE+CFVLK+LILKLA+E KIELD+DEVAQ+N   + 
Subjt:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-

Query:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR
                                                                                       KG   H++K  ++ K  +PK 
Subjt:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR

Query:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS
         + K+  F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTSVFDRIKP TTR SVFQR+SMA  +E+NQC  S
Subjt:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS

Query:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH
        T  R+SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+ SRMKRK SV INTEGSL VKP  II TNP NEG ++  
Subjt:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH

Query:  DKDK
        D++K
Subjt:  DKDK

TrEMBL top hitse value%identityAlignment
A0A5A7SUW1 Retrotransposon gag protein2.4e-12846.79Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPES-----
        MI N IRAQYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE      
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPES-----

Query:  -VDSWEELEREFLNRFYSTRR----TVSMY----------KACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNIT
         ++ W  L  +  ++          T  M+          K  TFEEL+TRAHDMELSIA+   +D L+   ++  +N+           + ESM+V  T
Subjt:  -VDSWEELEREFLNRFYSTRR----TVSMY----------KACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNIT

Query:  LPKLSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK
          K  SK K   +  ++        TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LP+CKRPE+  KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+
Subjt:  LPKLSSKEKRQTNGAHH-------LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAK

Query:  EGKIELDLDEVAQSN------------------------------------------------------------------------------------L
        E KIELD+DEVAQ+N                                                                                     
Subjt:  EGKIELDLDEVAQSN------------------------------------------------------------------------------------L

Query:  ATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRP
          I  K K +R K   K +P + + K F QPR+ + + E   ++F +   E  +      T+  ++V       EEVDN    +QRT VF RIKP T R 
Subjt:  ATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRP

Query:  SVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLK
        SVFQR+SMA  +EENQC  ST  R+SAF+RLS+S  KK R STS FDRLK+ NDQ +R+M +L+ K F E N D K+HSR+ SRMKRK SV INTEGSL 
Subjt:  SVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLK

Query:  VKPNLIILTNPANEGSDQDHDKDK
        VKP  II TNP NEG ++  D++K
Subjt:  VKPNLIILTNPANEGSDQDHDKDK

A0A5A7TGM1 Retrotransposon gag protein7.3e-13348.76Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI N IRAQYGGP Q S +YSK YTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGNAF+WYTDLEPE +    
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEKRQTNGAHH----
                            K  TFEELATRAHDMELSIA+R  +D L+   R +     +T       + ESM+V  T  K  SK K + +  +H    
Subjt:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEKRQTNGAHH----

Query:  ---LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN------
            TL+ERQKK+Y FPD+D+ DMLEQL+E QLI+LPKCKRPE+  KVDDP YCKYHRVI H VE+CFVLK+LILKL +E KIELD+DEVAQ+N      
Subjt:  ---LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSN------

Query:  ------------------------------------------------------------------------------LATIKGKSKHQRKKDPKKLQPK
                                                                                         I  K K +R K   K +P 
Subjt:  ------------------------------------------------------------------------------LATIKGKSKHQRKKDPKKLQPK

Query:  RKRSKKFSQPRQPVTVKEIFSKTF---HKKKKENFVTSYCIDV----------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMST
        + + + F QPRQ +T+ E F ++F   H K+       +   +          EEVDN    +QRTSVFDRIKP TTR SVFQR+S+   +EENQC  ST
Subjt:  RKRSKKFSQPRQPVTVKEIFSKTF---HKKKKENFVTSYCIDV----------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMST

Query:  STRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDHD
         TR+SAF+ LS+S SKK R STS FDRLK+ NDQ +R+M +L++K F E N D K+HSR+ SRMKRK SV INTEGSL VKP  II TNP NEG ++  D
Subjt:  STRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDHD

Query:  KDK
        ++K
Subjt:  KDK

A0A5A7UI09 Retrotransposon gag protein7.5e-13048.34Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI N IRAQY GP Q S +YSKPYTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDL+PE +    
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---
                            K  TFEELATRAHDMELSIA+R  +D L+   R  G+N+           + ESM+V  T  K  SK K   +  +H   
Subjt:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---

Query:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-
             TL+ERQKK+Y FPD+D+ DMLEQL++ QLI+L +CKRPE+  KVDDP YCKYH VI H VE+CFVLK+LILKLA+E KIELD+DEVAQ+N   + 
Subjt:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-

Query:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR
                                                                                       KG   H++K  ++ K  +PK 
Subjt:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR

Query:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS
         + K+  F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTSVFDRIKP TTR SVFQR+SMA  +E+NQC  S
Subjt:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS

Query:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH
        T  R+SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+ SRMKRK SV INTEGSL VKP  II TNP NEG ++  
Subjt:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH

Query:  DKDK
        D++K
Subjt:  DKDK

A0A5A7URH1 Ty3-gypsy retrotransposon protein1.4e-13952.64Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI + I+ QYGGP Q   LY KPYTKRIDNLR P GYQPPKFQQFDGKGNPKQH+AHF++TCE AGTRGDLLVKQFVRTLKGNA DWY DLEPES+D+WE
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSM-------------------------------------------------------YKACTFEELATRAHDMELSIASRENQ
        +LER+FLNRFYSTR  VSM                                                        K  TFEELATRAHDMELSIA+R  +
Subjt:  ELEREFLNRFYSTRRTVSM-------------------------------------------------------YKACTFEELATRAHDMELSIASRENQ

Query:  DILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEM
        D L+P  R +    ++T       I+ESMVV+ T  K  SK K     R+ +G      TLKERQ+K+Y FPD+D+ DMLEQLLE QLI+LP+CKRPE+ 
Subjt:  DILLPNMRKEGRNDEET-------IEESMVVNITLPKLSSKEK-----RQTNG--AHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEM

Query:  EKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKK
         KVDDP YCKYHRVI HPVE+CFVLK+LILKLA+E KIELD+DEVAQ+N A                  P + + + F Q R+ +T+ E   ++F +   
Subjt:  EKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKKK

Query:  ENFV------TSYCIDVE-------EVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLK
        E  +       +  ++V+       EV+N     QRTSVFDRIKP TTR SVFQR+S+A  +EENQC     TR+S  +RLS+S  KK R STS FDRLK
Subjt:  ENFV------TSYCIDVE-------EVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLK

Query:  VTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQ
        +TNDQ +R+M + + K F E N D K+HS + SRMKRK  V INTEGSL VKP  II TNP NEG +Q
Subjt:  VTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQ

A0A5D3CCI8 Retrotransposon gag protein7.5e-13048.34Show/hide
Query:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE
        MI N IRAQY GP Q S +YSKPYTKRIDNLR P+GYQPPKFQQFDGKGNPKQHIAHFVETCENAG+RGD LV+QFVR+LKGN F+WYTDL+PE +    
Subjt:  MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWE

Query:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---
                            K  TFEELATRAHDMELSIA+R  +D L+   R  G+N+           + ESM+V  T  K  SK K   +  +H   
Subjt:  ELEREFLNRFYSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRND--------EETIEESMVVNITLPKLSSKEKRQTNGAHH---

Query:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-
             TL+ERQKK+Y FPD+D+ DMLEQL++ QLI+L +CKRPE+  KVDDP YCKYH VI H VE+CFVLK+LILKLA+E KIELD+DEVAQ+N   + 
Subjt:  ----LTLKERQKKIYHFPDADIPDMLEQLLEAQLIELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATI-

Query:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR
                                                                                       KG   H++K  ++ K  +PK 
Subjt:  -------------------------------------------------------------------------------KGKSKHQRK--KDPKKLQPKR

Query:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS
         + K+  F QPR+ +T+ E  S++F +   E  +      T+  ++V       EEVDN    +QRTSVFDRIKP TTR SVFQR+SMA  +E+NQC  S
Subjt:  KRSKK--FSQPRQPVTVKEIFSKTFHKKKKENFV------TSYCIDV-------EEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMS

Query:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH
        T  R+SAF+RLS+S SKK R STS FDRLK+TNDQ +R+M +L+ K F E N D K+H+R+ SRMKRK SV INTEGSL VKP  II TNP NEG ++  
Subjt:  TSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVNSDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDH

Query:  DKDK
        D++K
Subjt:  DKDK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATCACAAACTGTATCAGAGCCCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTA
TCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCA
AACAGTTTGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCGGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTC
TACAGCACTAGACGAACCGTTAGCATGTATAAAGCCTGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACAT
TCTCCTCCCTAACATGAGAAAAGAAGGAAGAAACGACGAAGAGACTATAGAAGAATCTATGGTCGTCAACATAACCCTTCCCAAGTTGTCTTCGAAAGAAAAGCGACAAA
CAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATATATCATTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATA
GAGCTTCCTAAGTGTAAACGGCCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAA
GGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGTAAGCATCAAAGAAAGA
AGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACTGTGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGAAA
AAAGAGAACTTTGTGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATCCTGAGAGGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCG
TCCTTCAGTATTCCAAAGAATGAGTATGGCCGCAACAAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGATCTTCAGCTTTCCAAAGGCTAAGTGTCTCCATAT
CAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAAC
AGTGACAAGAAGCTTCATAGTAGAATCTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAACCTCATTATCTTGAC
CAATCCTGCAAATGAAGGATCTGATCAAGACCATGACAAAGATAAGAGGTACGTAGGCAGCTTAAAGAAAACTTTAAGTTCAGTCTCTATAAACAAAAAAAAAGGTTCTT
CGCTGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCATTGTTCCTTC
TCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCAC
ACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCC
TCCAAGTTCGAAGGTTCTCACGTGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCG
CATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCCTTCCTCCA
AGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTT
CGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTTGCTGCAGTTCCTTCCTCCAAG
TTTGAAGGTTCTCACATCGCTTCGCTTTGCGCTGCGCTTCATTGCAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTG
CTACCTTCCTCCAAGCTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCG
CTTCTCTTCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTTCATTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGTTGCTGCAGTTCCTTCCTCCAAGTTC
GAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTCCAA
GTTCGAAGGTGCTTCTCTCCACCCCTCTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGAGGTTG
ACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTTTGCTGCGCTTATCTTCAAATGTTGGCAGTGGTGAAGTCACTGCAATTGAATCTGATGACGACC
GTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCC
TGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAG
GAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGA
TCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGATATCACTGCAAGCGAATTTGGGGGATGGAGATTCGGGGATTCAGATTTGGAGACAGAGTCAGAGAACTCAGA
GTCCAGAGCATTCTGCCAAGAGTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCTGATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGA
GATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCC
GATCATCCAAGAGGATCAACAAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGATCACAAACTGTATCAGAGCCCAGTACGGTGGACCTACTCAAGATTCCCTCTTGTATTCCAAACCTTATACTAAGAGGATTGATAACTTGAGAACGCCAATCGGGTA
TCAGCCACCAAAATTTCAGCAGTTTGATGGAAAAGGCAATCCTAAACAACATATTGCCCACTTCGTTGAGACATGCGAGAACGCTGGTACTCGAGGGGACCTACTAGTCA
AACAGTTTGTTCGAACACTTAAAGGAAATGCTTTTGACTGGTACACTGATCTAGAACCGGAGTCAGTAGACAGTTGGGAGGAACTCGAAAGAGAGTTTTTGAATCGCTTC
TACAGCACTAGACGAACCGTTAGCATGTATAAAGCCTGCACCTTTGAGGAACTAGCAACTCGTGCCCACGATATGGAGCTAAGTATTGCTAGTCGAGAAAACCAAGACAT
TCTCCTCCCTAACATGAGAAAAGAAGGAAGAAACGACGAAGAGACTATAGAAGAATCTATGGTCGTCAACATAACCCTTCCCAAGTTGTCTTCGAAAGAAAAGCGACAAA
CAAATGGAGCGCATCACTTAACTTTAAAGGAAAGACAGAAGAAAATATATCATTTCCCTGATGCCGACATCCCTGATATGTTGGAACAACTATTGGAAGCGCAACTGATA
GAGCTTCCTAAGTGTAAACGGCCAGAAGAGATGGAGAAAGTCGATGATCCCAAGTATTGCAAGTATCATCGAGTTATTGGTCATCCAGTGGAAAGATGTTTCGTCCTAAA
GGACTTAATTTTAAAGCTGGCTAAGGAAGGCAAAATTGAGCTCGACCTTGATGAAGTAGCTCAATCAAATCTTGCTACAATCAAAGGAAAGAGTAAGCATCAAAGAAAGA
AGGATCCTAAGAAACTTCAACCCAAGAGGAAGAGAAGTAAAAAGTTTTCTCAACCTCGACAACCGGTGACTGTGAAGGAGATCTTCTCCAAAACTTTCCACAAAAAGAAA
AAAGAGAACTTTGTGACTTCCTACTGCATCGACGTAGAAGAAGTTGACAATCCTGAGAGGGGTGAACAAAGGACCTCCGTCTTCGATCGCATCAAGCCTCCAACTACTCG
TCCTTCAGTATTCCAAAGAATGAGTATGGCCGCAACAAAAGAAGAAAATCAATGTTCGATGTCCACCTCCACTCGATCTTCAGCTTTCCAAAGGCTAAGTGTCTCCATAT
CAAAGAAAAGTCGATCTTCAACATCTGTCTTTGATCGCCTCAAAGTAACAAACGATCAACCTAAAAGAAAGATGAACAACTTGGAGTTGAAACTTTTCGATGAAGTAAAC
AGTGACAAGAAGCTTCATAGTAGAATCTCGTCACGTATGAAGAGGAAGTTTTCTGTTCTCATAAATACGGAAGGTTCCTTGAAGGTGAAGCCAAACCTCATTATCTTGAC
CAATCCTGCAAATGAAGGATCTGATCAAGACCATGACAAAGATAAGAGGTACGTAGGCAGCTTAAAGAAAACTTTAAGTTCAGTCTCTATAAACAAAAAAAAAGGTTCTT
CGCTGCAGTTCCTTCTCTCCAAGTTCGAGGGTCCTTACACTGTACGCTATTGCGTTGTTCCTTCTCCAAGTTCGAAGGTTCTTCGTTGTATCCTGCTGCATTGTTCCTTC
TCCAAGTTCGAGGGTTCTCAGTTGTACAACTGCTACGTTGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCAC
ACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCTTCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCC
TCCAAGTTCGAAGGTTCTCACGTGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCTCACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCG
CATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCCTTCCTCCA
AGTTCGAAGGTTCACACGCGCATCGCCACAGTTCCTTCCTCCAAGTCTGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGCGCTT
CGCTGCAGTTCCTTCCTCACAGTTCGAAGGTTCTCACGCGCTTCGCTGCAGTTCCTTCCCCCCAAGTTCGAAGGTTCTCACGTCGCTTTGCTGCAGTTCCTTCCTCCAAG
TTTGAAGGTTCTCACATCGCTTCGCTTTGCGCTGCGCTTCATTGCAGTTCCTTCCTCCCTAAGTTCGAAGTTCCTTCCTCCAAGTTCGAAGGTTCTCATGCGCTTCGTTG
CTACCTTCCTCCAAGCTCGAAGGTTCTCTCACGCGCTGCTGCAGCTCCTTCCTCCAAGTTCGAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCG
CTTCTCTTCGCTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTTCATTGCTACCTTCCTCCAAGTTCGAAGGTTCTCTCACGCGTTGCTGCAGTTCCTTCCTCCAAGTTC
GAAGGTTCCCTCACGCGCTTCGCTCGCTCCTTCTCCAAGTTCGAAGGCACTTCTCTCCACTGCTCCTTCTCCAAGTTCGAAGGCGCTTCTCTCCGCTGCTCCTTCTCCAA
GTTCGAAGGTGCTTCTCTCCACCCCTCTTTTGAAGGTTCGCCACTGAGGTTCTCCTTCTCCAAGTTCGAAGGTTCACCGTTGCTCCTTTTCAAATGTTTGGCGGAGGTTG
ACGTCCTCGTTCCGCTTCATCTTCAAATGTTGGTAGTTGACGGCGTTTGCTGCGCTTATCTTCAAATGTTGGCAGTGGTGAAGTCACTGCAATTGAATCTGATGACGACC
GTTGAAGGCGAGTCGGGTCTGGTGACCACCCCTGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAGGCGAATCTGGTGACTACCCC
TGCAGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAG
GAGTGCATGAAGGCGAATCTGGTTACTCAGATCACCCAATAAAATGGGGACTGGGTCTAGCAGGAGTGCATGAAGGCGAATCTGGTGACTACCCCTGCAGGTTACTCAGA
TCACCCAATGAAATAGGGGACTGGTCTAGCAGGAGTGATATCACTGCAAGCGAATTTGGGGGATGGAGATTCGGGGATTCAGATTTGGAGACAGAGTCAGAGAACTCAGA
GTCCAGAGCATTCTGCCAAGAGTCCAGAGTCGGCAGGCCGATCATCCAAGAGGATCAACAAGCTAACAAGCTGATCCAACAGATCATCAAGCCAACAGGCCGATCCAAGA
GATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCCGATCATCCAAGAGATCAACAAGCCAACCGACCGATCAAGAAGATCAACAAGTCAGCAGGCC
GATCATCCAAGAGGATCAACAAGCTAA
Protein sequenceShow/hide protein sequence
MITNCIRAQYGGPTQDSLLYSKPYTKRIDNLRTPIGYQPPKFQQFDGKGNPKQHIAHFVETCENAGTRGDLLVKQFVRTLKGNAFDWYTDLEPESVDSWEELEREFLNRF
YSTRRTVSMYKACTFEELATRAHDMELSIASRENQDILLPNMRKEGRNDEETIEESMVVNITLPKLSSKEKRQTNGAHHLTLKERQKKIYHFPDADIPDMLEQLLEAQLI
ELPKCKRPEEMEKVDDPKYCKYHRVIGHPVERCFVLKDLILKLAKEGKIELDLDEVAQSNLATIKGKSKHQRKKDPKKLQPKRKRSKKFSQPRQPVTVKEIFSKTFHKKK
KENFVTSYCIDVEEVDNPERGEQRTSVFDRIKPPTTRPSVFQRMSMAATKEENQCSMSTSTRSSAFQRLSVSISKKSRSSTSVFDRLKVTNDQPKRKMNNLELKLFDEVN
SDKKLHSRISSRMKRKFSVLINTEGSLKVKPNLIILTNPANEGSDQDHDKDKRYVGSLKKTLSSVSINKKKGSSLQFLLSKFEGPYTVRYCVVPSPSSKVLRCILLHCSF
SKFEGSQLYNCYVVPSSKFEGSHAHRHSSFLQVRRFTRASPQFLPPSSKVHTRFATVPSSKFEGSHAHRHSSFLQVRRFSRASPQFLPPSSKVLTRIATVPSSKFEGSHA
HRHSSFLQVRRFTRASPQFLPPSSKVHTRIATVPFLQVRRFTRASPQFLPPSLKVLTRFAAVPSPQVRRFSRASLQFLPHSSKVLTRFAAVPSPQVRRFSRRFAAVPSSK
FEGSHIASLCAALHCSSFLPKFEVPSSKFEGSHALRCYLPPSSKVLSRAAAAPSSKFEGSLTRFARSFSKFEGASLRCSFSKFEGASLHCYLPPSSKVLSRVAAVPSSKF
EGSLTRFARSFSKFEGTSLHCSFSKFEGASLRCSFSKFEGASLHPSFEGSPLRFSFSKFEGSPLLLFKCLAEVDVLVPLHLQMLVVDGVCCAYLQMLAVVKSLQLNLMTT
VEGESGLVTTPAGYSDHPIKWGLGLAGVHEANLVTTPAGYSDHPIKWGLGLAGVHEGESGYSDHPIKWGLGLAGVHEGESGYSDHPIKWGLGLAGVHEGESGDYPCRLLR
SPNEIGDWSSRSDITASEFGGWRFGDSDLETESENSESRAFCQESRVGRPIIQEDQQANKLIQQIIKPTGRSKRSTSQPTDQEDQQVSRPIIQEINKPTDRSRRSTSQQA
DHPRGSTS