; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0010749 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0010749
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr1:5186431..5188241
RNA-Seq ExpressionLag0010749
SyntenyLag0010749
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4263564.1 unnamed protein product [Prunus armeniaca]2.4e-10839.1Show/hide
Query:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP
        ++  V V++F +LF S +  V D +V    +   VS      LL P+S +EI +AL    P+KAPGPDG+   FY+ +W IVG D++  CL VLN     
Subjt:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP

Query:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY
         + N TLV LIPK  +   V+++RPISLCN                          SAFIP R ++DN +  FE +H L++  +   K   LKLDM+KAY
Subjt:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY

Query:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF
        DRVE  FL +++  +G              +VS+S  + G   G++IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AER  R     +  S+PSI+HLFF
Subjt:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF

Query:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG
         DDSLLF     +EA  ++ +   YE ASGQ +N   S + FSP+T    Q  I Q+L+V++ PCH++YLGLP+ + +++    R +KDR+W ++ GW+G
Subjt:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG

Query:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG------RSWLATVVC-----MAQF------
        K  S AGKEVL+KS+ QAIP Y+M+ FRLP GL +EI S +AKFWW     RG+     R +      GG       S+   ++C     + +F      
Subjt:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG------RSWLATVVC-----MAQF------

Query:  --------------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS
                         +G+L S               G RWRIG+GR   +YG  W+P +    IQS+P+LP  S V DLF ASGGWD   + A F   
Subjt:  --------------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS

Query:  D
        +
Subjt:  D

CAB4273075.1 unnamed protein product [Prunus armeniaca]3.1e-10838.94Show/hide
Query:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP
        ++  V V++F +LF S +  V D +V    +   VS      LL P+S +EI +AL    P+KAPGPDG+   FY+ +W IVG D++  C+ VLN     
Subjt:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP

Query:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY
         + N TLV LIPK  +   V+++RPISLCN                          SAFIP R ++DN +  FE +H L++R +   K   LKLDM+KAY
Subjt:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY

Query:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF
        DRVEW FL +++  +G              +VS+S  + G   G++IPSRGLRQGDP+S YLFL+ AE  S+LL+ AER  R     +  S+PSI+HLFF
Subjt:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF

Query:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG
         DDSLLF     +EA  ++ +   YE A GQ +N   S + FSP+T    Q  I Q+L+V+L PCH++YLGLP+ + +++      +KDR+W ++ GW+G
Subjt:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG

Query:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG-------------------------RSWLA
        K  S A KEVL+KS+ QAIP Y+M+ FRLP GL +EI S +AKFWW     RG+     R +      GG                          S +A
Subjt:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG-------------------------RSWLA

Query:  TVVCMAQF------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS
         ++    F         +G+L S               G RWRIG+GR   +YG  W+P +    IQS+P+LPA S V DLF ASGGWD   + A F   
Subjt:  TVVCMAQF------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS

Query:  D
        +
Subjt:  D

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]8.2e-10937.04Show/hide
Query:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG
        +  +  DYF+ LF+S+    Q  +  L ++ P ++  MN  LL+ F+ EE+   L Q  P KAPG DG+   F++ +W IVG  + + CL +LN   S  
Subjt:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG

Query:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD
        E N TL+ LIPK K    V++FRPISLC                           SAF+P R ++DN +  FE +H ++   +GR    ALKLDM+KAYD
Subjt:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD

Query:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV
        RVEW FLRE+ML+LG              + +FS    G  VG ++P RGLRQG PLSPYLFL+C EG S LLRGAERR   +  +V    PS++HL F 
Subjt:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV

Query:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK
        DDS+LF +        +  L   YE+ SGQ INY  S  + SPN        I  +L+V +  CH++YLGLP+   + R    + LKD++W+ I GWK K
Subjt:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK

Query:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------
          S AGKE+L+K+++QAIP Y+M+CFR+PKGL KE++  MA+FWW     +  +   G  W+   ++C ++F  G G                       
Subjt:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------

Query:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA
                                                L+ G RWR+GNG S  VY   WLP     +I S P LP +++V DLF +SG W+  +L+ 
Subjt:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA

Query:  HF
         F
Subjt:  HF

XP_008237273.1 PREDICTED: uncharacterized protein LOC103336015 [Prunus mume]2.0e-11039.43Show/hide
Query:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP
        ++  V V++F +LF S +  V D +V    +   VS      LL P+S +EI +AL    P KAPGPDG+   FY+ +W IVG +++  CL VLN     
Subjt:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP

Query:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY
         + N TLV LIPK  +   V+++RPISLCN                          SAFIP R ++DN +  FE +H L++R +   K   LKLDM+KAY
Subjt:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY

Query:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF
        DRVEW FL +++  +G              +VS+S  + G   G++IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AER  R     +  S+PSI+HLFF
Subjt:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF

Query:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG
         DDSLLF     +EA  ++ +   YE ASGQ +N   S + FSP+T    Q  I Q+L+V+L PCH++YLGLP+ + +++    R +KDR+W ++ GW+G
Subjt:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG

Query:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGV------VISPVRVLGG-------------------------RSWLA
        K  S AGKEVL+KS+ QAIP Y+M+ FRLP GL +EI S +AKFWW     RG+       +   +  GG                          S +A
Subjt:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGV------VISPVRVLGG-------------------------RSWLA

Query:  TVVCMAQF------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS
         ++    F         +G+L S               G RWRIG+GR   +YG  W+P +    IQS+P+LPA S V DLF ASGGWD   + A F   
Subjt:  TVVCMAQF------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS

Query:  D
        +
Subjt:  D

XP_030487384.1 uncharacterized protein LOC115704310 [Cannabis sativa]2.2e-10940Show/hide
Query:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG
        V ++V DY+  LF SS  +   F   L  + P V+  MN+ L+  F+ EE++ A+++ +P KAPG DGL   FY+  W  +  D+  +CL VLN G    
Subjt:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG

Query:  EVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD
         +NDT+V LIPK    + + +FRPISLCN                          SAF+ GR + DNAI+G+E +H +RK         ALKLDM+KAYD
Subjt:  EVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD

Query:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFFV
        RVEW FL  +M++LG              SV FSF +NGE  G+V P RGLRQGDPLSP+LFLLCAE  SSL++ AE+R R      G     +SHLFF 
Subjt:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFFV

Query:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK
        DDSL+F     +E    +ELL  Y  ASGQ +N+  S + F  N     +  ++ ++ V +   + +YLGLPSF+ R +     F+K+R+W +++GWKG 
Subjt:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK

Query:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLATVVCMAQFVVGAGALSSGC------------------
        FFS A KEVL+K+IVQAIP YTM+CFRLPK  +  IHS  A+FWW G      ++     W    V  A +   +G L + C                  
Subjt:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLATVVCMAQFVVGAGALSSGC------------------

Query:  ----RWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLSD
            RWRIGN  S  V   +WLP     +I   P LP    V DL    G WD+  +RA F+ +D
Subjt:  ----RWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLSD

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein5.7e-10836.71Show/hide
Query:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG
        +  +  DYF+ LF+SS    Q  +  L ++ P ++  MN  LL+ F+ EE+   L Q  P KAPG DG+   F++ +W IVG  + + CL +LN   S  
Subjt:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG

Query:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD
        E N TL+ LIPK K    V++FRPISLC                           SAF+P R ++DN +  FE ++ ++   +GR    ALKLDM+KAYD
Subjt:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD

Query:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV
        RVEW FLR +ML+LG              + +FS    G  VG ++P RGLRQG PLSPYLFL+C EG S LLRGAERR   +  +V   +PS++HL F 
Subjt:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV

Query:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK
        DDS+LF +    +   +  L   YE+ +GQ INY  S ++ SPN        I  +L+V +  CH+ YLGLP+   + R    + LKD++W+ I GWK K
Subjt:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK

Query:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------
          S AGKE+L+K+++QAIP Y+M+CFR+PKGL KE++  MA+FWW     +  +   G  W+   ++C ++F  G G                       
Subjt:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------

Query:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA
                                                L+ G RWR+G+G S  VY   WLP   C +I S P LP ++ V DLF +SG W+  +L+ 
Subjt:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA

Query:  HF
         F
Subjt:  HF

A0A5E4FZN9 PREDICTED: retrotransposon4.0e-10937.04Show/hide
Query:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG
        +  +  DYF+ LF+S+    Q  +  L ++ P ++  MN  LL+ F+ EE+   L Q  P KAPG DG+   F++ +W IVG  + + CL +LN   S  
Subjt:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG

Query:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD
        E N TL+ LIPK K    V++FRPISLC                           SAF+P R ++DN +  FE +H ++   +GR    ALKLDM+KAYD
Subjt:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD

Query:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV
        RVEW FLRE+ML+LG              + +FS    G  VG ++P RGLRQG PLSPYLFL+C EG S LLRGAERR   +  +V    PS++HL F 
Subjt:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV

Query:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK
        DDS+LF +        +  L   YE+ SGQ INY  S  + SPN        I  +L+V +  CH++YLGLP+   + R    + LKD++W+ I GWK K
Subjt:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK

Query:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------
          S AGKE+L+K+++QAIP Y+M+CFR+PKGL KE++  MA+FWW     +  +   G  W+   ++C ++F  G G                       
Subjt:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------

Query:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA
                                                L+ G RWR+GNG S  VY   WLP     +I S P LP +++V DLF +SG W+  +L+ 
Subjt:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA

Query:  HF
         F
Subjt:  HF

A0A6J5TIF9 Reverse transcriptase domain-containing protein1.2e-10839.1Show/hide
Query:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP
        ++  V V++F +LF S +  V D +V    +   VS      LL P+S +EI +AL    P+KAPGPDG+   FY+ +W IVG D++  CL VLN     
Subjt:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP

Query:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY
         + N TLV LIPK  +   V+++RPISLCN                          SAFIP R ++DN +  FE +H L++  +   K   LKLDM+KAY
Subjt:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY

Query:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF
        DRVE  FL +++  +G              +VS+S  + G   G++IPSRGLRQGDP+SPYLFL+ AE  S+LL+ AER  R     +  S+PSI+HLFF
Subjt:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF

Query:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG
         DDSLLF     +EA  ++ +   YE ASGQ +N   S + FSP+T    Q  I Q+L+V++ PCH++YLGLP+ + +++    R +KDR+W ++ GW+G
Subjt:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG

Query:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG------RSWLATVVC-----MAQF------
        K  S AGKEVL+KS+ QAIP Y+M+ FRLP GL +EI S +AKFWW     RG+     R +      GG       S+   ++C     + +F      
Subjt:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG------RSWLATVVC-----MAQF------

Query:  --------------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS
                         +G+L S               G RWRIG+GR   +YG  W+P +    IQS+P+LP  S V DLF ASGGWD   + A F   
Subjt:  --------------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS

Query:  D
        +
Subjt:  D

A0A6J5UD52 Reverse transcriptase domain-containing protein1.5e-10838.94Show/hide
Query:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP
        ++  V V++F +LF S +  V D +V    +   VS      LL P+S +EI +AL    P+KAPGPDG+   FY+ +W IVG D++  C+ VLN     
Subjt:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSP

Query:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY
         + N TLV LIPK  +   V+++RPISLCN                          SAFIP R ++DN +  FE +H L++R +   K   LKLDM+KAY
Subjt:  GEVNDTLVVLIPKTKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAY

Query:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF
        DRVEW FL +++  +G              +VS+S  + G   G++IPSRGLRQGDP+S YLFL+ AE  S+LL+ AER  R     +  S+PSI+HLFF
Subjt:  DRVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFR---VGHSSPSISHLFF

Query:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG
         DDSLLF     +EA  ++ +   YE A GQ +N   S + FSP+T    Q  I Q+L+V+L PCH++YLGLP+ + +++      +KDR+W ++ GW+G
Subjt:  VDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKG

Query:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG-------------------------RSWLA
        K  S A KEVL+KS+ QAIP Y+M+ FRLP GL +EI S +AKFWW     RG+     R +      GG                          S +A
Subjt:  KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWW-----RGVVISPVRVL------GG-------------------------RSWLA

Query:  TVVCMAQF------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS
         ++    F         +G+L S               G RWRIG+GR   +YG  W+P +    IQS+P+LPA S V DLF ASGGWD   + A F   
Subjt:  TVVCMAQF------VVGAGALSS---------------GCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLS

Query:  D
        +
Subjt:  D

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)5.7e-10836.71Show/hide
Query:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG
        +  +  DYF+ LF+SS    Q  +  L ++ P ++  MN  LL+ F+ EE+   L Q  P KAPG DG+   F++ +W IVG  + + CL +LN   S  
Subjt:  VAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPG

Query:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD
        E N TL+ LIPK K    V++FRPISLC                           SAF+P R ++DN +  FE ++ ++   +GR    ALKLDM+KAYD
Subjt:  EVNDTLVVLIPKTKAARWVADFRPISLC--------------------------NSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYD

Query:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV
        RVEW FLR +ML+LG              + +FS    G  VG ++P RGLRQG PLSPYLFL+C EG S LLRGAERR   +  +V   +PS++HL F 
Subjt:  RVEWPFLREVMLRLG--------------SVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERR---LRFRVGHSSPSISHLFFV

Query:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK
        DDS+LF +    +   +  L   YE+ +GQ INY  S ++ SPN        I  +L+V +  CH+ YLGLP+   + R    + LKD++W+ I GWK K
Subjt:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQGWKGK

Query:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------
          S AGKE+L+K+++QAIP Y+M+CFR+PKGL KE++  MA+FWW     +  +   G  W+   ++C ++F  G G                       
Subjt:  FFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLA-TVVCMAQFVVGAG-----------------------

Query:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA
                                                L+ G RWR+G+G S  VY   WLP   C +I S P LP ++ V DLF +SG W+  +L+ 
Subjt:  ---------------------------------------ALSSGCRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRA

Query:  HF
         F
Subjt:  HF

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein8.2e-2724.05Show/hide
Query:  EVAQVVVDYFQHLFASSIPSVQDFDVAL-RDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCS
        E+   +  +++ L+++ + ++ + D  L R   P ++ D    L  P S +EI   +      K+PGPDG S  FY+   + +   + +    +   G  
Subjt:  EVAQVVVDYFQHLFASSIPSVQDFDVAL-RDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCS

Query:  PGEVNDTLVVLIPK-TKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSK
        P    +  + LIPK  K    + +FRPISL N                            FIPG     N       IH + K          + LD  K
Subjt:  PGEVNDTLVVLIPK-TKAARWVADFRPISLCN--------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSK

Query:  AYDRVEWPFLREVMLRLGSVSFSFNL--------------NGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISHLFFV
        A+D+++ PF+ +V+ R G      N+              NGEK+  +    G RQG PLSPYLF +  E L+  +R  +     ++G     IS L   
Subjt:  AYDRVEWPFLREVMLRLGSVSFSFNL--------------NGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISHLFFV

Query:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLG--LPSFMPRNRSGTLRFLKDRIWRQIQGWK
        DD +++  +  +    +  L+  + +  G  IN   S +AF     + A++ I +    S+   + +YLG  L   +        + LK  I   ++ WK
Subjt:  DDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLG--LPSFMPRNRSGTLRFLKDRIWRQIQGWK

Query:  GKFFSIAGKEVLLKSIVQAIPCYTMNC--FRLPKGLVKEIHSAMAKFWW
            S  G+  ++K  +     Y  N    ++P     E+  A+ KF W
Subjt:  GKFFSIAGKEVLLKSIVQAIPCYTMNC--FRLPKGLVKEIHSAMAKFWW

P14381 Transposon TX1 uncharacterized 149 kDa protein2.7e-2224.06Show/hide
Query:  YFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPGEVNDTLV
        ++Q+LF S  P   D    L D  P VS+   + L  P + +E+  ALR    NK+PG DGL+  F++  WD +G D  +        G  P      ++
Subjt:  YFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPGEVNDTLV

Query:  VLIPKTKAARWVADFRPISLCNSAF--------------------------IPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYDRVEWPFL
         L+PK    R + ++RP+SL ++ +                          +PGR + DN  L  + +H  R   R     A L LD  KA+DRV+  +L
Subjt:  VLIPKTKAARWVADFRPISLCNSAF--------------------------IPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYDRVEWPFL

Query:  REVM--------------LRLGSVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISHLFFVDDSLLFFREN
           +                  S      +N      +   RG+RQG PLS  L+ L  E    LLR   +RL   V         L    D ++   ++
Subjt:  REVM--------------LRLGSVSFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISHLFFVDDSLLFFREN

Query:  GSEASVIRELLLWYEKASGQTINYEIS--------VVAFSP------NTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQG
          +    +E    Y  AS   IN+  S         V F P      + E    +Y+   LS    P  Q ++                L++ +  ++  
Subjt:  GSEASVIRELLLWYEKASGQTINYEIS--------VVAFSP------NTEEGAQQYISQILSVSLCPCHQQYLGLPSFMPRNRSGTLRFLKDRIWRQIQG

Query:  WKG--KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRG
        WKG  K  S+ G+ +++  +V +   Y + C    +  + +I   +  F W G
Subjt:  WKG--KFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRG

P16423 Retrovirus-related Pol polyprotein from type-2 retrotransposable element R2DM2.4e-1025.25Show/hide
Query:  MEVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFS--EEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHG
        M   +++V Y++ +     PS           C      M+ +L R +S   E+ L A R    + +PGPDG++    R   ++    + +    +L  G
Subjt:  MEVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFS--EEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHG

Query:  CSPGEVNDTLVVLIPKTKAARWVADFRPISL---------------CNSA---------FIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKA
          P  +     V IPKT  A+   DFRPIS+                NS+         F+P     DNA +    +    K  R         LD+SKA
Subjt:  CSPGEVNDTLVVLIPKTKAARWVADFRPISL---------------CNSA---------FIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKA

Query:  YDRVEWPFLREVMLRLGSV--------------SFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISHLFFVD
        +D +    + + +   G+                 S N +G    + +P+RG++QGDPLSP LF L    +  LLR     +  +VG++  + +   F D
Subjt:  YDRVEWPFLREVMLRLGSV--------------SFSFNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISHLFFVD

Query:  DSLLF
        D +LF
Subjt:  DSLLF

P92555 Uncharacterized mitochondrial protein AtMg012507.9e-1457.97Show/hide
Query:  FNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLR---FRVGHSSPSISHLFFVDDS
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ + R    RV ++SP I+HL F DD+
Subjt:  FNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLR---FRVGHSSPSISHLFFVDDS

Q03278 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)5.7e-1226.32Show/hide
Query:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQA------LLRPFSEEEI--LLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLA
        + A +V+D       + +   +     +R  CPT     ++       L RP S +EI  + A ++T    A GPDG++       W+ +   I      
Subjt:  EVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQA------LLRPFSEEEI--LLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLA

Query:  VLNHGCSPGEVNDTLVVLIPKTKAARWVADFRPISLCN------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKL
        ++ HG  P    D+  VLIPK       A FRP+S+ +                         AFI    V +N  L    I E R + +G   + A+ L
Subjt:  VLNHGCSPGEVNDTLVVLIPKTKAARWVADFRPISLCN------------------------SAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKL

Query:  DMSKAYDRVEWPFL------REVMLRLGSVSFSFNLNGEKVGQVI--------PSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISH
        D+ KA+D VE   +      +++ L + +       N +   +V+        P+RG+RQGDPLSP LF  C   + ++LR       F +G  +  I  
Subjt:  DMSKAYDRVEWPFL------REVMLRLGSVSFSFNLNGEKVGQVI--------PSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISH

Query:  LFFVDDSLLFFR-ENGSEASVIR
        L F DD +L      G +AS+ R
Subjt:  LFFVDDSLLFFR-ENGSEASVIR

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.0e-0940Show/hide
Query:  ISLCNSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYDRVEWPFLREVMLRLG
        I    ++FIPGR   DN +   E +H +R R +G   W  LKLD+ KAYDR+ W +L + ++  G
Subjt:  ISLCNSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYDRVEWPFLREVMLRLG

AT4G29090.1 Ribonuclease H-like superfamily protein1.2e-0460Show/hide
Query:  AIPCYTMNCFRLPKGLVKEIHSAMAKFWWR
        A+P YTM CF LPK + K+I S +A FWWR
Subjt:  AIPCYTMNCFRLPKGLVKEIHSAMAKFWWR

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)5.6e-1557.97Show/hide
Query:  FNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLR---FRVGHSSPSISHLFFVDDS
        F +NG   G V PSRGLRQGDPLSPYLF+LC E LS L R A+ + R    RV ++SP I+HL F DD+
Subjt:  FNLNGEKVGQVIPSRGLRQGDPLSPYLFLLCAEGLSSLLRGAERRLR---FRVGHSSPSISHLFFVDDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGTTGCTCAAGTGGTGGTTGACTACTTCCAGCATTTGTTCGCTTCATCGATCCCGAGTGTGCAGGATTTTGATGTGGCCTTGCGAGATCTGTGTCCTACTGTGAG
TGATGACATGAACCAGGCTCTTTTACGCCCTTTTTCTGAGGAAGAGATCCTGTTGGCATTAAGGCAGACGCATCCTAATAAGGCCCCTGGACCAGATGGGTTGTCAGGAA
GTTTTTACAGGAATCACTGGGACATTGTCGGGTCGGATATCACGCAGAGCTGTTTGGCAGTACTGAATCACGGTTGCTCTCCAGGTGAAGTAAATGACACCTTGGTTGTG
CTCATTCCGAAGACCAAAGCGGCCCGTTGGGTTGCAGATTTCAGGCCTATTTCCCTTTGTAATAGTGCGTTTATTCCAGGGAGATGTGTGGTGGATAACGCCATCCTGGG
GTTTGAGTGTATCCATGAGCTTCGTAAGAGAGCCAGGGGGAGGGCTAAATGGGCTGCGTTGAAGCTGGACATGAGCAAGGCATATGACAGGGTGGAATGGCCTTTCCTCC
GTGAGGTCATGCTCCGATTGGGCTCAGTGTCCTTCTCCTTTAACCTGAACGGGGAGAAGGTGGGGCAGGTGATTCCGTCTCGAGGTCTCCGGCAGGGGGATCCCCTTTCT
CCATACTTGTTTCTGTTATGTGCTGAAGGTCTGTCAAGTCTATTGCGTGGTGCTGAGCGGAGGCTCAGGTTTCGGGTGGGGCATTCTAGTCCGTCGATCTCACACCTTTT
CTTCGTGGATGACAGTCTTCTCTTTTTCAGGGAGAATGGGAGTGAAGCATCGGTTATTCGGGAACTGTTGCTATGGTATGAAAAAGCTTCAGGTCAGACTATCAATTACG
AGATATCTGTTGTGGCTTTTAGCCCAAATACGGAGGAGGGGGCTCAACAGTATATTAGCCAAATCCTATCTGTGTCCCTCTGTCCTTGCCACCAGCAGTATCTTGGTTTG
CCTTCATTTATGCCACGGAACCGATCAGGGACGTTGAGATTTTTAAAGGATCGTATATGGCGCCAGATCCAGGGTTGGAAGGGTAAATTCTTTTCGATAGCAGGGAAGGA
AGTCCTACTTAAATCCATAGTTCAGGCTATCCCTTGCTATACGATGAACTGCTTCCGGTTGCCCAAGGGTTTGGTGAAAGAGATTCACAGTGCTATGGCTAAGTTTTGGT
GGAGGGGGGTCGTTATTTCCCCAGTCAGGGTTCTTGGAGGCAGGTCTTGGCTTGCGACCGTCGTTTGTATGGCGCAGTTTGTTGTCGGGGCGGGAGCTCTTAGTTCGGGT
TGTCGGTGGAGGATTGGTAATGGGCGGTCTACGCCGGTTTATGGTTCAAACTGGTTGCCGAACGAGTTTTGTCTTCAAATACAGTCGGTCCCGTCACTTCCTGCTGCTAG
TGTGGTTAGTGATCTTTTTGCTGCGTCTGGTGGGTGGGATGAGGCTGTGCTCAGAGCCCATTTTGATTTGTCGGATCGTGGGCCATCTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGTTGCTCAAGTGGTGGTTGACTACTTCCAGCATTTGTTCGCTTCATCGATCCCGAGTGTGCAGGATTTTGATGTGGCCTTGCGAGATCTGTGTCCTACTGTGAG
TGATGACATGAACCAGGCTCTTTTACGCCCTTTTTCTGAGGAAGAGATCCTGTTGGCATTAAGGCAGACGCATCCTAATAAGGCCCCTGGACCAGATGGGTTGTCAGGAA
GTTTTTACAGGAATCACTGGGACATTGTCGGGTCGGATATCACGCAGAGCTGTTTGGCAGTACTGAATCACGGTTGCTCTCCAGGTGAAGTAAATGACACCTTGGTTGTG
CTCATTCCGAAGACCAAAGCGGCCCGTTGGGTTGCAGATTTCAGGCCTATTTCCCTTTGTAATAGTGCGTTTATTCCAGGGAGATGTGTGGTGGATAACGCCATCCTGGG
GTTTGAGTGTATCCATGAGCTTCGTAAGAGAGCCAGGGGGAGGGCTAAATGGGCTGCGTTGAAGCTGGACATGAGCAAGGCATATGACAGGGTGGAATGGCCTTTCCTCC
GTGAGGTCATGCTCCGATTGGGCTCAGTGTCCTTCTCCTTTAACCTGAACGGGGAGAAGGTGGGGCAGGTGATTCCGTCTCGAGGTCTCCGGCAGGGGGATCCCCTTTCT
CCATACTTGTTTCTGTTATGTGCTGAAGGTCTGTCAAGTCTATTGCGTGGTGCTGAGCGGAGGCTCAGGTTTCGGGTGGGGCATTCTAGTCCGTCGATCTCACACCTTTT
CTTCGTGGATGACAGTCTTCTCTTTTTCAGGGAGAATGGGAGTGAAGCATCGGTTATTCGGGAACTGTTGCTATGGTATGAAAAAGCTTCAGGTCAGACTATCAATTACG
AGATATCTGTTGTGGCTTTTAGCCCAAATACGGAGGAGGGGGCTCAACAGTATATTAGCCAAATCCTATCTGTGTCCCTCTGTCCTTGCCACCAGCAGTATCTTGGTTTG
CCTTCATTTATGCCACGGAACCGATCAGGGACGTTGAGATTTTTAAAGGATCGTATATGGCGCCAGATCCAGGGTTGGAAGGGTAAATTCTTTTCGATAGCAGGGAAGGA
AGTCCTACTTAAATCCATAGTTCAGGCTATCCCTTGCTATACGATGAACTGCTTCCGGTTGCCCAAGGGTTTGGTGAAAGAGATTCACAGTGCTATGGCTAAGTTTTGGT
GGAGGGGGGTCGTTATTTCCCCAGTCAGGGTTCTTGGAGGCAGGTCTTGGCTTGCGACCGTCGTTTGTATGGCGCAGTTTGTTGTCGGGGCGGGAGCTCTTAGTTCGGGT
TGTCGGTGGAGGATTGGTAATGGGCGGTCTACGCCGGTTTATGGTTCAAACTGGTTGCCGAACGAGTTTTGTCTTCAAATACAGTCGGTCCCGTCACTTCCTGCTGCTAG
TGTGGTTAGTGATCTTTTTGCTGCGTCTGGTGGGTGGGATGAGGCTGTGCTCAGAGCCCATTTTGATTTGTCGGATCGTGGGCCATCTTGA
Protein sequenceShow/hide protein sequence
MEVAQVVVDYFQHLFASSIPSVQDFDVALRDLCPTVSDDMNQALLRPFSEEEILLALRQTHPNKAPGPDGLSGSFYRNHWDIVGSDITQSCLAVLNHGCSPGEVNDTLVV
LIPKTKAARWVADFRPISLCNSAFIPGRCVVDNAILGFECIHELRKRARGRAKWAALKLDMSKAYDRVEWPFLREVMLRLGSVSFSFNLNGEKVGQVIPSRGLRQGDPLS
PYLFLLCAEGLSSLLRGAERRLRFRVGHSSPSISHLFFVDDSLLFFRENGSEASVIRELLLWYEKASGQTINYEISVVAFSPNTEEGAQQYISQILSVSLCPCHQQYLGL
PSFMPRNRSGTLRFLKDRIWRQIQGWKGKFFSIAGKEVLLKSIVQAIPCYTMNCFRLPKGLVKEIHSAMAKFWWRGVVISPVRVLGGRSWLATVVCMAQFVVGAGALSSG
CRWRIGNGRSTPVYGSNWLPNEFCLQIQSVPSLPAASVVSDLFAASGGWDEAVLRAHFDLSDRGPS