; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035086 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035086
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr3:14678892..14681671
RNA-Seq ExpressionLag0035086
SyntenyLag0035086
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]1.4e-16939.49Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MN +LL  F+R E+E  + QM P+KAPG DG PALF+Q++W  VGD      + +LN    + ++N T IALIPKVK P  +S++RPISLC   YK+IAK
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         + NR+K VL  VI E QSAFVP R I DNV+   E ++TIK  + GR   + +KLDM+KAYD VEW FL  +++K+GF   WV  +MDC+ +T  S+L 
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----
         G P   I+P+RG RQG PLSPYLFL+ +E  S L+ GA  RG L G++  +  P ++HL FADDS++F KA+ +      ++   YE+ +G+++     
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----

Query:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                      +I  +L +PVV     YLG+ +   + R+  FQ++K ++W  + GWK K  S  GKE+LIK+V Q IP Y MSCFR+PK LC +L+
Subjt:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
         +MARFWW  ++ KR IHW +W+ LC  K  GGL FR+LE+FNQALLAKQ WR+   P  LV+++ + RY      L   + +N S  WR   W +ELL 
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY
         GLR R+G+G    ++ D W+P  S FK   +    +     V +  + S  W++  L+++   +++  I  IP+++    D  IWHY  NGMYS+++GY
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY

Query:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH
        +LA   +    G  S+  +   ++W  +W  KIP K+K F+WR  +D +P    L+   +  +P+C  C    ESV HA+  C+ +K++         C 
Subjt:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH

Query:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGFAHNLEVMPWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMING
        +   N+F  R +W A QL     E+   A+  W +WN RN F        ++ +SE       ++LL RM    A   SD+ N++  I+G
Subjt:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGFAHNLEVMPWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMING

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]6.8e-16940.97Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MN++LL  F+R E+E  + QM P+KAPG DG PALF+Q++W  VGD      + +LN    + ++N T IALIPKVK P  +S++RPISLC   YK+IAK
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         + NR+K VL  VI ENQSAFVP R I DNV+   E +HTIK  + GR   + +KLDM+KAYD VEW FL  +++K+GF   WV  +MDC+ +T  S+L 
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----
         G P   I+P+RG RQG PLSPYLFL+ +E  S L+ GA  RG L G++  +  P ++HL FADDS++F KA+ E      ++   YE+ SG+++     
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----

Query:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                      +I  +L +PVV    KYLG+ +   + R+  FQ++K ++W  + GWK K  S  GKE+L+K+V Q IP Y MSCFR+PK LC +L+
Subjt:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
         +MARFWW  ++ KR IHW +W+ LC  K  GGL FR+LE+FNQALLAKQ WR+   P  LV+++ + RY      L   + +N S  WR   W +ELL 
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY
         GLR R+GNG    ++ D W+P  S FK   +    +     V +  + S  W++  L+++   +++     IP+++    D  IWHY  NGMYS+++GY
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY

Query:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH
        +LA   +    G  S   +   ++W  +W  KIP K+K F+WR  +D +P    L+   +  +P+C  C    ESV HA+  C+ +K++         C 
Subjt:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH

Query:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGF
            N+F  R +W A QL     E+   A+  W +WN RN F
Subjt:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGF

XP_017250619.1 PREDICTED: uncharacterized protein LOC108221234 [Daucus carota subsp. sativus]1.0e-16939.92Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MN  L   F+  E++R I  M P K+PGPDG  A+F+QQ W  VG +     +D LN    +   N T + LIPKVK+PK + D+RPISLCNV YK IAK
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
        VL+NR+K +L  +I   QSAFVPGR I DN ++ +ECLH ++  R+G++ +V +KLDMSKAYD VEW F+E++L K+GF  QWV+ IM CV S   S  +
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGR-------
        NG    K+ P RG RQGDPLSPYLFL+ +E  S+L+  A +R  + G+K  +  P ISHLFFADDSL+F KAS   + + ++I  +Y + SG+       
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGR-------

Query:  ------------RMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                    R      L M     +  YLG+     + ++  F++IK++VW  L  W+   FS GGKE+L+K+V Q +P Y MSCF++P+  C ++ 
Subjt:  ------------RMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
         ++AR+WWGS   KRKIHW+ W ++  PK  GGL FR    +NQALLAKQ WR+  +P  L+S+V++ +Y    S L +      S+ WR  VW + LL 
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY
         GLR+RIGNG+ T  FKDPW+ +  +F PI        E+V V E+I+    W+   +R      DI++I  IP+S  + AD W WHY S G Y++++GY
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY

Query:  KLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPN
        KL  ++     SSS     +WW   W +KIP+K+  F WR Y++ +P+   L    + +   C +C    +S  HA+  C  ++++ +++          
Subjt:  KLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPN

Query:  RNNFADRIIWLASQLPEEEFEKACIAFWAIWNDRNGFAH
          +F D ++++   L +++ +   +  W IW +RN   H
Subjt:  RNNFADRIIWLASQLPEEEFEKACIAFWAIWNDRNGFAH

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]8.0e-17036.61Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        +NE+LL P+++ EIE  I+QM P+KA GPDGFPALFYQ +W  VG  T    ++ LN    I  WN T+IALIPK+KQP++ISD+RPISLCNVSYKII+K
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         + NR+K V+  VI + QSAFVP R+I DNVI+GHECLHTI S ++G  G   +KLD+SKA+D VEW +LE ++ K+GF+  W++ I+ C+ + + SI +
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASG--------
        NG P     P RG RQGDPLSPYLFLL +E LS+LI+   + G L+GI   +    I+HL FADDSL+F ++   +    R +L  Y +ASG        
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASG--------

Query:  -----------RRMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                   R+  +  +L + +V++ G YLG+ S+FTRRR +                                                        
Subjt:  -----------RRMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
                      RK+HW +W ++C PKE GGLNFR+LE FNQAL+AK VWR   HPNLLVSKV+K +Y    SLL     S  S FW+G++W R+LL 
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY
         GLR R+GNG     F DPW+P+ +TFKP+      +  D TVA FI+    W ++S+ +    ED  +I ++PIS+ N  D W+WHY   G YS+R+GY
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY

Query:  KLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPN
        KL   +K    S+S N +   W  +WK  +P K+K FIWR+ ++ IP+   L   G+   P C+IC    ES+ HA   CKR++QI   +F  + C +  
Subjt:  KLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPN

Query:  RNNFADRIIW--LASQLPEEEFEKACIAFWAIWNDRNGFAHNLEVMP-------------------------------------WQRRS-----------
         +N +   +W  L  QL  ++   A I  W IWNDRN   H  +V P                                     W+  S           
Subjt:  RNNFADRIIW--LASQLPEEEFEKACIAFWAIWNDRNGFAHNLEVMP-------------------------------------WQRRS-----------

Query:  ------------------------------------EVNAMIQGLRLLQRMNISCASICSDSINVVKMINGDIHTTSDVHHWILQIHHMKESFDTFAFTY
                                            E+  +++GL+     N +   + SDS+  +++I  +IHT  D  +W+++I  +   F   +F++
Subjt:  ------------------------------------EVNAMIQGLRLLQRMNISCASICSDSINVVKMINGDIHTTSDVHHWILQIHHMKESFDTFAFTY

Query:  VSRQGNRQADFLAKEVCLIKDQCFGW
         SRQ NR A  LAK         + W
Subjt:  VSRQGNRQADFLAKEVCLIKDQCFGW

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]1.5e-16839.59Show/hide
Query:  NEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAKV
        N+ L   F+ +E+   +K +   K+PG DG  A+FY   W+ VG+L     +D+LN       +N T I LIPK+K+PK + D+RPISLCNV+YKII+K+
Subjt:  NEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAKV

Query:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN
        L  R K VL  VI E QSAF+  R I DN++V  E +H++K +  G +G+  +KLDMSKA+D VEW FL  ++ K+GF  + + LIM C+ +   S LIN
Subjt:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN

Query:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGR--------
        G  +  ++P+RG RQGDPLSPYLFL+ SE LS L+      G L G+   +  P I+HL FADDSL+F +A+     + +  L +Y +ASG+        
Subjt:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGR--------

Query:  -----------RMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLHN
                   ++    +LGMP+      YLG+ +   R +   F NIK+R+W  +  W  K FS GGKEVL+K+V Q IP Y MSCFRL    C  +  
Subjt:  -----------RMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLHN

Query:  MMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQC
        MMARFWWGSS   +KIHWK W  LC  K  GGL FR    FNQA LAKQ WR+F  PN L+S+V+KGRY HQ   ++  +    S+ W+G VW RELL  
Subjt:  MMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQC

Query:  GLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGYK
        GL  +IG+G       D WIP    FKP+   G+       VA++I+ +  W L  L N  +  DI  I TIP+S  ++ D W WHY S+G Y++++GY 
Subjt:  GLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGYK

Query:  LARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPNR
        LA S++    SSS+ +Q  WW L W   +P KV+ F WR     +P    L+   +  S  CS+C    ES+ HAL  C  +K +      ++D    + 
Subjt:  LARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPNR

Query:  NNFADRIIWLASQLPEEEFEKACIAFWAIWNDRNGFAHNLEVM-PWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMI
            D +++L++ L + E EK     W IW+DRN + H  ++  P    S+  A +     ++       S  +     VK +
Subjt:  NNFADRIIWLASQLPEEEFEKACIAFWAIWNDRNGFAHNLEVM-PWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMI

TrEMBL top hitse value%identityAlignment
A0A251NPF0 Reverse transcriptase domain-containing protein6.6e-17039.49Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MN +LL  F+R E+E  + QM P+KAPG DG PALF+Q++W  VGD      + +LN    + ++N T IALIPKVK P  +S++RPISLC   YK+IAK
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         + NR+K VL  VI E QSAFVP R I DNV+   E ++TIK  + GR   + +KLDM+KAYD VEW FL  +++K+GF   WV  +MDC+ +T  S+L 
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----
         G P   I+P+RG RQG PLSPYLFL+ +E  S L+ GA  RG L G++  +  P ++HL FADDS++F KA+ +      ++   YE+ +G+++     
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----

Query:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                      +I  +L +PVV     YLG+ +   + R+  FQ++K ++W  + GWK K  S  GKE+LIK+V Q IP Y MSCFR+PK LC +L+
Subjt:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
         +MARFWW  ++ KR IHW +W+ LC  K  GGL FR+LE+FNQALLAKQ WR+   P  LV+++ + RY      L   + +N S  WR   W +ELL 
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY
         GLR R+G+G    ++ D W+P  S FK   +    +     V +  + S  W++  L+++   +++  I  IP+++    D  IWHY  NGMYS+++GY
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY

Query:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH
        +LA   +    G  S+  +   ++W  +W  KIP K+K F+WR  +D +P    L+   +  +P+C  C    ESV HA+  C+ +K++         C 
Subjt:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH

Query:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGFAHNLEVMPWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMING
        +   N+F  R +W A QL     E+   A+  W +WN RN F        ++ +SE       ++LL RM    A   SD+ N++  I+G
Subjt:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGFAHNLEVMPWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMING

A0A5E4FZN9 PREDICTED: retrotransposon3.3e-16940.97Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MN++LL  F+R E+E  + QM P+KAPG DG PALF+Q++W  VGD      + +LN    + ++N T IALIPKVK P  +S++RPISLC   YK+IAK
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         + NR+K VL  VI ENQSAFVP R I DNV+   E +HTIK  + GR   + +KLDM+KAYD VEW FL  +++K+GF   WV  +MDC+ +T  S+L 
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----
         G P   I+P+RG RQG PLSPYLFL+ +E  S L+ GA  RG L G++  +  P ++HL FADDS++F KA+ E      ++   YE+ SG+++     
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----

Query:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                      +I  +L +PVV    KYLG+ +   + R+  FQ++K ++W  + GWK K  S  GKE+L+K+V Q IP Y MSCFR+PK LC +L+
Subjt:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
         +MARFWW  ++ KR IHW +W+ LC  K  GGL FR+LE+FNQALLAKQ WR+   P  LV+++ + RY      L   + +N S  WR   W +ELL 
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY
         GLR R+GNG    ++ D W+P  S FK   +    +     V +  + S  W++  L+++   +++     IP+++    D  IWHY  NGMYS+++GY
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY

Query:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH
        +LA   +    G  S   +   ++W  +W  KIP K+K F+WR  +D +P    L+   +  +P+C  C    ESV HA+  C+ +K++         C 
Subjt:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH

Query:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGF
            N+F  R +W A QL     E+   A+  W +WN RN F
Subjt:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGF

A0A803NHG3 Uncharacterized protein6.6e-17041.41Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MNE LL PF   E+ + IK+MHP+KAPG DG PALFYQ+FWS+V        +++L+    +G  N+T  ALIPKV++P  +++YRPISLCNV YKI++K
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         L NR++  L  VI E+QSAFV GR I DN IVG+E LH ++  R      + +KLDM+KAYD VEW FL+ +++++G+H QW+  IM CV S   S LI
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRMVISNL
        NG    K++P+RG RQGDPLSP+LFL  +EV S L+        L G++ G+    +SHLFFADDS++F+  + E    F ++L      +  ++ ++  
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRMVISNL

Query:  LGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHW
        LG+ VV+N GKYLG+ S   R ++  F  IK RVW  L+GWK   FS G KEVLIK++ Q IP Y MSC+RL K+    +H M ARFWWGS+  K+KIHW
Subjt:  LGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHW

Query:  KRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDP
         +W+ LC PKE GGL FR+LE FNQALLAKQ+WR    P  L SKV+K  Y    S+L+    ++ S  WR  VW +E++  G R R+GNG+   + +DP
Subjt:  KRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDP

Query:  WIPKESTFK-----PIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGYKLARSIKVGTKSSS
        W+P+ ++FK     P P       + + V +   PS  W  S +R    +ED  +I  +P    +  D+ +WHY  NG Y++R+GY++A  I+    +  
Subjt:  WIPKESTFK-----PIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGYKLARSIKVGTKSSS

Query:  NNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDT-ESVDHALCGCKRSKQICDMIFRRVDCHIPNRNNFADRIIWLAS
            + WW  LWK K+P KVKHF W+     +P++  L    +   P C  C     E++ HAL GC  +K I  +   + +       +    ++ LA 
Subjt:  NNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDT-ESVDHALCGCKRSKQICDMIFRRVDCHIPNRNNFADRIIWLAS

Query:  QLPEEEFEKACIAFWAIWNDRNGFAHN------LEVMPW
         + ++ +E   +  W +W  RN   H        EV+ W
Subjt:  QLPEEEFEKACIAFWAIWNDRNGFAHN------LEVMPW

A0A803PV25 Uncharacterized protein1.1e-17242.4Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MN  LL  F   E+ R +K+M+P+KAPG DG PALFYQ+FWS++        +++LN    +   NDT +ALIPKV +P+ I ++RPISLCNV YKI++K
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         L NRM++ L  V+ ++QSAF+ GR I DN IVG+E LH ++  R      V +KLDM+KAYD VEW FLE +++K+G+   WV  IM+C+ S Q S +I
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGR-------
        NG    ++LP+RG RQGDPLSP+LFLL +E  S LI  A  +G L G+  G+    +SHLFFADDSLVF  A+ ++   FR +L  Y  ASG+       
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGR-------

Query:  ------------RMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                    R  ++  +G+ VV+N GKYLG+ S   R ++  F+ I  +VW  L+GWK  FFS  GKEVLIK++ Q IP Y MSCFRLPK     +H
Subjt:  ------------RMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
        +M ARFWWGSSE   KIHW +W  LC  KE GGL FR+L  FNQALLAKQ+WR   +PN L SKV+K  Y     +L     ++ S  WR  VW ++++Q
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFK-----PIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYS
         G R RIGNG    +  DPW+P+  TFK     P+P       +++ V +    +  W    +R V    D  +I  +  S  +  D+ +WHY  +G YS
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFK-----PIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYS

Query:  MRNGYKLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDT-ESVDHALCGCKRSKQICDMIFRRV
        +R+GY++A +++V    S+  A  RWW  LWK KIP KVKHF+W+  +  IP+N  L    +QI P C+ C     E+V HAL  C            RV
Subjt:  MRNGYKLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDT-ESVDHALCGCKRSKQICDMIFRRV

Query:  DCHIPNRNNFADRI------------IWLASQLPEEEFEKACIAFWAIWNDRNGFAH
        +C +   + F+ +I            + ++S L +E+FE   +  W +W  RN   H
Subjt:  DCHIPNRNNFADRI------------IWLASQLPEEEFEKACIAFWAIWNDRNGFAH

M5W5F3 Reverse transcriptase domain-containing protein (Fragment)6.6e-17039.49Show/hide
Query:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK
        MN +LL  F+R E+E  + QM P+KAPG DG PALF+Q++W  VGD      + +LN    + ++N T IALIPKVK P  +S++RPISLC   YK+IAK
Subjt:  MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAK

Query:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI
         + NR+K VL  VI E QSAFVP R I DNV+   E ++TIK  + GR   + +KLDM+KAYD VEW FL  +++K+GF   WV  +MDC+ +T  S+L 
Subjt:  VLVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILI

Query:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----
         G P   I+P+RG RQG PLSPYLFL+ +E  S L+ GA  RG L G++  +  P ++HL FADDS++F KA+ +      ++   YE+ +G+++     
Subjt:  NGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRM-----

Query:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH
                      +I  +L +PVV     YLG+ +   + R+  FQ++K ++W  + GWK K  S  GKE+LIK+V Q IP Y MSCFR+PK LC +L+
Subjt:  --------------VISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLH

Query:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ
         +MARFWW  ++ KR IHW +W+ LC  K  GGL FR+LE+FNQALLAKQ WR+   P  LV+++ + RY      L   + +N S  WR   W +ELL 
Subjt:  NMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQ

Query:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY
         GLR R+G+G    ++ D W+P  S FK   +    +     V +  + S  W++  L+++   +++  I  IP+++    D  IWHY  NGMYS+++GY
Subjt:  CGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGY

Query:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH
        +LA   +    G  S+  +   ++W  +W  KIP K+K F+WR  +D +P    L+   +  +P+C  C    ESV HA+  C+ +K++         C 
Subjt:  KLA---RSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCH

Query:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGFAHNLEVMPWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMING
        +   N+F  R +W A QL     E+   A+  W +WN RN F        ++ +SE       ++LL RM    A   SD+ N++  I+G
Subjt:  IPNRNNFADRIIWLASQLPEEEFEKACIAF--WAIWNDRNGFAHNLEVMPWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMING

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein3.3e-3325Show/hide
Query:  EKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKV-KQPKAISDYRPISLCNVSYKIIAKV
        E L  P + +EI  +I  +   K+PGPDGF A FYQ++  E+       +  +         + +  I LIPK  +      ++RPISL N+  KI+ K+
Subjt:  EKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKV-KQPKAISDYRPISLCNVSYKIIAKV

Query:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN
        L NR++  ++++I+ +Q  F+PG   + N+      +  I   R   +  V I +D  KA+D ++  F+ + L K+G    ++++I         +I++N
Subjt:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN

Query:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASG---------
        G   +    K G RQG PLSP LF +V EVL+  I        + GI+ GK   ++    FADD +V+ +  I        ++  + K SG         
Subjt:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASG---------

Query:  ------RRMVISNLLG---MPVVNNLGKYLGVRSNFTRRRRD----DFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSC--FRLPKNLC
               R   S ++G     + +   KYLG++   TR  +D    +++ + + +      WK    S  G+  ++K       +Y  +    +LP    
Subjt:  ------RRMVISNLLG---MPVVNNLGKYLGVRSNFTRRRRD----DFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSC--FRLPKNLC

Query:  ADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVW
         +L     +F W  ++ + +I      Q     + GG+   + + + +A + K  W
Subjt:  ADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVW

P08548 LINE-1 reverse transcriptase homolog4.1e-3625.81Show/hide
Query:  EKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKV-KQPKAISDYRPISLCNVSYKIIAKV
        E L  P S +EI   I+ +   K+PGPDGF + FYQ F  E+  +    + ++         + + +I LIPK  K P    +YRPISL N+  KI+ K+
Subjt:  EKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKV-KQPKAISDYRPISLCNVSYKIIAKV

Query:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN
        L NR++  ++++I+ +Q  F+PG   + N+      +  I   +   +  + + +D  KA+D ++  F+ R L KIG    +++LI         +I++N
Subjt:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN

Query:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYK------------------ASIEQVWTFRSI
        GV       + G RQG PLSP LF +V EVL+  I    +   + GI  G    +I    FADD +V+ +                   S  ++ T +S+
Subjt:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYK------------------ASIEQVWTFRSI

Query:  LMIYEKASGRRMVISNLLGMPVVNNLGKYLGV--RSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSC--FRLPKNLCAD
          IY   +     + + +   VV    KYLGV    +     +++++ +++ +   +  WK    S  G+  ++K       +Y  +    + P +   D
Subjt:  LMIYEKASGRRMVISNLLGMPVVNNLGKYLGV--RSNFTRRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSC--FRLPKNLCAD

Query:  LHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPN
        L  ++  F W  ++ K +I       L    + GG+   +L  + ++++ K  W  + H N
Subjt:  LHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPN

P0C2F6 Putative ribonuclease H protein At1g657501.7e-3425.72Show/hide
Query:  RRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFREL
        R  +D F  I +RV   + GW+ K  S  G+  L K+V   +P++ MS   LP+++   L  +   F WGS+  K+K H  +W ++C PK+ GGL  R  
Subjt:  RRRRDDFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFREL

Query:  ESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTP---IKSNCSVFWRGY-VWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGA
        +S N+AL++K  WRL    N L + V++ +Y H G +  +     K + S  WR   +  R+++  G+    G+G+    + D W+  +   +       
Subjt:  ESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTP---IKSNCSVFWRGY-VWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGA

Query:  MIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPIS-ATNSADEWIWHYCSNGMYSMRNGYKLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKV
           + V   +   P  GW  + +    T      +  + +   T + D   W +  +G +S+R+ Y++    +V   + ++     ++  LWK ++P++V
Subjt:  MIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRIIETIPIS-ATNSADEWIWHYCSNGMYSMRNGYKLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKV

Query:  KHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGC
        K F+W      + +     +  +  S VC +C+   ES+ H L  C
Subjt:  KHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCGC

P11369 LINE-1 retrotransposable element ORF2 protein2.2e-3727.67Show/hide
Query:  EKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPK-VKQPKAISDYRPISLCNVSYKIIAKV
        + L +P S  EIE VI  +   K+PGPDGF A FYQ F  ++  +    +  +         + +  I LIPK  K P  I ++RPISL N+  KI+ K+
Subjt:  EKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPK-VKQPKAISDYRPISLCNVSYKIIAKV

Query:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN
        L NR++  ++ +I+ +Q  F+PG   + N+      +H I   +   +  + I LD  KA+D ++  F+ ++L + G    ++ +I         +I +N
Subjt:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILIN

Query:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFY---KASIEQVWTF---------------RSI
        G   + I  K G RQG PLSPYLF +V EVL+  I     +  + GI+ GK   +IS L  ADD +V+    K S  ++                  +S+
Subjt:  GVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFY---KASIEQVWTF---------------RSI

Query:  LMIYEKASGRRMVISNLLGMPVVNNLGKYLGVRSNFTRRRRD----DFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSC--FRLPKNLC
          +Y K       I       +V N  KYLGV    T+  +D    +F+++K+ +   L+ WK    S  G+  ++K       +Y  +    ++P    
Subjt:  LMIYEKASGRRMVISNLLGMPVVNNLGKYLGVRSNFTRRRRD----DFQNIKQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSC--FRLPKNLC

Query:  ADLHNMMARFWWGSSE---HKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVW
         +L   + +F W + +    K  +  KR          GG+   +L+ + +A++ K  W
Subjt:  ADLHNMMARFWWGSSE---HKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVW

P93295 Uncharacterized mitochondrial protein AtMg003102.8e-3245.03Show/hide
Query:  IPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKE-LGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLST
        +P+Y MSCFRL K LC  L + M  FWW S E+KRKI W  W +LC  KE  GGL FR+L  FNQALLAKQ +R+   P+ L+S++++ RY    S++  
Subjt:  IPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKE-LGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLST

Query:  PIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPI
         + +  S  WR  +  RELL  GL + IG+G  T ++ D WI  E+   P+
Subjt:  PIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPI

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein1.6e-1925.26Show/hide
Query:  IKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPS---LGWHLSSLRNVV
        +K RY    S+L   ++   S  W   +    LL+ G R  IG+G++  +  D  +    +  P PL      +++T+            W  S +   V
Subjt:  IKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPS---LGWHLSSLRNVV

Query:  TQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGYKL------ARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMG
         Q D   I  I ++ +   D+ IW+Y + G Y++R+GY L           +     S + + R    +W   I  K+KHF+WRA    + +   L   G
Subjt:  TQEDIRIIETIPISATNSADEWIWHYCSNGMYSMRNGYKL------ARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMG

Query:  MQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPNR----NNFADRIIWLASQLPE---EEFEKACIAF--WAIWNDRNGFAHN
        M+I P C  C  + ES++HAL  C        M +R  D  +       N+F + I  + + + +    +F K    +  W IW  RN    N
Subjt:  MQISPVCSICRVDTESVDHALCGCKRSKQICDMIFRRVDCHIPNR----NNFADRIIWLASQLPE---EEFEKACIAF--WAIWNDRNGFAHN

AT4G20520.1 RNA binding;RNA-directed DNA polymerases6.6e-1338.55Show/hide
Query:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWV
        +V R+K ++  +I   Q++F+PGR   DN++   E +H+++ K+ G +GW+ +KLD+ KAYD + W +LE  LI  GF   W+
Subjt:  LVNRMKLVLREVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWV

AT4G29090.1 Ribonuclease H-like superfamily protein7.0e-4727.47Show/hide
Query:  IPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTP
        +P Y M+CF LPK +C  + +++A FWW + +  + +HWK WD L   K  GG+ F+++E+FN ALL KQ+WR+ + P  L++KV K RY H+   L+ P
Subjt:  IPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLSTP

Query:  IKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPI------PLVGAMIREDVTVAEFISPS-LGWHLSSLRNVVTQEDIRIIETI
        + S  S  W+    ++E+L+ G R  +GNG+D ++++  W+  +     +      P   A +   + V++ I  S   W    +  +  + + ++I  +
Subjt:  IKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPI------PLVGAMIREDVTVAEFISPS-LGWHLSSLRNVVTQEDIRIIETI

Query:  PISATNSADEWIWHYCSNGMYSMRNGYKLARSIKVGTKSS----SNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVD
                D + W Y S+G Y++++GY +   I +  +SS    S  +    +  +WKS+   K++HF+W+   + +P    L    +     C  C   
Subjt:  PISATNSADEWIWHYCSNGMYSMRNGYKLARSIKVGTKSS----SNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVD

Query:  TESVDHALCGCKRSKQICDMIFRRVDCHIPNRNNFADRI----IW---LASQLPEEEFEKACIAF--WAIWNDRN
         E+V+H L  C  ++    + +      IP    +AD I     W   L +  P+ E     + +  W +W +RN
Subjt:  TESVDHALCGCKRSKQICDMIFRRVDCHIPNRNNFADRI----IW---LASQLPEEEFEKACIAF--WAIWNDRN

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.0e-3345.03Show/hide
Query:  IPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKE-LGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLST
        +P+Y MSCFRL K LC  L + M  FWW S E+KRKI W  W +LC  KE  GGL FR+L  FNQALLAKQ +R+   P+ L+S++++ RY    S++  
Subjt:  IPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKE-LGGLNFRELESFNQALLAKQVWRLFTHPNLLVSKVIKGRYAHQGSLLST

Query:  PIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPI
         + +  S  WR  +  RELL  GL + IG+G  T ++ D WI  E+   P+
Subjt:  PIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)8.3e-1653.52Show/hide
Query:  LSILINGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDS
        L  +ING P   + P RG RQGDPLSPYLF+L +EVLS L   A ++G L GI+     P+I+HL FADD+
Subjt:  LSILINGVPTDKILPKRGFRQGDPLSPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATGAAAAGTTGCTTACTCCATTTTCAAGGGCCGAGATTGAAAGAGTAATTAAACAAATGCACCCTTCTAAGGCGCCTGGTCCTGACGGGTTTCCTGCCCTTTTCTA
TCAACAATTCTGGTCTGAGGTTGGTGATCTTACTTTTTTGAACTACATGGACATGTTAAATAGGCTTAGATATATCGGAGATTGGAATGACACCCATATTGCTTTGATCC
CTAAGGTCAAGCAACCAAAAGCTATTTCTGATTATAGACCTATTAGTTTATGCAATGTGTCCTATAAAATAATTGCTAAAGTGTTAGTCAATCGCATGAAGCTGGTGTTG
CGAGAGGTGATTTATGAGAACCAATCTGCATTTGTTCCTGGTCGTTCGATTTTTGATAACGTGATTGTTGGCCACGAATGTTTACATACTATCAAAAGCAAACGAACAGG
GCGCCAGGGATGGGTAACGATAAAACTTGATATGAGCAAAGCGTACGATTGTGTGGAATGGTGTTTTTTGGAACGCCTTCTAATTAAAATTGGTTTTCACACACAATGGG
TTAGACTCATAATGGATTGTGTTCAGAGCACTCAGCTATCCATTCTGATAAATGGCGTGCCAACGGACAAAATTCTGCCCAAACGTGGCTTTCGTCAAGGGGACCCGTTG
TCACCTTATCTTTTTTTGCTCGTTTCGGAAGTACTATCCTCTCTTATTTCAGGTGCTGTTGATAGAGGTCATCTCTCTGGTATCAAACCAGGAAAATTTTGCCCACAAAT
TTCTCATCTTTTCTTTGCAGATGACAGTCTTGTCTTCTATAAAGCATCAATTGAACAAGTTTGGACTTTTCGGTCCATCTTGATGATTTATGAGAAGGCTTCGGGCAGAA
GAATGGTGATTTCCAATTTGTTGGGCATGCCGGTGGTTAATAATCTTGGGAAATACCTAGGGGTTCGGTCTAATTTTACTAGACGAAGAAGAGATGATTTTCAGAATATT
AAGCAAAGAGTGTGGATGACTTTACAAGGTTGGAAAAGAAAATTTTTCTCTACAGGTGGTAAGGAAGTGTTGATTAAAAGCGTTGCTCAATATATTCCAATGTATATCAT
GAGCTGTTTTCGTCTTCCCAAAAATCTATGTGCAGATCTCCATAATATGATGGCACGATTCTGGTGGGGTTCGTCGGAGCACAAAAGGAAGATTCATTGGAAACGTTGGG
ATCAACTATGCTTACCTAAGGAGTTAGGGGGCTTAAATTTCCGTGAATTGGAATCGTTCAATCAAGCTCTATTGGCAAAACAGGTGTGGCGGCTTTTTACTCACCCAAAT
TTACTTGTGTCAAAAGTAATTAAGGGAAGATATGCACATCAAGGGTCCTTACTATCCACGCCAATTAAATCAAATTGCTCTGTCTTTTGGAGGGGATATGTATGGGCTAG
GGAACTGCTCCAATGTGGATTGCGCAAACGTATAGGGAATGGTAAGGATACTTTGTTATTTAAGGACCCATGGATTCCCAAGGAGAGCACTTTCAAACCAATCCCACTTG
TAGGGGCGATGATCAGGGAGGATGTTACGGTTGCTGAATTCATATCTCCATCATTGGGTTGGCATTTGAGTAGTTTGAGAAATGTGGTAACTCAAGAAGATATCAGAATT
ATTGAAACCATTCCAATTAGTGCTACTAACAGTGCAGATGAATGGATATGGCACTATTGCTCTAATGGAATGTATTCCATGCGCAATGGGTATAAATTAGCTCGATCTAT
TAAAGTGGGTACGAAATCGTCCAGTAACAACGCTCAGCGAAGATGGTGGGTTTTGTTGTGGAAGTCTAAAATCCCCCAAAAGGTGAAACATTTTATTTGGAGAGCATATT
ATGATTGTATTCCATCAAATTATTGCCTATGGAAGATGGGGATGCAAATTTCTCCCGTGTGTAGTATTTGCAGAGTCGATACGGAAAGTGTTGACCATGCTTTGTGTGGT
TGTAAACGATCCAAACAAATTTGTGACATGATATTCCGCAGAGTGGATTGCCACATTCCCAATCGGAATAATTTTGCCGACAGAATTATTTGGCTTGCAAGCCAGCTACC
AGAAGAAGAATTTGAGAAAGCATGCATTGCTTTTTGGGCTATTTGGAACGATAGAAATGGTTTTGCTCATAATCTAGAGGTAATGCCATGGCAAAGGCGATCTGAAGTAA
ACGCCATGATTCAAGGTCTCCGACTTCTTCAACGAATGAACATTTCGTGTGCATCAATTTGCTCGGATTCGATAAATGTGGTTAAGATGATAAATGGGGATATTCATACT
ACATCGGATGTGCATCATTGGATTCTACAAATCCATCATATGAAGGAATCTTTTGATACTTTTGCTTTTACTTATGTTTCAAGGCAAGGTAATAGGCAAGCAGATTTCCT
AGCTAAAGAGGTTTGTCTTATCAAAGATCAATGCTTTGGGTGGGAAACTTCCCTCCGACATTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATGAAAAGTTGCTTACTCCATTTTCAAGGGCCGAGATTGAAAGAGTAATTAAACAAATGCACCCTTCTAAGGCGCCTGGTCCTGACGGGTTTCCTGCCCTTTTCTA
TCAACAATTCTGGTCTGAGGTTGGTGATCTTACTTTTTTGAACTACATGGACATGTTAAATAGGCTTAGATATATCGGAGATTGGAATGACACCCATATTGCTTTGATCC
CTAAGGTCAAGCAACCAAAAGCTATTTCTGATTATAGACCTATTAGTTTATGCAATGTGTCCTATAAAATAATTGCTAAAGTGTTAGTCAATCGCATGAAGCTGGTGTTG
CGAGAGGTGATTTATGAGAACCAATCTGCATTTGTTCCTGGTCGTTCGATTTTTGATAACGTGATTGTTGGCCACGAATGTTTACATACTATCAAAAGCAAACGAACAGG
GCGCCAGGGATGGGTAACGATAAAACTTGATATGAGCAAAGCGTACGATTGTGTGGAATGGTGTTTTTTGGAACGCCTTCTAATTAAAATTGGTTTTCACACACAATGGG
TTAGACTCATAATGGATTGTGTTCAGAGCACTCAGCTATCCATTCTGATAAATGGCGTGCCAACGGACAAAATTCTGCCCAAACGTGGCTTTCGTCAAGGGGACCCGTTG
TCACCTTATCTTTTTTTGCTCGTTTCGGAAGTACTATCCTCTCTTATTTCAGGTGCTGTTGATAGAGGTCATCTCTCTGGTATCAAACCAGGAAAATTTTGCCCACAAAT
TTCTCATCTTTTCTTTGCAGATGACAGTCTTGTCTTCTATAAAGCATCAATTGAACAAGTTTGGACTTTTCGGTCCATCTTGATGATTTATGAGAAGGCTTCGGGCAGAA
GAATGGTGATTTCCAATTTGTTGGGCATGCCGGTGGTTAATAATCTTGGGAAATACCTAGGGGTTCGGTCTAATTTTACTAGACGAAGAAGAGATGATTTTCAGAATATT
AAGCAAAGAGTGTGGATGACTTTACAAGGTTGGAAAAGAAAATTTTTCTCTACAGGTGGTAAGGAAGTGTTGATTAAAAGCGTTGCTCAATATATTCCAATGTATATCAT
GAGCTGTTTTCGTCTTCCCAAAAATCTATGTGCAGATCTCCATAATATGATGGCACGATTCTGGTGGGGTTCGTCGGAGCACAAAAGGAAGATTCATTGGAAACGTTGGG
ATCAACTATGCTTACCTAAGGAGTTAGGGGGCTTAAATTTCCGTGAATTGGAATCGTTCAATCAAGCTCTATTGGCAAAACAGGTGTGGCGGCTTTTTACTCACCCAAAT
TTACTTGTGTCAAAAGTAATTAAGGGAAGATATGCACATCAAGGGTCCTTACTATCCACGCCAATTAAATCAAATTGCTCTGTCTTTTGGAGGGGATATGTATGGGCTAG
GGAACTGCTCCAATGTGGATTGCGCAAACGTATAGGGAATGGTAAGGATACTTTGTTATTTAAGGACCCATGGATTCCCAAGGAGAGCACTTTCAAACCAATCCCACTTG
TAGGGGCGATGATCAGGGAGGATGTTACGGTTGCTGAATTCATATCTCCATCATTGGGTTGGCATTTGAGTAGTTTGAGAAATGTGGTAACTCAAGAAGATATCAGAATT
ATTGAAACCATTCCAATTAGTGCTACTAACAGTGCAGATGAATGGATATGGCACTATTGCTCTAATGGAATGTATTCCATGCGCAATGGGTATAAATTAGCTCGATCTAT
TAAAGTGGGTACGAAATCGTCCAGTAACAACGCTCAGCGAAGATGGTGGGTTTTGTTGTGGAAGTCTAAAATCCCCCAAAAGGTGAAACATTTTATTTGGAGAGCATATT
ATGATTGTATTCCATCAAATTATTGCCTATGGAAGATGGGGATGCAAATTTCTCCCGTGTGTAGTATTTGCAGAGTCGATACGGAAAGTGTTGACCATGCTTTGTGTGGT
TGTAAACGATCCAAACAAATTTGTGACATGATATTCCGCAGAGTGGATTGCCACATTCCCAATCGGAATAATTTTGCCGACAGAATTATTTGGCTTGCAAGCCAGCTACC
AGAAGAAGAATTTGAGAAAGCATGCATTGCTTTTTGGGCTATTTGGAACGATAGAAATGGTTTTGCTCATAATCTAGAGGTAATGCCATGGCAAAGGCGATCTGAAGTAA
ACGCCATGATTCAAGGTCTCCGACTTCTTCAACGAATGAACATTTCGTGTGCATCAATTTGCTCGGATTCGATAAATGTGGTTAAGATGATAAATGGGGATATTCATACT
ACATCGGATGTGCATCATTGGATTCTACAAATCCATCATATGAAGGAATCTTTTGATACTTTTGCTTTTACTTATGTTTCAAGGCAAGGTAATAGGCAAGCAGATTTCCT
AGCTAAAGAGGTTTGTCTTATCAAAGATCAATGCTTTGGGTGGGAAACTTCCCTCCGACATTGA
Protein sequenceShow/hide protein sequence
MNEKLLTPFSRAEIERVIKQMHPSKAPGPDGFPALFYQQFWSEVGDLTFLNYMDMLNRLRYIGDWNDTHIALIPKVKQPKAISDYRPISLCNVSYKIIAKVLVNRMKLVL
REVIYENQSAFVPGRSIFDNVIVGHECLHTIKSKRTGRQGWVTIKLDMSKAYDCVEWCFLERLLIKIGFHTQWVRLIMDCVQSTQLSILINGVPTDKILPKRGFRQGDPL
SPYLFLLVSEVLSSLISGAVDRGHLSGIKPGKFCPQISHLFFADDSLVFYKASIEQVWTFRSILMIYEKASGRRMVISNLLGMPVVNNLGKYLGVRSNFTRRRRDDFQNI
KQRVWMTLQGWKRKFFSTGGKEVLIKSVAQYIPMYIMSCFRLPKNLCADLHNMMARFWWGSSEHKRKIHWKRWDQLCLPKELGGLNFRELESFNQALLAKQVWRLFTHPN
LLVSKVIKGRYAHQGSLLSTPIKSNCSVFWRGYVWARELLQCGLRKRIGNGKDTLLFKDPWIPKESTFKPIPLVGAMIREDVTVAEFISPSLGWHLSSLRNVVTQEDIRI
IETIPISATNSADEWIWHYCSNGMYSMRNGYKLARSIKVGTKSSSNNAQRRWWVLLWKSKIPQKVKHFIWRAYYDCIPSNYCLWKMGMQISPVCSICRVDTESVDHALCG
CKRSKQICDMIFRRVDCHIPNRNNFADRIIWLASQLPEEEFEKACIAFWAIWNDRNGFAHNLEVMPWQRRSEVNAMIQGLRLLQRMNISCASICSDSINVVKMINGDIHT
TSDVHHWILQIHHMKESFDTFAFTYVSRQGNRQADFLAKEVCLIKDQCFGWETSLRH