CuGenDBv2

Lag0002547 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0002547
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr4:43734458..43740538
RNA-Seq Expression: Lag0002547
Synteny: Lag0002547
Gene Ontology terms:
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR002156 - Ribonuclease H domain
  IPR036397 - Ribonuclease H superfamily
  IPR043502 - DNA/RNA polymerase superfamily
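
The genome location field uses a `chromosome:start..end` notation. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state this), the field can be split into its parts and the genomic span computed. The names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """A chromosome interval; coordinates are assumed 1-based and inclusive."""
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive interval, so both end points are counted.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a 'chrom:start..end' string as shown in the gene record."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    chrom, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(chrom, start, end)


loc = parse_location("chr4:43734458..43740538")
print(loc.chrom, loc.start, loc.end, loc.length)  # chr4 43734458 43740538 6081
```

Under the inclusive-coordinate assumption, Lag0002547 spans 6,081 bp on chr4.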


Homology
GenBank top hits | e value | % identity | Alignment
VVA38592.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis] | 7.3e-118 | 44.73
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        M+C++TVSYS L+NG P     P+RG+RQGDPLSPYLFLLCAEGF+ LL + E    L G+ I +  P +   FFADDS +F KA D +   LK I + Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +KS    S N+H   +     +L + RV+S   +LG+P  +G++K   F  LK+ V K L+GW+E+  S+ GKEVL+K VAQSIP Y MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CF LP  +C +I+++ ARFWWG  G+ RK+HWM WER+C+ K +GGMGFR ++AFN AMLAKQ WRL+ NP+SL  ++L+ KYF   NF EA +G+ PS 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIAN-DSVKGKRVNWLL--DENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIW
        +W+SI   R +   G R++IG+G  ++I  D W+ RP +   I +  D ++  +V+ L+  + + +W   ++  LFLP +  DI+ I +  R   D I+W
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIAN-DSVKGKRVNWLL--DENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIW

Query:  HPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        + DK G+F VKSAY +A+ +    E  S S  S     WR +W++ V  + KI  W++ +DI+P+
Subjt:  HPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] | 1.2e-117 | 46.57
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        M CI++VSYSIL+NG       P+RG+RQGDP+SPY+FLLCA+GFS+LL        +SG+ I + CP +   FFADDSL+F KA  ++ QTL  IL+ Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCE--RILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYT
        E  SGQ IN++KS+   S N    D++ CE  R+L   +     ++LG+PS IGKSK  +FA++K+ VE+ L GWKEKL S+GG+E+LIKAVAQ+IPTYT
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCE--RILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYT

Query:  MSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSP
        MSCF++P  +C++I+ +  RFWWG  G + K+ W+SW+++C+ K  GGMGFR ++AFN AMLAKQ WRLI NPNSL+ +I + +Y+  G+  +A++G SP
Subjt:  MSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSP

Query:  STIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIANDSVKG-KRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII
        S  WRSI  G ++  +G RWR+GNG  I I +D W+  P +   I          RV+ L+D E  +WK+D +R LFLP EA  IL+I +      D+II
Subjt:  STIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIANDSVKG-KRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII

Query:  WHPDKKGIFKVKSAYHLAMD-LKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        W  ++KG F VKSAY++A+  +   E    S G      WR LW   + P+ +I  WK+  + +P+
Subjt:  WHPDKKGIFKVKSAYHLAMD-LKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

XP_023912060.1 uncharacterized protein LOC112023667 [Quercus suber] | 6.8e-116 | 39.37
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        M C++TV+YS+L+NG P+   +P+RGIRQGDP+SPYLFLLCAEG SA+LK+EE   ++ G+ + +  P +    FADDSLIF KA   +   + ++LK+Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E+ SGQ +N EK++   S+N   + + + + +   + +    Q+LG+PS IGK K+  F KLKD V K + GWK K+ S  G+E LIKAVAQ+  TYTMS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CF+LP  +C ++  + ++FWWG   ++RK+ W++W+++C+ K  GGMGFR ++AFN A+LAKQ WRL++N NSL  ++ + KYF +  F++AQ+G  PS 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIANDSV-KGKRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIWH
         WRSI   +D+  +G RW IGNG  ++I +D W+  P S   +    ++  G  V+ L+D E H WK D I + FLPHEA+ IL I +      D ++W 
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIANDSV-KGKRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIWH

Query:  PDKKGIFKVKSAYHLAMDLKYKEKASQ-SDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPSNWSPRD---------------------YWNWMVDHL
            G F V+SAYH+A  L   +   Q S+ S M   W+ +W        +   W+   +I+P+    RD                     +W   +   
Subjt:  PDKKGIFKVKSAYHLAMDLKYKEKASQ-SDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPSNWSPRD---------------------YWNWMVDHL

Query:  NNEEIAKGSIIM---WSIWNHRNKIQAASSRGAAEFLINDA
         N       I++   W IWN+RN I   ++   A  +INDA
Subjt:  NNEEIAKGSIIM---WSIWNHRNKIQAASSRGAAEFLINDA

XP_030958760.1 uncharacterized protein LOC115980671 [Quercus lobata] | 2.6e-115 | 45.49
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        M CI++VSYS+LING       PSRG+RQGDPLSPYLFLLCA+GFS+L+ +      LSGL I +  P +   FFADDSL+F KA   + Q L  IL  Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +KS+   S N   + +     IL   + +  G++LG+PS IGKSK  VFA++K+ V + L GW E L S+GG+E LIKAVAQ+IPTY MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CF LP  +CDDI+ +  RFWWG  G + K+ W+SW+R+C++K QGGMGFR ++AFN AMLAKQ WRL+ NP+SL+ ++ R KY+  G+ + A +G  PS 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGS---RTPIIANDSVKGKRVNWLLDEN-HKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII
         WRSI  G ++  KG RWR+GNG +I I  D W+  P +    +P    D      V+ L+D +  +W+ D ++ +FLP EA+ ILNI + +    D +I
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGS---RTPIIANDSVKGKRVNWLLDEN-HKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII

Query:  WHPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        W  ++KG+F VKSAY++A++L     +   S G  + + W+ +W   +  + KI  W+   D +P+
Subjt:  WHPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

XP_030970961.1 uncharacterized protein LOC115991405 [Quercus lobata] | 2.6e-115 | 45.49
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        M CI++VSYS+LING       PSRG+RQGDPLSPYLFLLCA+GFS+L+ +      LSGL I +  P +   FFADDSL+F KA   + Q L  IL  Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +KS+   S N   + +     IL   + +  G++LG+PS IGKSK  VFA++K+ V + L GW E L S+GG+E LIKAVAQ+IPTY MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CF LP  +CDDI+ +  RFWWG  G + K+ W+SW+R+C++K QGGMGFR ++AFN AMLAKQ WRL+ NP+SL+ ++ R KY+  G+ + A +G  PS 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGS---RTPIIANDSVKGKRVNWLLDEN-HKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII
         WRSI  G ++  KG RWR+GNG +I I  D W+  P +    +P    D      V+ L+D +  +W+ D ++ +FLP EA+ ILNI + +    D +I
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGS---RTPIIANDSVKGKRVNWLLDEN-HKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII

Query:  WHPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        W  ++KG+F VKSAY++A++L     +   S G  + + W+ +W   +  + KI  W+   D +P+
Subjt:  WHPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

TrEMBL top hits | e value | % identity | Alignment
A0A2N9EYC3 Reverse transcriptase domain-containing protein | 4.3e-116 | 43.78
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        MEC+STVSYSIL+NG P    KPSRG+RQGDPLSPYLFLLCAEGF +L+++E++   L G+ I++  P +   FFADDSL+F KA   D+  ++ IL +Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +K+    S++     +   + +L +  +    ++LG+PS +G++K + FA++K+ V   L+GWKEKL S  G+E+LIK+VAQ+IP Y MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CFRLPN +  +I+ L  RFWWG  G+K KMHW+ W  +C++K+ GG+G R +  FN+A+LAKQ WRL+ NP+SL FK+ + KYF   + +E Q     S 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI----NRPGSRTPIIANDSVKGKRVNWLLDENHK-WKEDRIRQLFLPHEAEDILNISVGFRNARDEI
         WRSI   RDL  KG  WR+G G  I+I  D W+    N     TP + N S+    V  L+D + K WK + +++LFLP EA  IL I + FRN  D +
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI----NRPGSRTPIIANDSVKGKRVNWLLDENHK-WKEDRIRQLFLPHEAEDILNISVGFRNARDEI

Query:  IWHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        +W   K+G++ V+S YHL  + + +++   SD +KM + W+++WS  +  +T+  +W+  +  +P+
Subjt:  IWHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

A0A2N9FN80 Uncharacterized protein | 2.1e-118 | 43.87
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        MECIS+VSYSIL+NG P    KPSRG+RQGDPLSPYLFLLCAEGF +LL++E+   +L G+ I++  P +   FFADDSL+F +A   D+  ++ IL +Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +K+    S++     K   + +L +  +    ++LG+PS IG++K + FA++K+ V   L+GWKEKL S  G+E+LIK+VAQ+IP Y MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CFRLPN +  +I+ L  RFWWG  GDK KMHW+ W  +C++K  GG+GFR++ +FN+A+LAKQ WRL+ N +SL +K+ + KYF   + +EAQ+ ++ S 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRP---GSRTPIIANDSVKGKRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII
         W+SI   RDL  KG  WR+G+G  IQI  D W+  P      +P   N S+   +V  L+D ++  WKE+ IR++FLPH+A  IL + +  R+  D ++
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRP---GSRTPIIANDSVKGKRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII

Query:  WHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        W   K G++ V+S YH  M  K +  A  SD +++ + W ++WS  + P+ +  +W+  ++ +P+
Subjt:  WHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

A0A2N9GI95 Reverse transcriptase domain-containing protein | 1.8e-117 | 35.05
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        MECISTVSYSIL+NG P    KPSRG+RQGDPLSPYLFLLCAEGF +LL++E+    L G+ I++  P +   FFADDSL+F KA   D++ ++ IL +Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +K+    S++     +   + +L +  +    ++LG+PS IG++K + FA++K+ V   L+GWKEKL S  G+E+LIK+VAQ+IP Y MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CFRLPN +  +I+ L  RFWWG  GDK KMHW+SW  +C++K  GG+GFR++  FN+A+LAKQ WRL+ NP+SL +K+ + KYF   + +EA   ++ S 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSR---TPIIANDSVKGKRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII
         W+SI   RDL  KG  WR+GNG  I I  D W+  P ++   +P I   ++    V  L+D ++  WKE+ IR+ FLPH+A  I+ I +      D ++
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSR---TPIIANDSVKGKRVNWLLD-ENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEII

Query:  WHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPSN----------------------------------
        W   + G + V+S YHL +  K +      D ++M   W S+WS  V P+T+ C+W+  ++ +P+                                   
Subjt:  WHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPSN----------------------------------

Query:  --------WSPR----------DYWNWMVDHLNNEEIAKGSIIMWSIWNHRNKIQAASSRGAAEFLINDAEEPRESCHLEETETKL-LEAKRRRGLVRAK
                W  R          D W+     L+  E+   S++ W IW HRN+++          ++  A+E       E+    + L   +   ++   
Subjt:  --------WSPR----------DYWNWMVDHLNNEEIAKGSIIMWSIWNHRNKIQAASSRGAAEFLINDAEEPRESCHLEETETKL-LEAKRRRGLVRAK

Query:  KRRRPRVGGARLEWVLNLFRITKNRSELG---NQNHGSQSNS--------HGIKAIRQTCLQL------NLG---LEIESDALEVIKVLAGDEEDLSELK
        K + P  G  ++ +   +FR T N + LG       G+   S        H I+A+  +  +       +LG   +E+E D+  V+  L       +   
Subjt:  KRRRPRVGGARLEWVLNLFRITKNRSELG---NQNHGSQSNS--------HGIKAIRQTCLQL------NLG---LEIESDALEVIKVLAGDEEDLSELK

Query:  PIAETIVSSSKDLREVSFIHCNRLANSTAHWLARHA
         I E I   ++ L  V F H  R  N+ AH LA+ A
Subjt:  PIAETIVSSSKDLREVSFIHCNRLANSTAHWLARHA

A0A5E4GGB8 PREDICTED: reverse mRNAase (Fragment) | 3.5e-118 | 44.73
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        M+C++TVSYS L+NG P     P+RG+RQGDPLSPYLFLLCAEGF+ LL + E    L G+ I +  P +   FFADDS +F KA D +   LK I + Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +KS    S N+H   +     +L + RV+S   +LG+P  +G++K   F  LK+ V K L+GW+E+  S+ GKEVL+K VAQSIP Y MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CF LP  +C +I+++ ARFWWG  G+ RK+HWM WER+C+ K +GGMGFR ++AFN AMLAKQ WRL+ NP+SL  ++L+ KYF   NF EA +G+ PS 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIAN-DSVKGKRVNWLL--DENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIW
        +W+SI   R +   G R++IG+G  ++I  D W+ RP +   I +  D ++  +V+ L+  + + +W   ++  LFLP +  DI+ I +  R   D I+W
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIAN-DSVKGKRVNWLL--DENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIW

Query:  HPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        + DK G+F VKSAY +A+ +    E  S S  S     WR +W++ V  + KI  W++ +DI+P+
Subjt:  HPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

M5VU98 Reverse transcriptase domain-containing protein | 6.0e-118 | 44.73
Query:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY
        M+C++TVSYS L+NG P     P+RG+RQGDPLSPYLFLLCAEGF+ LL + E    L G+ I +  P +   FFADDS +F KA D +   LK I + Y
Subjt:  MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEY

Query:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS
        E  SGQ IN +KS    S N+H   +     +L + RV+S   +LG+P  +G++K   F  LK+ V K L+GW+E+  S+ GKEVL+K VAQSIP Y MS
Subjt:  EIVSGQIINLEKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMS

Query:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST
        CF LP  +C +I+++ ARFWWG  G+ RK+HWM WER+C+ K +GGMGFR ++AFN AMLAKQ WRL+ NP+SL  ++L+ KYF   NF EA +G+ PS 
Subjt:  CFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPST

Query:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIAN-DSVKGKRVNWLL--DENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIW
        +W+SI   R +   G R++IG+G  ++I  D W+ RP +   I +  D ++  +V+ L+  + + +W   ++  LFLP +  DI+ I +  R   D I+W
Subjt:  IWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIAN-DSVKGKRVNWLL--DENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIW

Query:  HPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS
        + DK G+F VKSAY +A+ +    E  S S  S     WR +W++ V  + KI  W++ +DI+P+
Subjt:  HPDKKGIFKVKSAYHLAMDL-KYKEKASQSDGSKMIKDWRSLWSSGVLPRTKICVWKIVNDIIPS

SwissProt top hits | e value | % identity | Alignment
P08548 LINE-1 reverse transcriptase homolog | 7.1e-15 | 21.57
Query:  SILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEYEIVSGQIIN
        +I++NG   + F    G RQG PLSP LF +  E  +  ++ E++   + G+ I      +++S FADD +++++        L  ++KEY  VSG  IN
Subjt:  SILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEYEIVSGQIIN

Query:  LEKS-AFMVSRNMHAKD--KETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIK--AVAQSIPTYTMSCFRL
          KS AF+ + N  A+   K++    +  K++  LG +L     +    +  +  L+  + + +  WK    S  G+  ++K   + ++I  +     + 
Subjt:  LEKS-AFMVSRNMHAKD--KETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIK--AVAQSIPTYTMSCFRL

Query:  PNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPSTIWRS
        P     D++++   F W      +K   ++   +      GG+    +R + ++++ K +W   +N    ++  +  +      +    I + P    ++
Subjt:  PNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPSTIWRS

Query:  ICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIANDSVKGKRVNWLLDEN
        I WG+D     + W     I  ++  DP +      +P+   DS      +W+ D N
Subjt:  ICWGRDLFAKGYRWRIGNGIMIQIDKDPWINRPGSRTPIIANDSVKGKRVNWLLDEN

P0C2F6 Putative ribonuclease H protein At1g65750 | 6.8e-34 | 27.58
Query:  MPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGG
        MP    +  +  F ++ + V   + GW+EK  S  G+  L KAV  S+P ++MS   LP  I + +D+L   F WG+  +K+K H + W ++C  K +GG
Subjt:  MPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGG

Query:  MGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKY----FKDGNFMEAQIGNSPSTIWRSICWG-RDLFAKGYRWRIGNGIMIQIDKDPWIN------
        +G R  ++ N+A+++K  WRL++  NSL   +L+ KY     +D  ++  +   S S+ WRSI  G RD+ + G  W  G+G  I+   D W++      
Subjt:  MGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKY----FKDGNFMEAQIGNSPSTIWRSICWG-RDLFAKGYRWRIGNGIMIQIDKDPWIN------

Query:  -----RPGSRTPIIANDS-VKGKRVNWLLDENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIWHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGS
             RP     ++A D  + G+  ++   + +     R+    +      +L++  G   ARD + W   + G F V+SAY +           +    
Subjt:  -----RPGSRTPIIANDS-VKGKRVNWLLDENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIWHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGS

Query:  KMIKDWRSLWSSGVLPRTKICVWKIVNDII
         M   +  LW   V  R K  +W + N  +
Subjt:  KMIKDWRSLWSSGVLPRTKICVWKIVNDII

P11369 LINE-1 retrotransposable element ORF2 protein | 2.3e-13 | 23.31
Query:  SILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEYEIVSGQIIN
        +I +NG   E      G RQG PLSPYLF +  E  +  +++++    + G+QI K    ++IS  ADD +++I       + L  ++  +  V G  IN
Subjt:  SILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEYEIVSGQIIN

Query:  LEKS-AFMVSRNMHAKD--KETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIK--AVAQSIPTYTMSCFRL
          KS AF+ ++N  A+   +ET    +    +  LG  + +  ++       F  LK  +++ LR WK+   S  G+  ++K   + ++I  +     ++
Subjt:  LEKS-AFMVSRNMHAKD--KETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIK--AVAQSIPTYTMSCFRL

Query:  PNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRN
        P    ++++    +F W       K   ++   +   +  GG+    ++ + +A++ K +W   R+
Subjt:  PNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRN

P92555 Uncharacterized mitochondrial protein AtMg01250 | 1.1e-12 | 48.53
Query:  LINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDS
        +ING+PQ    PSRG+RQGDPLSPYLF+LC E  S L +R +    L G++++ + P +    FADD+
Subjt:  LINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDS

P93295 Uncharacterized mitochondrial protein AtMg00310 | 1.0e-34 | 44.06
Query:  SIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNK-DQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFME
        ++P Y MSCFRL   +C  +      FWW +  +KRK+ W++W+++C++K D GG+GFR +  FNQA+LAKQS+R+I  P++LL ++LR +YF   + ME
Subjt:  SIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNK-DQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFME

Query:  AQIGNSPSTIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI
          +G  PS  WRSI  GR+L ++G    IG+GI  ++  D WI
Subjt:  AQIGNSPSTIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI

Arabidopsis top hits | e value | % identity | Alignment
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 6.1e-14 | 26.84
Query:  QHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNK
        ++LG+P    K   + +  L + +   +  W  +  S  G+  LI +V  S+  + MS FRLP+    +ID +C+ F W       K   ++W  +C  K
Subjt:  QHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNK

Query:  DQGGMGFRQIRAFNQAMLAKQSWRLIRNP---NSLLFKILRGKYFKDGNFMEAQIGNSPSTIWRSICW---GRDLFAKGYRWRIGNGIMI
        D+GG+G R ++  N+       W +  N    + +  KIL+ +    G F++  I N  +T +    W   GR +   G+R  I  GI +
Subjt:  DQGGMGFRQIRAFNQAMLAKQSWRLIRNP---NSLLFKILRGKYFKDGNFMEAQIGNSPSTIWRSICW---GRDLFAKGYRWRIGNGIMI

AT4G29090.1 Ribonuclease H-like superfamily protein | 5.3e-42 | 32.01
Query:  SIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEA
        ++PTYTM+CF LP  +C  I  + A FWW    + + MHW +W+ +   K +GG+GF+ I AFN A+L KQ WR++  P SL+ K+ + +YF   + + A
Subjt:  SIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEA

Query:  QIGNSPSTIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI-NRPGS------RTPIIANDSVKG-KRVNWLLDEN-HKWKEDRIRQLFLPHEAEDILN
         +G+ PS +W+SI   +++  +G R  +GNG  I I +  W+ ++P S      R P     SV    +V+ L+DE+  +W++D I  LF   E + I  
Subjt:  QIGNSPSTIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI-NRPGS------RTPIIANDSVKG-KRVNWLLDEN-HKWKEDRIRQLFLPHEAEDILN

Query:  ISVGFRNARDEIIWHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKD-WRSLWSSGVLPRTKICVWKIVNDIIP
        +  G R   D   W     G + VKS Y +   +  K  + Q      +   ++ +W S   P+ +  +WK +++ +P
Subjt:  ISVGFRNARDEIIWHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKD-WRSLWSSGVLPRTKICVWKIVNDIIP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 7.4e-36 | 44.06
Query:  SIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNK-DQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFME
        ++P Y MSCFRL   +C  +      FWW +  +KRK+ W++W+++C++K D GG+GFR +  FNQA+LAKQS+R+I  P++LL ++LR +YF   + ME
Subjt:  SIPTYTMSCFRLPNYICDDIDRLCARFWWGTVGDKRKMHWMSWERMCRNK-DQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFME

Query:  AQIGNSPSTIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI
          +G  PS  WRSI  GR+L ++G    IG+GI  ++  D WI
Subjt:  AQIGNSPSTIWRSICWGRDLFAKGYRWRIGNGIMIQIDKDPWI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 8.0e-14 | 48.53
Query:  LINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDS
        +ING+PQ    PSRG+RQGDPLSPYLF+LC E  S L +R +    L G++++ + P +    FADD+
Subjt:  LINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDS
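
The % identity reported for each hit can be cross-checked from the alignment block itself. The sketch below assumes the unlabelled middle line follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch or gap) and that identity is taken over the full length of the aligned query line; neither point is stated on the page. `midline_stats` is a hypothetical helper name. It uses only the Query and middle lines.

```python
def midline_stats(query: str, midline: str) -> dict:
    """Count identities and positives in one BLAST-style alignment block.

    query   -- the aligned query line (may contain '-' gap characters)
    midline -- the unlabelled match line printed under the query
    """
    # Trailing spaces of the match line are often lost when a page is
    # copied as text, so pad it back to the width of the query line.
    padded = midline.ljust(len(query))
    identities = sum(1 for ch in padded if ch.isalpha())
    positives = identities + padded.count("+")
    return {
        "length": len(query),
        "identities": identities,
        "positives": positives,
        "percent_identity": round(100 * identities / len(query), 2),
    }


# Query and match line copied from the ATMG01250.1 hit above.
query = "LINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDS"
midline = "+ING+PQ    PSRG+RQGDPLSPYLF+LC E  S L +R +    L G++++ + P +    FADD+"
stats = midline_stats(query, midline)
print(stats["identities"], stats["length"], stats["percent_identity"])  # 33 68 48.53
```

For the ATMG01250.1 block this gives 33 identities over 68 aligned columns, i.e. 48.53%, in agreement with the value listed for that hit. For hits whose alignment spans several blocks, the counts would need to be summed across blocks before dividing.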


Sequences
CDS sequence
ATGGAATGCATATCCACAGTCTCGTACTCAATCCTCATTAATGGTTCGCCCCAAGAAGAGTTTAAGCCGAGTAGAGGGATCAGACAAGGGGACCCGCTATCCCCGTACCT
CTTCCTCCTCTGTGCTGAGGGTTTCTCAGCGCTCCTTAAAAGGGAAGAATCCTTTAACAACCTATCTGGATTACAAATCAATAAACATTGCCCCCTCTTACGCATCTCTT
TTTTTGCAGATGACAGCCTGATCTTCATCAAAGCAAGGGACAAAGACCTCCAAACTCTCAAAAGGATTTTGAAGGAATACGAAATTGTTTCGGGCCAAATCATCAACTTA
GAAAAGTCGGCTTTCATGGTCAGCAGAAACATGCACGCCAAAGACAAAGAAACCTGCGAGCGAATCCTAAGCATCAAAAGAGTCAATTCCCTTGGCCAACACCTAGGGAT
GCCCTCTCAGATTGGGAAAAGCAAGAGAGCTGTGTTTGCAAAGCTAAAAGATATGGTGGAGAAAACCCTCCGAGGTTGGAAAGAAAAACTTTTCTCCCTTGGAGGAAAGG
AAGTCCTCATAAAAGCGGTGGCCCAATCGATTCCCACATACACTATGAGTTGCTTTCGTCTCCCTAATTACATTTGTGATGATATTGATAGGTTATGCGCCAGGTTTTGG
TGGGGAACGGTGGGAGACAAAAGAAAGATGCATTGGATGAGCTGGGAGCGCATGTGTCGCAACAAGGACCAAGGAGGCATGGGATTCAGACAAATAAGGGCATTCAACCA
AGCAATGCTAGCTAAGCAGAGCTGGAGACTAATCAGAAACCCCAACAGCCTACTCTTCAAAATCCTCCGAGGCAAATATTTCAAAGATGGCAACTTCATGGAGGCGCAAA
TAGGAAATTCCCCTTCGACCATTTGGAGAAGCATTTGCTGGGGCAGAGATCTGTTCGCTAAAGGCTACCGGTGGCGGATAGGGAATGGAATCATGATTCAAATTGACAAA
GATCCCTGGATCAATAGGCCAGGGAGTCGGACCCCAATCATCGCAAATGACAGTGTAAAAGGCAAGAGGGTGAACTGGTTACTCGACGAAAATCACAAATGGAAGGAAGA
CCGAATAAGACAACTCTTCCTCCCTCATGAGGCTGAAGACATCCTAAACATTTCAGTGGGATTCAGAAATGCAAGAGACGAAATCATATGGCATCCAGACAAAAAAGGGA
TCTTCAAAGTTAAAAGTGCTTACCATCTCGCTATGGACTTAAAGTACAAAGAAAAAGCGTCCCAATCCGACGGAAGCAAGATGATCAAAGATTGGAGGAGCCTATGGAGT
TCGGGTGTTCTACCAAGAACCAAAATTTGCGTCTGGAAGATAGTGAATGATATTATCCCGTCAAATTGGAGCCCTAGAGACTACTGGAATTGGATGGTGGATCATCTTAA
CAATGAGGAGATAGCAAAGGGGTCAATTATCATGTGGAGTATATGGAATCACAGAAACAAAATTCAGGCAGCAAGCAGTCGGGGAGCAGCAGAATTTCTCATCAACGATG
CTGAGGAACCACGTGAGTCATGTCATTTGGAAGAGACCGAAACCAAACTCTTGGAAGCTAAACGCAGACGCGGCTTGGTTCGAGCAAAGAAGCGTCGGAGGCCTCGGGTG
GGTGGTGCACGACTCGAATGGGTCCTTAATCTGTTTCGGATTACAAAAAACCGATCGGAATTGGGAAATCAAAATCATGGAAGCCAAAGCAATTCTCATGGGATCAAGGC
AATTCGACAAACCTGCCTTCAATTGAATCTAGGGCTGGAAATAGAATCAGACGCCCTGGAAGTGATCAAGGTCCTGGCCGGAGACGAAGAAGACCTGTCGGAGCTCAAGC
CCATCGCTGAGACGATCGTGTCTTCCTCCAAGGATCTGCGTGAAGTTTCTTTCATCCACTGTAACCGTCTAGCTAATTCAACAGCCCACTGGTTGGCTAGGCACGCTTCT
TCTGTAAATTTTTGTTCTAGAAATTTTGGTTTCGATCAGGGGAATCCTCTTTGCGAGGAATCTGGGCTTTCTTTTTGGGCGCCTGATCTTCCCTCCTGGTTCTCCCCTCC
TTTTTTAGAGGGTGCGGGATATGATATGGTGGTCCTGGGACCCAATGGTAACATGATAGCAACAATGGAGATGTTTGATAATCTGTGTTTTACACCTCTTGCAGCAGAGA
TCCAAGCGATCCTGCATGGCCTTAGACTGTTGCAACGATTGCAGCATATGAGTGCTCATGTATTCTCCGACTCTTCAAACACAATCAACATGATAACAGGAGATCTTCAA
CCTTCTTTAGAGAGCCCATTCGACATTGTTTTCATGCTCTTGTGCGTATGTGAAGTATGGCATAAGCAGGACAACACAACAACTAGAACACGGTCGAGAGACGGATATCG
AAGGAGAGACAAGGCAAAGGGGTTAGGCCGAGCTCGACCCAACCCCGTGTGGGCCATGCTTGAACCACTTGCATGGGTCGAGTCCTTCCACCTCTGTTTGGTCCCTGGTG
CCCCTGGCTGCCCTAGTTCCACCTGA
mRNA sequence
ATGGAATGCATATCCACAGTCTCGTACTCAATCCTCATTAATGGTTCGCCCCAAGAAGAGTTTAAGCCGAGTAGAGGGATCAGACAAGGGGACCCGCTATCCCCGTACCT
CTTCCTCCTCTGTGCTGAGGGTTTCTCAGCGCTCCTTAAAAGGGAAGAATCCTTTAACAACCTATCTGGATTACAAATCAATAAACATTGCCCCCTCTTACGCATCTCTT
TTTTTGCAGATGACAGCCTGATCTTCATCAAAGCAAGGGACAAAGACCTCCAAACTCTCAAAAGGATTTTGAAGGAATACGAAATTGTTTCGGGCCAAATCATCAACTTA
GAAAAGTCGGCTTTCATGGTCAGCAGAAACATGCACGCCAAAGACAAAGAAACCTGCGAGCGAATCCTAAGCATCAAAAGAGTCAATTCCCTTGGCCAACACCTAGGGAT
GCCCTCTCAGATTGGGAAAAGCAAGAGAGCTGTGTTTGCAAAGCTAAAAGATATGGTGGAGAAAACCCTCCGAGGTTGGAAAGAAAAACTTTTCTCCCTTGGAGGAAAGG
AAGTCCTCATAAAAGCGGTGGCCCAATCGATTCCCACATACACTATGAGTTGCTTTCGTCTCCCTAATTACATTTGTGATGATATTGATAGGTTATGCGCCAGGTTTTGG
TGGGGAACGGTGGGAGACAAAAGAAAGATGCATTGGATGAGCTGGGAGCGCATGTGTCGCAACAAGGACCAAGGAGGCATGGGATTCAGACAAATAAGGGCATTCAACCA
AGCAATGCTAGCTAAGCAGAGCTGGAGACTAATCAGAAACCCCAACAGCCTACTCTTCAAAATCCTCCGAGGCAAATATTTCAAAGATGGCAACTTCATGGAGGCGCAAA
TAGGAAATTCCCCTTCGACCATTTGGAGAAGCATTTGCTGGGGCAGAGATCTGTTCGCTAAAGGCTACCGGTGGCGGATAGGGAATGGAATCATGATTCAAATTGACAAA
GATCCCTGGATCAATAGGCCAGGGAGTCGGACCCCAATCATCGCAAATGACAGTGTAAAAGGCAAGAGGGTGAACTGGTTACTCGACGAAAATCACAAATGGAAGGAAGA
CCGAATAAGACAACTCTTCCTCCCTCATGAGGCTGAAGACATCCTAAACATTTCAGTGGGATTCAGAAATGCAAGAGACGAAATCATATGGCATCCAGACAAAAAAGGGA
TCTTCAAAGTTAAAAGTGCTTACCATCTCGCTATGGACTTAAAGTACAAAGAAAAAGCGTCCCAATCCGACGGAAGCAAGATGATCAAAGATTGGAGGAGCCTATGGAGT
TCGGGTGTTCTACCAAGAACCAAAATTTGCGTCTGGAAGATAGTGAATGATATTATCCCGTCAAATTGGAGCCCTAGAGACTACTGGAATTGGATGGTGGATCATCTTAA
CAATGAGGAGATAGCAAAGGGGTCAATTATCATGTGGAGTATATGGAATCACAGAAACAAAATTCAGGCAGCAAGCAGTCGGGGAGCAGCAGAATTTCTCATCAACGATG
CTGAGGAACCACGTGAGTCATGTCATTTGGAAGAGACCGAAACCAAACTCTTGGAAGCTAAACGCAGACGCGGCTTGGTTCGAGCAAAGAAGCGTCGGAGGCCTCGGGTG
GGTGGTGCACGACTCGAATGGGTCCTTAATCTGTTTCGGATTACAAAAAACCGATCGGAATTGGGAAATCAAAATCATGGAAGCCAAAGCAATTCTCATGGGATCAAGGC
AATTCGACAAACCTGCCTTCAATTGAATCTAGGGCTGGAAATAGAATCAGACGCCCTGGAAGTGATCAAGGTCCTGGCCGGAGACGAAGAAGACCTGTCGGAGCTCAAGC
CCATCGCTGAGACGATCGTGTCTTCCTCCAAGGATCTGCGTGAAGTTTCTTTCATCCACTGTAACCGTCTAGCTAATTCAACAGCCCACTGGTTGGCTAGGCACGCTTCT
TCTGTAAATTTTTGTTCTAGAAATTTTGGTTTCGATCAGGGGAATCCTCTTTGCGAGGAATCTGGGCTTTCTTTTTGGGCGCCTGATCTTCCCTCCTGGTTCTCCCCTCC
TTTTTTAGAGGGTGCGGGATATGATATGGTGGTCCTGGGACCCAATGGTAACATGATAGCAACAATGGAGATGTTTGATAATCTGTGTTTTACACCTCTTGCAGCAGAGA
TCCAAGCGATCCTGCATGGCCTTAGACTGTTGCAACGATTGCAGCATATGAGTGCTCATGTATTCTCCGACTCTTCAAACACAATCAACATGATAACAGGAGATCTTCAA
CCTTCTTTAGAGAGCCCATTCGACATTGTTTTCATGCTCTTGTGCGTATGTGAAGTATGGCATAAGCAGGACAACACAACAACTAGAACACGGTCGAGAGACGGATATCG
AAGGAGAGACAAGGCAAAGGGGTTAGGCCGAGCTCGACCCAACCCCGTGTGGGCCATGCTTGAACCACTTGCATGGGTCGAGTCCTTCCACCTCTGTTTGGTCCCTGGTG
CCCCTGGCTGCCCTAGTTCCACCTGA
Protein sequence
MECISTVSYSILINGSPQEEFKPSRGIRQGDPLSPYLFLLCAEGFSALLKREESFNNLSGLQINKHCPLLRISFFADDSLIFIKARDKDLQTLKRILKEYEIVSGQIINL
EKSAFMVSRNMHAKDKETCERILSIKRVNSLGQHLGMPSQIGKSKRAVFAKLKDMVEKTLRGWKEKLFSLGGKEVLIKAVAQSIPTYTMSCFRLPNYICDDIDRLCARFW
WGTVGDKRKMHWMSWERMCRNKDQGGMGFRQIRAFNQAMLAKQSWRLIRNPNSLLFKILRGKYFKDGNFMEAQIGNSPSTIWRSICWGRDLFAKGYRWRIGNGIMIQIDK
DPWINRPGSRTPIIANDSVKGKRVNWLLDENHKWKEDRIRQLFLPHEAEDILNISVGFRNARDEIIWHPDKKGIFKVKSAYHLAMDLKYKEKASQSDGSKMIKDWRSLWS
SGVLPRTKICVWKIVNDIIPSNWSPRDYWNWMVDHLNNEEIAKGSIIMWSIWNHRNKIQAASSRGAAEFLINDAEEPRESCHLEETETKLLEAKRRRGLVRAKKRRRPRV
GGARLEWVLNLFRITKNRSELGNQNHGSQSNSHGIKAIRQTCLQLNLGLEIESDALEVIKVLAGDEEDLSELKPIAETIVSSSKDLREVSFIHCNRLANSTAHWLARHAS
SVNFCSRNFGFDQGNPLCEESGLSFWAPDLPSWFSPPFLEGAGYDMVVLGPNGNMIATMEMFDNLCFTPLAAEIQAILHGLRLLQRLQHMSAHVFSDSSNTINMITGDLQ
PSLESPFDIVFMLLCVCEVWHKQDNTTTRTRSRDGYRRRDKAKGLGRARPNPVWAMLEPLAWVESFHLCLVPGAPGCPSST
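
The CDS and protein sequences above can be checked against each other by translating the CDS. This is a minimal sketch that assumes the standard nuclear genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration, not the tool used to build this record.

```python
# Standard genetic code (NCBI table 1), listed with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        # Codons with ambiguous bases are reported as 'X'.
        residue = CODON_TABLE.get(seq[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 nt and last 12 nt of the CDS listed above.
print(translate("ATGGAATGCATATCCACAGTCTCGTACTCA"))  # MECISTVSYS
print(translate("AGTTCCACCTGA"))                    # SST
```

The two fragments reproduce the first ten residues (MECISTVSYS) and the last three residues (SST) of the protein sequence; passing the full CDS, joined across its lines, would translate the whole record the same way.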