CuGenDBv2

Lag0008568 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0008568
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr9:25519850..25521151
RNA-Seq Expression: Lag0008568
Synteny: Lag0008568
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR043502 - DNA/RNA polymerase superfamily
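
The genome location string above can be split into chromosome, start, and end with a few lines of code. The following is a minimal Python sketch and not part of CuGenDB: the helper names `parse_location` and `span_length` are hypothetical, and the sketch assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
import re

# Hypothetical helper (not a CuGenDB API): parse "chr:start..end" strings.
LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")

def parse_location(text):
    """Return (chromosome, start, end) from a 'chr:start..end' string."""
    match = LOCATION_RE.match(text.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    return match["chrom"], int(match["start"]), int(match["end"])

def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1

chrom, start, end = parse_location("chr9:25519850..25521151")
print(chrom, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption the span works out to 1302 bp, which would fit a 433-residue protein plus a stop codon (434 codons).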


Homology
GenBank top hits | e-value | % identity
VVA38592.1 PREDICTED: reverse mRNAase, partial [Prunus dulcis] | 8.0e-129 | 52.74
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M PSKAPGPDG   + Y  +W I+G+D +      L + + +  +N+T + LIPK K+P+TM++  PISLCNVLY++ AK LANRMK V+ S+IS SQSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        FVPGR ITDN +V FE  H L  +R G+ G +A+KLDMSKAYDRVEW ++E+ M  MGF   W++ VM C+++VSYS LVNGE   +  P+RGLRQGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPYLFLLCAEGF+ LL + E    L G+ I    PT++HLFFADDS VF KA ++N    K + + YE ASGQ IN  KS    S N+  DT+++   +L
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
        G+ R +S   YLG+P   GRNK   FR +K+RV K LQGW+    S+ GKEVL+K VAQ+IP+Y MSCF LP  +C  I+++ A+FWWG  G   K HWM
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYL
         W+++C+ K +GGMGFR L
Subjt:  NWKKMCRNKNQGGMGFRYL

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber] | 1.4e-128 | 53.22
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M+P+KAPGPDG  A+ +  +W+I+G D +   L +LN+   +  IN T I L+PK K+P  MS+FRPISLCNV+YK+I+K LANR+K +L  IIS +QSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        F+ GR ITDNVLV FE +H L +K+ GK G+ AIKLDMSKAYDRVEW +I++ M++MGF   WI+ VM CI+SVSYS+LVNG       P+RGLRQGDP+
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPY+FLLCA+GFS+LL        ++G+ I   CP ITHLFFADDSL+F KA      T   +L+ YE+ASGQ IN DKSS   S N  ++ + +   +L
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
        G  +     +YLG+PS  G++K  +F  +K+RVE+ L GWK  L S+GG+E+LIKAVAQAIP YTMSCF++P  +C  I+ +  +FWWG  G + K  W+
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYL
        +WKK+C+ K  GGMGFR L
Subjt:  NWKKMCRNKNQGGMGFRYL

XP_023901742.1 uncharacterized protein LOC112013579 [Quercus suber] | 1.7e-131 | 54.92
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        ++P+KAPGPDG  A  +HN+WDI+G    N  L +LN+   +  IN T I+LIPK+  P  M+EFRPISLCN  YK+I+K LANR K +L +IIS +QSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        F P R ITDNVLV FE +H LN+K  GK  Y++IKLDMSKA+DRVEW +I+  M+++GF   WI  +M+C+SSVSYSVL+NGE      PSRG+RQGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SP LFLLCAEG SAL+        + G+ I   CP ITHLFFADDSL+F KAKE   H    +L +YEEASGQ IN DKSS   S N  ++ +     IL
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
        G  + +   +YLG+PS  G++K  VF  +KDRV K L GWKG L S+GG+E+LIKAVAQA+P YTMSCF+LP  +C  ++ L   FWWG    ++K  W+
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFR
        +W+KMCR+K  GGMGFR
Subjt:  NWKKMCRNKNQGGMGFR

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata] | 3.5e-132 | 54.89
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M P+KAPGPDG +A+ Y  FW I+G+  ++  L  LNN   +  IN+T I LIPK ++P+ MSEFRPISLCNV+YK+I+K LANR+KQVL  IIS +QSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        FVPGR ITDNVLV +E +H ++ ++ GK G VA+KLD+SKAYDRVEW +++  M++MGF   WI++VMSC+++ S+S+LVNG+  E+ +PSRG+RQGDP+
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPYLFLLCAEG +ALL + E    + G+ I    P IT+L FADDSL+F +A  +   T  ++L+ YE ASGQ+IN +KSS   S N  E  K +  EIL
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
        G+K  +   +YLG+P+  GR K   F  +KDRV K LQGWKG L S  GKE+LIKAVAQAIP YTMS F++P  +CS ++ LCA+FWWG  GN+ K HW 
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYL
        +W K+   K +GGMGFR L
Subjt:  NWKKMCRNKNQGGMGFRYL

XP_030964220.1 uncharacterized protein LOC115985421 [Quercus lobata] | 3.8e-131 | 55.16
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        ++P+KAPGPDG  A+ +HN+WDI+G + IN  L +LN+   +  IN T I+LIPK+  P  M+EF PISLCN  YK+I+K LANR+K +L +IIS +QSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        F P R ITDNVLV FE +H LN+K  GK  Y++IKLDMSKA+DRVEW +I+  M ++GF   WI  VM+C+SSVSYSVL+NGE      PSRG+RQGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SP LFLLCAEGFSAL+        + G+ I   CP ITH FFADDSL+F KAK    H    +L +YEEASGQ IN DKSS   S N  +D K    +IL
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
        G  + +   +YLG+PS  G++K  VF  +KDRV K L GWKG L S+GG+E+LIKAVAQA+P YTMSCF+LP  +C  ++ +   FWWG    ++K  W+
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFR
        +WK+MCR+K  GGMGFR
Subjt:  NWKKMCRNKNQGGMGFR

TrEMBL top hits | e-value | % identity
A0A2N9FNH6 Reverse transcriptase domain-containing protein | 3.9e-129 | 52.73
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M+P+KAPGPDG +AM Y  FW I+G+D  N  L  L++ + ++ +N T I LIPK   P+ M++FRPISLCNVLYK+I+K LANR+K VLD IIS +QSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        FVPGR ITDN+LV FE +H +  KR G++ ++A+KLDMSKAYDRVEW ++E  M ++GF   W+  +M C++SVSYSV++NGE     KP+RG+RQGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPYLFL+CAEG +ALL++ E    + GL I    P I+HLFFADDSL+F +A          +L  YE+ASGQ +N +K+S   S N   D +     +L
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
            +  LG+YLG+P   GR K   F  IK ++ K L GWKG L S  G+E+LIK+VAQAIPVYTMSCFR+P  +CS I+ + +KFWWG    + K HW 
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYLVL
         W  MCR K++GGMGFR L L
Subjt:  NWKKMCRNKNQGGMGFRYLVL

A0A2N9I4C9 Uncharacterized protein | 8.0e-127 | 52.97
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M+PSKA GPDG +A+ Y  FW I+G D     L  L++ + ++ IN T IALIPK K P TM++FRPISLCNVLYK+I+K LANR+K VL+ +IS +QSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        FVPGR I+DN+LV FE +H L +KR G+  ++A+KLDMSKAYDRVEW +I + M ++GF+P W+  +M CI SVSYSV++NG+     +P+RG+ QGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPYLFL+CAEG +ALL    S  +L+GL +    P I+HLFFADDSL+F +A     HT   +L  YE+ASGQ +N++K+S   S N  + T+      L
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
            S  LG+YLG+P   GR K   F  IK +V + L GWKG + SM GKEVLIK+VAQA+PVYTMSCF L  ++C+ I+ +   FWWG   ++ K HW 
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYLVL
        NW K+CR K  GGMGFR L L
Subjt:  NWKKMCRNKNQGGMGFRYLVL

A0A2N9J3U0 Reverse transcriptase domain-containing protein | 3.9e-129 | 52.73
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M+P+KAPGPDG +AM Y  FW I+G+D  N  L  L++ + ++ +N T I LIPK   P+ M++FRPISLCNVLYK+I+K LANR+K VLD IIS +QSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        FVPGR ITDN+LV FE +H +  KR G++ ++A+KLDMSKAYDRVEW ++E  M ++GF   W+  +M C++SVSYSV++NGE     KP+RG+RQGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPYLFL+CAEG +ALL++ E    + GL I    P I+HLFFADDSL+F +A          +L  YE+ASGQ +N +K+S   S N   D +     +L
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
            +  LG+YLG+P   GR K   F  IK ++ K L GWKG L S  G+E+LIK+VAQAIPVYTMSCFR+P  +CS I+ + +KFWWG    + K HW 
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYLVL
         W  MCR K++GGMGFR L L
Subjt:  NWKKMCRNKNQGGMGFRYLVL

A0A5E4GGB8 PREDICTED: reverse mRNAase (Fragment) | 3.9e-129 | 52.74
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M PSKAPGPDG   + Y  +W I+G+D +      L + + +  +N+T + LIPK K+P+TM++  PISLCNVLY++ AK LANRMK V+ S+IS SQSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        FVPGR ITDN +V FE  H L  +R G+ G +A+KLDMSKAYDRVEW ++E+ M  MGF   W++ VM C+++VSYS LVNGE   +  P+RGLRQGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPYLFLLCAEGF+ LL + E    L G+ I    PT++HLFFADDS VF KA ++N    K + + YE ASGQ IN  KS    S N+  DT+++   +L
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
        G+ R +S   YLG+P   GRNK   FR +K+RV K LQGW+    S+ GKEVL+K VAQ+IP+Y MSCF LP  +C  I+++ A+FWWG  G   K HWM
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYL
         W+++C+ K +GGMGFR L
Subjt:  NWKKMCRNKNQGGMGFRYL

M5VU98 Reverse transcriptase domain-containing protein | 6.0e-130 | 53.22
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M PSKAPGPDG   + Y  +W I+G+D +      L + E +  +N+T + LIPK K+P+TM++ RPISLCNVLY++ AK LANRMK V+ S+IS SQSA
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
        FVPGR ITDN +V FE  H L  +R G+ G +A+KLDMSKAYDRVEW ++E+ M  MGF   W++ VM C+++VSYS LVNGE   +  P+RGLRQGDPL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL
        SPYLFLLCAEGF+ LL + E    L G+ I    PT++HLFFADDS VF KA ++N    K + + YE ASGQ IN  KS    S N+  DT+++   +L
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEIL

Query:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM
        G+ R +S   YLG+P   GRNK   FR +K+RV K LQGW+    S+ GKEVL+K VAQ+IP+Y MSCF LP  +C  I+++ A+FWWG  G   K HWM
Subjt:  GIKRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWM

Query:  NWKKMCRNKNQGGMGFRYL
         W+++C+ K +GGMGFR L
Subjt:  NWKKMCRNKNQGGMGFRYL

SwissProt top hits | e-value | % identity
O00370 LINE-1 retrotransposable element ORF2 protein | 1.2e-29 | 25
Query:  KAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTL----IALIPK-SKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQS
        K+PGPDG  A  Y  +     E+ +   L +  + E+   + N+     I LIPK  +D      FRPISL N+  K++ K LANR++Q +  +I   Q 
Subjt:  KAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTL----IALIPK-SKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQS

Query:  AFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDP
         F+PG Q   N+      I  +N  R     +V I +D  KA+D+++  ++ +++ ++G    +++ + +     + ++++NG+  E F    G RQG P
Subjt:  AFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDP

Query:  LSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEI
        LSP LF +  E  +  + +E+    + G+++      +    FADD +V+L+    +     K++  + + SG  IN  KS      N ++ T+++    
Subjt:  LSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEI

Query:  LGIKRSNSLGQYLGMPSQTGRNKGGVFRN----IKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKFWWGSYGN
        L    ++   +YLG+  Q  R+   +F+     +   +++    WK    S  G+  ++K       +Y  +    +LP    + +++   KF W    N
Subjt:  LGIKRSNSLGQYLGMPSQTGRNKGGVFRN----IKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKFWWGSYGN

Query:  KDKAHWMNWKKMCRNKNQGG
        + +A     K +   KN+ G
Subjt:  KDKAHWMNWKKMCRNKNQGG

P08548 LINE-1 reverse transcriptase homolog | 6.1e-31 | 25.78
Query:  KAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTL----IALIPK-SKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQS
        K+PGPDG  +  Y  F     E+ +   L +  N E+   + NT     I LIPK  KDP     +RPISL N+  K++ K L NR++Q +  II   Q 
Subjt:  KAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTL----IALIPK-SKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQS

Query:  AFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDP
         F+PG Q   N+      I  +N  +     ++ + +D  KA+D ++  ++  ++K++G   ++++ + +  S  + ++++NG   + F    G RQG P
Subjt:  AFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDP

Query:  LSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKS-SFMASKNVKEDTKAKCEE
        LSP LF +  E  +  +  E++   + G+ I +    I    FADD +V+L+    +     +V+K+Y   SG  IN  KS +F+ + N + +   K   
Subjt:  LSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKS-SFMASKNVKEDTKAKCEE

Query:  ILGI--KRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKFWWGSYGNK
           +  K+   LG YL    +    +   +  ++  + + +  WK    S  G+  ++K       +Y  +    + P +    ++++   F W    N+
Subjt:  ILGI--KRSNSLGQYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKFWWGSYGNK

Query:  DKAHWMNWKKMCRNKNQGG
         K      K +  NKN+ G
Subjt:  DKAHWMNWKKMCRNKNQGG

P11369 LINE-1 retrotransposable element ORF2 protein | 1.2e-34 | 27.27
Query:  KAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTL----IALIPK-SKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQS
        K+PGPDG  A  Y  F     ED I     + +  E    + N+     I LIPK  KDP  +  FRPISL N+  K++ K LANR+++ + +II P Q 
Subjt:  KAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTL----IALIPK-SKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQS

Query:  AFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDP
         F+PG Q   N+      IH +N  +     ++ I LD  KA+D+++  ++ + +++ G    ++  + +  S    ++ VNGE  E      G RQG P
Subjt:  AFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDP

Query:  LSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKS-SFMASKNVKEDTKAKCEE
        LSPYLF +  E  +  + +++    + G++I      I+ L  ADD +V++   +++      ++  + E  G  IN +KS +F+ +KN + + + +   
Subjt:  LSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKS-SFMASKNVKEDTKAKCEE

Query:  ILGIKRSNSLGQYLG--MPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKFWWGS
           I  +N   +YLG  +  +        F+++K  +++ L+ WK    S  G+  ++K       +Y  +    ++PT   + ++    KF W +
Subjt:  ILGIKRSNSLGQYLG--MPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSC--FRLPTNICSFIDRLCAKFWWGS

P14381 Transposon TX1 uncharacterized 149 kDa protein | 1.1e-32 | 28.16
Query:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA
        M  +K+PG DG     +  FWD +G D            E        +++L+PK  D + +  +RP+SL +  YK++AKA++ R+K VL  +I P QS 
Subjt:  MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSA

Query:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL
         VPGR I DNV +  + +H    +RTG      + LD  KA+DRV+  Y+  +++   F P ++  + +  +S    V +N          RG+RQG PL
Subjt:  FVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPL

Query:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKE-SNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEI
        S  L+ L  E F  LL +      L GL +      +    +ADD  V L A++  +L   ++  + Y  AS   IN+ KSS +   ++K D        
Subjt:  SPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDSLVFLKAKE-SNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEI

Query:  LGIKRSNSLGQYLGM-PSQTGRNKGGVFRNIKDRVEKTLQGWKG--NLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDK
          I   + + +YLG+  S         F  +++ V   L  WKG   + SM G+ ++I  +  +   Y + C        + I R    F W        
Subjt:  LGIKRSNSLGQYLGM-PSQTGRNKGGVFRNIKDRVEKTLQGWKG--NLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDK

Query:  AHWMNWKKMCRNKNQGGMG
         HW++         +GG G
Subjt:  AHWMNWKKMCRNKNQGGMG

P92555 Uncharacterized mitochondrial protein AtMg01250 | 3.6e-15 | 52.94
Query:  LVNGEHQEVFKPSRGLRQGDPLSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDS
        ++NG  Q +  PSRGLRQGDPLSPYLF+LC E  S L  R +    L G++++N+ P I HL FADD+
Subjt:  LVNGEHQEVFKPSRGLRQGDPLSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDS

Arabidopsis top hits | e-value | % identity
AT3G24255.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 1.0e-09 | 27.83
Query:  QYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWMNWKKMCRNK
        +YLG+P  T +     +  + +++   +  W     S  G+  LI +V  ++  + MS FRLP+     ID +C+ F W       K   + W  +C  K
Subjt:  QYLGMPSQTGRNKGGVFRNIKDRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWMNWKKMCRNK

Query:  NQGGMGFRYLVLSTK
        ++GG+G R L  + K
Subjt:  NQGGMGFRYLVLSTK

AT4G20520.1 RNA binding;RNA-directed DNA polymerases | 4.8e-15 | 36.05
Query:  LANRMKQVLDSIISPSQSAFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKV
        +  R+K ++ ++I P+Q++F+PGR  TDN++   E +H++  K+ G  G++ +KLD+ KAYDR+ W Y+E+++   GF   W+ ++
Subjt:  LANRMKQVLDSIISPSQSAFVPGRQITDNVLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKV

AT4G29090.1 Ribonuclease H-like superfamily protein | 7.4e-08 | 39.66
Query:  AIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWMNWKKMCRNKNQGGMGFR
        A+P YTM+CF LP  +C  I  + A FWW +       HW  W  +   K +GG+GF+
Subjt:  AIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWMNWKKMCRNKNQGGMGFR

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein | 2.5e-11 | 47.54
Query:  AIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWMNWKKMCRNK-NQGGMGFRYL
        A+PVY MSCFRL   +C  +     +FWW S  NK K  W+ W+K+C++K + GG+GFR L
Subjt:  AIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWMNWKKMCRNK-NQGGMGFRYL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) | 2.6e-16 | 52.94
Query:  LVNGEHQEVFKPSRGLRQGDPLSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDS
        ++NG  Q +  PSRGLRQGDPLSPYLF+LC E  S L  R +    L G++++N+ P I HL FADD+
Subjt:  LVNGEHQEVFKPSRGLRQGDPLSPYLFLLCAEGFSALLEREESLANLAGLKINNHCPTITHLFFADDS


Sequences
CDS sequence
ATGAATCCTTCCAAAGCCCCGGGTCCTGACGGGGCACATGCTATGCTTTATCACAATTTCTGGGATATTATGGGGGAGGATACAATTAATACTTGTTTGGGAATTCTGAA
TAACAGGGAAGAGATAGAGCCTATAAACAACACTCTTATTGCGCTTATTCCTAAATCAAAGGATCCAAAGACGATGAGTGAATTTAGGCCTATCAGCCTGTGCAATGTGC
TCTACAAGGTAATAGCAAAAGCTCTAGCGAATCGCATGAAACAAGTACTTGACTCTATTATCTCCCCTTCCCAATCAGCCTTCGTCCCGGGTAGACAGATCACTGACAAC
GTGCTGGTGGGTTTTGAATGCATTCACGCACTAAACAACAAAAGGACGGGGAAAGCAGGATATGTGGCTATCAAACTCGACATGAGTAAAGCCTACGACCGTGTGGAATG
GGTGTATATAGAAGAAAGCATGAAACAAATGGGCTTCAGCCCCAGTTGGATCCAGAAAGTCATGAGCTGCATCTCCTCTGTGAGCTACTCGGTTCTTGTCAACGGTGAAC
ACCAAGAAGTTTTTAAGCCGAGCAGGGGCCTCCGCCAGGGAGACCCTCTATCCCCTTACCTCTTCTTGTTGTGTGCAGAGGGTTTCTCGGCTCTCCTCGAAAGGGAAGAA
TCTTTAGCTAACCTAGCTGGTCTTAAAATTAACAATCATTGCCCCACTATAACTCATCTCTTTTTTGCAGACGACAGCCTTGTCTTTCTTAAAGCTAAGGAGTCGAACCT
TCATACTTTCAAGAAAGTTCTGAAGCAATACGAGGAAGCCTCAGGCCAAACAATCAACTTTGACAAGTCCTCCTTTATGGCCAGTAAAAATGTCAAGGAAGACACTAAGG
CCAAGTGCGAGGAAATTCTCGGGATTAAGAGATCGAACTCATTAGGCCAATATCTGGGGATGCCTTCCCAAACAGGAAGAAATAAAGGAGGGGTTTTCAGGAATATCAAA
GACAGGGTTGAGAAAACTCTTCAAGGGTGGAAAGGGAATCTTTTCTCCATGGGAGGAAAGGAAGTTCTCATAAAGGCTGTGGCTCAAGCGATTCCGGTCTACACCATGAG
TTGTTTTCGATTACCTACTAACATTTGTTCTTTTATTGACAGGTTATGTGCTAAATTCTGGTGGGGATCCTACGGTAATAAAGACAAAGCCCATTGGATGAACTGGAAGA
AGATGTGCCGAAACAAGAACCAAGGAGGAATGGGATTTCGTTACCTAGTGCTTTCAACCAAGCTATGCTCGCGAAACAAAGCTGGAGAATGA
mRNA sequence
ATGAATCCTTCCAAAGCCCCGGGTCCTGACGGGGCACATGCTATGCTTTATCACAATTTCTGGGATATTATGGGGGAGGATACAATTAATACTTGTTTGGGAATTCTGAA
TAACAGGGAAGAGATAGAGCCTATAAACAACACTCTTATTGCGCTTATTCCTAAATCAAAGGATCCAAAGACGATGAGTGAATTTAGGCCTATCAGCCTGTGCAATGTGC
TCTACAAGGTAATAGCAAAAGCTCTAGCGAATCGCATGAAACAAGTACTTGACTCTATTATCTCCCCTTCCCAATCAGCCTTCGTCCCGGGTAGACAGATCACTGACAAC
GTGCTGGTGGGTTTTGAATGCATTCACGCACTAAACAACAAAAGGACGGGGAAAGCAGGATATGTGGCTATCAAACTCGACATGAGTAAAGCCTACGACCGTGTGGAATG
GGTGTATATAGAAGAAAGCATGAAACAAATGGGCTTCAGCCCCAGTTGGATCCAGAAAGTCATGAGCTGCATCTCCTCTGTGAGCTACTCGGTTCTTGTCAACGGTGAAC
ACCAAGAAGTTTTTAAGCCGAGCAGGGGCCTCCGCCAGGGAGACCCTCTATCCCCTTACCTCTTCTTGTTGTGTGCAGAGGGTTTCTCGGCTCTCCTCGAAAGGGAAGAA
TCTTTAGCTAACCTAGCTGGTCTTAAAATTAACAATCATTGCCCCACTATAACTCATCTCTTTTTTGCAGACGACAGCCTTGTCTTTCTTAAAGCTAAGGAGTCGAACCT
TCATACTTTCAAGAAAGTTCTGAAGCAATACGAGGAAGCCTCAGGCCAAACAATCAACTTTGACAAGTCCTCCTTTATGGCCAGTAAAAATGTCAAGGAAGACACTAAGG
CCAAGTGCGAGGAAATTCTCGGGATTAAGAGATCGAACTCATTAGGCCAATATCTGGGGATGCCTTCCCAAACAGGAAGAAATAAAGGAGGGGTTTTCAGGAATATCAAA
GACAGGGTTGAGAAAACTCTTCAAGGGTGGAAAGGGAATCTTTTCTCCATGGGAGGAAAGGAAGTTCTCATAAAGGCTGTGGCTCAAGCGATTCCGGTCTACACCATGAG
TTGTTTTCGATTACCTACTAACATTTGTTCTTTTATTGACAGGTTATGTGCTAAATTCTGGTGGGGATCCTACGGTAATAAAGACAAAGCCCATTGGATGAACTGGAAGA
AGATGTGCCGAAACAAGAACCAAGGAGGAATGGGATTTCGTTACCTAGTGCTTTCAACCAAGCTATGCTCGCGAAACAAAGCTGGAGAATGA
Protein sequence
MNPSKAPGPDGAHAMLYHNFWDIMGEDTINTCLGILNNREEIEPINNTLIALIPKSKDPKTMSEFRPISLCNVLYKVIAKALANRMKQVLDSIISPSQSAFVPGRQITDN
VLVGFECIHALNNKRTGKAGYVAIKLDMSKAYDRVEWVYIEESMKQMGFSPSWIQKVMSCISSVSYSVLVNGEHQEVFKPSRGLRQGDPLSPYLFLLCAEGFSALLEREE
SLANLAGLKINNHCPTITHLFFADDSLVFLKAKESNLHTFKKVLKQYEEASGQTINFDKSSFMASKNVKEDTKAKCEEILGIKRSNSLGQYLGMPSQTGRNKGGVFRNIK
DRVEKTLQGWKGNLFSMGGKEVLIKAVAQAIPVYTMSCFRLPTNICSFIDRLCAKFWWGSYGNKDKAHWMNWKKMCRNKNQGGMGFRYLVLSTKLCSRNKAGE
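
The relationship between the CDS and protein sequences above can be checked by translating codons. The following is a minimal Python sketch, assuming the standard genetic code (NCBI translation table 1); the page does not say which table was used, and the `translate` helper is hypothetical rather than anything provided by the database.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
# Assumption: the page does not state which translation table applies.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 and last 12 nucleotides of the CDS listed above.
print(translate("ATGAATCCTTCCAAAGCCCCGGGTCCTGAC"))
print(translate("GCTGGAGAATGA"))
```

The two fragments translate to the first ten residues (MNPSKAPGPD) and the last three residues (AGE) of the protein sequence, with the final TGA codon read as the stop.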