; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0027670 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0027670
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr8:3303949..3311020
RNA-Seq ExpressionLag0027670
SyntenyLag0027670
Gene Ontology termsGO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
GO:0016787 - hydrolase activity (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily
IPR036875 - Zinc finger, CCHC-type superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032541.1 pol protein [Cucumis melo var. makuwa]3.0e-11344.57Show/hide
Query:  PVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSI
        PV  QA   P  +  APP+P Q+     +  +KHL+DF++Y+P  F+G   +PT A+ W+  ++ IF YM CPE Q+V CAVF L +    WW++AER +
Subjt:  PVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSI

Query:  STTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAATF
              +TW QFK  F+ K++S  +++ K +EFL L QG+ +VE+Y+ EF  LSRFA +MV  EA + ++F+ GLR D+Q +V AL P  +A ALR A  
Subjt:  STTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAATF

Query:  MGMS----------------------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGHMAAECPR-GKN
        + +                                               AA    + + P  +T      G+CLAG GVCFRC + GH A E  R G  
Subjt:  MGMS----------------------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGHMAAECPR-GKN

Query:  VDSRRPF-----------GSNQARPAPREVQHMPLPARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMD
        V    P            GS+ +  +   VQH+ L         E+EPLG  L VSTP+G  LL++E++K C+V +++R+LDVTL+VL+M+DFDVILGMD
Subjt:  VDSRRPF-----------GSNQARPAPREVQHMPLPARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMD

Query:  WLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSI
        WL+ NHA IDC  KEV+F P  GASFKF+G     +PKV+S +KA +L  +G W  L SV D R  +  +SS PVV E+ DVFP++LPGL P REVDF I
Subjt:  WLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSI

Query:  ELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP
        ELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRPS+SP
Subjt:  ELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP

KAA0035574.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-11545.4Show/hide
Query:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ
        R  GRG GRG G G+V  E  V PV  QA N       AP VP Q+     +  +KHL+DF++Y+P  F+G   DPT A+ W++ ++ IF YM CPE Q+
Subjt:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ

Query:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD
        V CAVFML +    WW++ ER +    G +TW QFK +FF K++S  ++  K +EFL L Q + +VE+Y+ EF  LSRFA EM+ TEA +  +F+ GLR 
Subjt:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD

Query:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----
        D+Q +V A  PT +A ALR A  + +   AN+  + +  E+S              G+CL G   CF+C + GH A  CP       +N  +  P     
Subjt:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----

Query:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG
        F +N+    R        +P+                 A  LHAR+E+EPL   L VSTP+G  +L++E+VK CQ+ ++  +++VT++VL+M DFDVILG
Subjt:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG

Query:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF
        MDWLA NHA IDC  KEV F P   +SFKFK   S+S+P+V+S ++A +L  +G WC L SV DTR     +SS P+V ++ DVFPE+LPGL P REV+F
Subjt:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF

Query:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS
        +IELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRP  S
Subjt:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS

KAA0037291.1 pol protein [Cucumis melo var. makuwa]3.2e-11541.82Show/hide
Query:  WYQSPVSRFYRLAYVVSI---------RIPIAKYNMHPRMRGR-----GLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAP-PVPAQIDRAGS-----
        WYQS   RF RLAY VS+          +P+ +  M PR   R     G GRG GR Q  + +   P        P    PAP P PA +  A       
Subjt:  WYQSPVSRFYRLAYVVSI---------RIPIAKYNMHPRMRGR-----GLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAP-PVPAQIDRAGS-----

Query:  -TEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQY
         +  +KHL+DF++Y+P  F+G   DPT A+ W++ ++ IF YM CPE Q+V CAVFML +    WW++ ER +      +TW QFK +F+ K++S  ++ 
Subjt:  -TEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQY

Query:  WKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAAT----------------------------
         K +EFL L QG+ +VE+Y+ EF  LSRFA EM+ TEA +  +F+ GLR D+Q +V A  P  +A ALR A                             
Subjt:  WKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAAT----------------------------

Query:  -----------------------FMGMSAANAIPM-TKEPESSTGQCLAGLGVCFRCGKGGHMAAECPR-----GKNVDSRRP-----FGSNQARPAPR-
                               F    AA   P+ T   +   G+CL G   CF+C + GH A  CP       +N  +  P     F +N+       
Subjt:  -----------------------FMGMSAANAIPM-TKEPESSTGQCLAGLGVCFRCGKGGHMAAECPR-----GKNVDSRRP-----FGSNQARPAPR-

Query:  EVQHMPLP---------------------ARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHA
         V    LP                     A  LHAR+E+EPL   L VSTP+G  +L++E+VK CQ+ ++  +++VTLIVL+M DFDVILGMDWLA NHA
Subjt:  EVQHMPLP---------------------ARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHA

Query:  IIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTT
         IDC +KEV F P   ASFKFKG  SKS+P+V+S I+A +L  +G W  L SV DTR     +SS PVV ++ DVFPE+LPGL P REV+F+IELEP T 
Subjt:  IIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTT

Query:  SISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP
         IS+APYRMAPAELKELKVQ+QELLDKGFIRPS+SP
Subjt:  SISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP

KAA0037906.1 reverse transcriptase [Cucumis melo var. makuwa]4.4e-11744.33Show/hide
Query:  GRGLGRGHGRGQVPLEQVVPPVGHQADNGP---ENQIPAPPVP--AQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEP
        GRG GRG GRGQ       P V   A N     + Q  APP P  AQ      +  +KHL+DF++Y+P  F+G   +PT A+ W+  ++ IF YM CPE 
Subjt:  GRGLGRGHGRGQVPLEQVVPPVGHQADNGP---ENQIPAPPVP--AQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEP

Query:  QQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGL
        Q+V CAVF L +    WW++AER +      +TW QFK  F+ K++S  +++ K +EFL L QG+ +VE+Y+ EF  LSRFA +MV  EA + ++F+ GL
Subjt:  QQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGL

Query:  RDDVQRVVGALDPTDYAAALRAATFMGMS--------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGH
        R D+Q +V AL P  +A ALR A  + +                                 AA    + + P  +T      G+CLAG GVCFRC + GH
Subjt:  RDDVQRVVGALDPTDYAAALRAATFMGMS--------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGH

Query:  MAAECPRGK-NVDSRRPFGSNQARPAPREVQH-----------MPLPARNL-------------------HARIELEPLGFSLLVSTPAGVELLARERVK
         A  CPR        +P  S Q R      Q            +P+                        H  +E+EPLG  L VSTP+G  LL++E++K
Subjt:  MAAECPRGK-NVDSRRPFGSNQARPAPREVQH-----------MPLPARNL-------------------HARIELEPLGFSLLVSTPAGVELLARERVK

Query:  TCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGI
         C+V +++R+LDVTL+VL+M+DFDVI+GMDWLA NHA IDC  KEV+F P  GASFKFKG     +PKV+S +KA +L  +G W  L SV D R  +A +
Subjt:  TCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGI

Query:  SSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP
        SS PVV E+ DVFP++LPGL P REVDF+IELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRPS+SP
Subjt:  SSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP

TYK30962.1 ty3-gypsy retrotransposon protein [Cucumis melo var. makuwa]1.9e-11545.4Show/hide
Query:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ
        R  GRG GRG G G+V  E  V PV  QA N       AP VP Q+     +  +KHL+DF++Y+P  F+G   DPT A+ W++ ++ IF YM CPE Q+
Subjt:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ

Query:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD
        V CAVFML +    WW++ ER +    G +TW QFK +FF K++S  ++  K +EFL L Q + +VE+Y+ EF  LSRFA EM+ TEA +  +F+ GLR 
Subjt:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD

Query:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----
        D+Q +V A  PT +A ALR A  + +   AN+  + +  E+S              G+CL G   CF+C + GH A  CP       +N  +  P     
Subjt:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----

Query:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG
        F +N+    R        +P+                 A  LHAR+E+EPL   L VSTP+G  +L++E+VK CQ+ ++  +++VT++VL+M DFDVILG
Subjt:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG

Query:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF
        MDWLA NHA IDC  KEV F P   +SFKFK   S+S+P+V+S ++A +L  +G WC L SV DTR     +SS P+V ++ DVFPE+LPGL P REV+F
Subjt:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF

Query:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS
        +IELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRP  S
Subjt:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS

TrEMBL top hitse value%identityAlignment
A0A5A7SSL3 Reverse transcriptase1.4e-11344.57Show/hide
Query:  PVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSI
        PV  QA   P  +  APP+P Q+     +  +KHL+DF++Y+P  F+G   +PT A+ W+  ++ IF YM CPE Q+V CAVF L +    WW++AER +
Subjt:  PVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSI

Query:  STTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAATF
              +TW QFK  F+ K++S  +++ K +EFL L QG+ +VE+Y+ EF  LSRFA +MV  EA + ++F+ GLR D+Q +V AL P  +A ALR A  
Subjt:  STTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAATF

Query:  MGMS----------------------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGHMAAECPR-GKN
        + +                                               AA    + + P  +T      G+CLAG GVCFRC + GH A E  R G  
Subjt:  MGMS----------------------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGHMAAECPR-GKN

Query:  VDSRRPF-----------GSNQARPAPREVQHMPLPARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMD
        V    P            GS+ +  +   VQH+ L         E+EPLG  L VSTP+G  LL++E++K C+V +++R+LDVTL+VL+M+DFDVILGMD
Subjt:  VDSRRPF-----------GSNQARPAPREVQHMPLPARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMD

Query:  WLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSI
        WL+ NHA IDC  KEV+F P  GASFKF+G     +PKV+S +KA +L  +G W  L SV D R  +  +SS PVV E+ DVFP++LPGL P REVDF I
Subjt:  WLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSI

Query:  ELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP
        ELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRPS+SP
Subjt:  ELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP

A0A5A7SW90 Reverse transcriptase9.1e-11645.4Show/hide
Query:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ
        R  GRG GRG G G+V  E  V PV  QA N       AP VP Q+     +  +KHL+DF++Y+P  F+G   DPT A+ W++ ++ IF YM CPE Q+
Subjt:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ

Query:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD
        V CAVFML +    WW++ ER +    G +TW QFK +FF K++S  ++  K +EFL L Q + +VE+Y+ EF  LSRFA EM+ TEA +  +F+ GLR 
Subjt:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD

Query:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----
        D+Q +V A  PT +A ALR A  + +   AN+  + +  E+S              G+CL G   CF+C + GH A  CP       +N  +  P     
Subjt:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----

Query:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG
        F +N+    R        +P+                 A  LHAR+E+EPL   L VSTP+G  +L++E+VK CQ+ ++  +++VT++VL+M DFDVILG
Subjt:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG

Query:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF
        MDWLA NHA IDC  KEV F P   +SFKFK   S+S+P+V+S ++A +L  +G WC L SV DTR     +SS P+V ++ DVFPE+LPGL P REV+F
Subjt:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF

Query:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS
        +IELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRP  S
Subjt:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS

A0A5A7T6R9 Reverse transcriptase1.5e-11541.82Show/hide
Query:  WYQSPVSRFYRLAYVVSI---------RIPIAKYNMHPRMRGR-----GLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAP-PVPAQIDRAGS-----
        WYQS   RF RLAY VS+          +P+ +  M PR   R     G GRG GR Q  + +   P        P    PAP P PA +  A       
Subjt:  WYQSPVSRFYRLAYVVSI---------RIPIAKYNMHPRMRGR-----GLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAP-PVPAQIDRAGS-----

Query:  -TEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQY
         +  +KHL+DF++Y+P  F+G   DPT A+ W++ ++ IF YM CPE Q+V CAVFML +    WW++ ER +      +TW QFK +F+ K++S  ++ 
Subjt:  -TEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQY

Query:  WKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAAT----------------------------
         K +EFL L QG+ +VE+Y+ EF  LSRFA EM+ TEA +  +F+ GLR D+Q +V A  P  +A ALR A                             
Subjt:  WKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAAT----------------------------

Query:  -----------------------FMGMSAANAIPM-TKEPESSTGQCLAGLGVCFRCGKGGHMAAECPR-----GKNVDSRRP-----FGSNQARPAPR-
                               F    AA   P+ T   +   G+CL G   CF+C + GH A  CP       +N  +  P     F +N+       
Subjt:  -----------------------FMGMSAANAIPM-TKEPESSTGQCLAGLGVCFRCGKGGHMAAECPR-----GKNVDSRRP-----FGSNQARPAPR-

Query:  EVQHMPLP---------------------ARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHA
         V    LP                     A  LHAR+E+EPL   L VSTP+G  +L++E+VK CQ+ ++  +++VTLIVL+M DFDVILGMDWLA NHA
Subjt:  EVQHMPLP---------------------ARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHA

Query:  IIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTT
         IDC +KEV F P   ASFKFKG  SKS+P+V+S I+A +L  +G W  L SV DTR     +SS PVV ++ DVFPE+LPGL P REV+F+IELEP T 
Subjt:  IIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTT

Query:  SISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP
         IS+APYRMAPAELKELKVQ+QELLDKGFIRPS+SP
Subjt:  SISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP

A0A5A7T6U3 Reverse transcriptase2.2e-11744.33Show/hide
Query:  GRGLGRGHGRGQVPLEQVVPPVGHQADNGP---ENQIPAPPVP--AQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEP
        GRG GRG GRGQ       P V   A N     + Q  APP P  AQ      +  +KHL+DF++Y+P  F+G   +PT A+ W+  ++ IF YM CPE 
Subjt:  GRGLGRGHGRGQVPLEQVVPPVGHQADNGP---ENQIPAPPVP--AQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEP

Query:  QQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGL
        Q+V CAVF L +    WW++AER +      +TW QFK  F+ K++S  +++ K +EFL L QG+ +VE+Y+ EF  LSRFA +MV  EA + ++F+ GL
Subjt:  QQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGL

Query:  RDDVQRVVGALDPTDYAAALRAATFMGMS--------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGH
        R D+Q +V AL P  +A ALR A  + +                                 AA    + + P  +T      G+CLAG GVCFRC + GH
Subjt:  RDDVQRVVGALDPTDYAAALRAATFMGMS--------------------------------AANAIPMTKEPESST------GQCLAGLGVCFRCGKGGH

Query:  MAAECPRGK-NVDSRRPFGSNQARPAPREVQH-----------MPLPARNL-------------------HARIELEPLGFSLLVSTPAGVELLARERVK
         A  CPR        +P  S Q R      Q            +P+                        H  +E+EPLG  L VSTP+G  LL++E++K
Subjt:  MAAECPRGK-NVDSRRPFGSNQARPAPREVQH-----------MPLPARNL-------------------HARIELEPLGFSLLVSTPAGVELLARERVK

Query:  TCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGI
         C+V +++R+LDVTL+VL+M+DFDVI+GMDWLA NHA IDC  KEV+F P  GASFKFKG     +PKV+S +KA +L  +G W  L SV D R  +A +
Subjt:  TCQVLVSDRLLDVTLIVLEMRDFDVILGMDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGI

Query:  SSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP
        SS PVV E+ DVFP++LPGL P REVDF+IELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRPS+SP
Subjt:  SSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSISP

A0A5D3E4V0 Reverse transcriptase9.1e-11645.4Show/hide
Query:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ
        R  GRG GRG G G+V  E  V PV  QA N       AP VP Q+     +  +KHL+DF++Y+P  F+G   DPT A+ W++ ++ IF YM CPE Q+
Subjt:  RMRGRGLGRGHGRGQVPLEQVVPPVGHQADNGPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQ

Query:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD
        V CAVFML +    WW++ ER +    G +TW QFK +FF K++S  ++  K +EFL L Q + +VE+Y+ EF  LSRFA EM+ TEA +  +F+ GLR 
Subjt:  VPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQKYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRD

Query:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----
        D+Q +V A  PT +A ALR A  + +   AN+  + +  E+S              G+CL G   CF+C + GH A  CP       +N  +  P     
Subjt:  DVQRVVGALDPTDYAAALRAATFMGM-SAANAIPMTKEPESS-------------TGQCLAGLGVCFRCGKGGHMAAECP-----RGKNVDSRRP-----

Query:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG
        F +N+    R        +P+                 A  LHAR+E+EPL   L VSTP+G  +L++E+VK CQ+ ++  +++VT++VL+M DFDVILG
Subjt:  FGSNQA---RPAPREVQHMPL----------------PARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILG

Query:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF
        MDWLA NHA IDC  KEV F P   +SFKFK   S+S+P+V+S ++A +L  +G WC L SV DTR     +SS P+V ++ DVFPE+LPGL P REV+F
Subjt:  MDWLAENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDF

Query:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS
        +IELEP T  IS+APYRMAPAELKELKVQ+QELLDKGFIRP  S
Subjt:  SIELEPSTTSISKAPYRMAPAELKELKVQIQELLDKGFIRPSIS

SwissProt top hitse value%identityAlignment
P92519 Uncharacterized mitochondrial protein AtMg008102.8e-1346.24Show/hide
Query:  KYIIDLLKRVEMLDSKPSSTPVV----SGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK
        KY   +L    MLD KP STP+     S  S +++     P+P  +RSIVGALQY T+TRPD+S+AVN V Q +H PT    + +KR+L Y+K
Subjt:  KYIIDLLKRVEMLDSKPSSTPVV----SGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE13.8e-1850Show/hide
Query:  KYIIDLLKRVEMLDSKPSSTPVVSGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYL
        +YI+DLL R  M+ +KP +TP+     LS + G  L +P +YR IVG+LQY   TRPD+S+AVN++ QF+H PT  +L  +KRIL YL
Subjt:  KYIIDLLKRVEMLDSKPSSTPVVSGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYL

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE24.4e-1950Show/hide
Query:  KYIIDLLKRVEMLDSKPSSTPVVSGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYL
        +Y +DLL R  ML +KP +TP+ +   L+ H G  LP+P +YR IVG+LQY   TRPDLS+AVN++ Q++H PT  + N +KR+L YL
Subjt:  KYIIDLLKRVEMLDSKPSSTPVVSGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYL

Arabidopsis top hitse value%identityAlignment
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 83.4e-1443.82Show/hide
Query:  KYIIDLLKRVEMLDSKPSSTPVVSGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK
        KY +DLL    +L  KPSS P+    + S H G    +   YR ++G L Y  ITR D+SFAVNK+ QF  AP   +   V +ILHY+K
Subjt:  KYIIDLLKRVEMLDSKPSSTPVVSGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK

ATMG00240.1 Gag-Pol-related retrotransposon family protein2.4e-0453.85Show/hide
Query:  YCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK
        Y TITRPDL+FAVN++ QF  A  T  +  V ++LHY+K
Subjt:  YCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK

ATMG00810.1 DNA/RNA polymerases superfamily protein2.0e-1446.24Show/hide
Query:  KYIIDLLKRVEMLDSKPSSTPVV----SGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK
        KY   +L    MLD KP STP+     S  S +++     P+P  +RSIVGALQY T+TRPD+S+AVN V Q +H PT    + +KR+L Y+K
Subjt:  KYIIDLLKRVEMLDSKPSSTPVV----SGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGATTCAATCCCTTCAATAAGTATATTATTGACCTTTTGAAGCGGGTTGAGATGCTTGACAGCAAGCCGAGCTCTACTCCTGTGGTTTCAGGCACTTCATTATCTCG
TCATGGTGGTGACCCACTTCCGAATCCAATGAAATATAGGAGTATTGTCGGCGCTCTTCAATATTGTACCATCACCCGACCAGATCTATCATTTGCAGTCAACAAAGTGT
ATCAATTTCTCCATGCTCCAACAACAATTTATTTGAATTTTGTCAAGCGTATTCTCCACTATTTGAAAGAAGACAATCTCTTGTTCAAGTATCATGACCTTGCTAATGCT
GCTACGGAAGTATTTTGGCTTGCGGACATCATGACCAAAGGTCTTTCATCTTCTCATTTTCAGAATCTTCGTGTCAAGCTCACTATTGTGAATCACTCAGAATGTCCATT
TGCCCTGTGGGGACCCGCCCCTAATGGGGCGGGGAATCCCCGTTTAAGCGGGGAATGGGGGAGGGAGCGGGGGAAAAAAACTCCCCGAGGTGGGGATGGGGATGGGGAAG
CCGTCCCCGCCCCGCCCCGCCCCGTGGACATCCCTATTAGCCACGTGAATGGAAAACAATATCTAGAATCTGAAGATGAAAAAGATTTGGATTCTCATGACCTCAAAGCT
TTGGAATTATCAAGGAAAATAAGGGAAGTTGATAGCAATGCGGTGGCCAAAGACAAAAATGAGTATGGTAACGTGGTCAGGTCGCAGGGGAATGTCGGGGCCTTAAGTGC
TGAATCCAGATTTCGAATCCTGGGCTTGGGGCGTTACAAATGGTATCAGAGCCCAGTTTCTAGATTCTATAGACTCGCTTACGTCGTAAGCATCAGAATTCCAATAGCCA
AGTACAATATGCATCCACGTATGCGAGGTCGAGGATTGGGTCGTGGACATGGTAGGGGTCAGGTACCTCTAGAGCAAGTGGTTCCTCCAGTGGGTCACCAGGCTGACAAT
GGACCTGAGAATCAGATACCTGCACCCCCAGTGCCAGCACAGATAGATCGAGCAGGGTCGACTGAGGGCTCTAAGCATCTGAAGGACTTCAAAAGATATGACCCACCGGC
GTTCAATGGGAAGACGGTGGACCCCACTGTCGCAGAGGCTTGGATAGCCAAGATGAAGGAGATTTTTTGTTACATGGGATGCCCCGAGCCTCAGCAAGTCCCATGTGCCG
TGTTCATGCTAAGGGAGGACGCGCTGATGTGGTGGCAGTCAGCAGAGAGATCTATCAGTACCACCGCAGGTCCCGTCACGTGGGTCCAGTTCAAGGGAACCTTCTTCCAA
AAGTATTACTCCACGATCATCCAGTACTGGAAGGAGGAAGAATTCTTGGCTTTAAGTCAAGGTAACAAGTCAGTGGAGGAATACGAACTGGAATTCCCGCGCTTGTCTCG
TTTCGCCCAAGAAATGGTAGACACTGAAGCAAAGAAGATGAAGAGGTTCATCTCGGGCCTCAGGGACGATGTGCAGAGAGTGGTAGGGGCACTTGACCCAACTGATTATG
CGGCAGCCCTTCGAGCAGCCACATTTATGGGTATGTCGGCTGCGAATGCAATCCCGATGACTAAAGAACCAGAATCCAGTACAGGTCAATGCTTAGCAGGCTTAGGAGTG
TGCTTCAGATGCGGCAAGGGTGGACACATGGCCGCAGAGTGCCCACGAGGAAAAAACGTAGATTCTAGGCGACCTTTCGGTTCTAATCAAGCCAGACCAGCACCCAGAGA
GGTTCAACACATGCCTCTACCAGCAAGAAACCTGCATGCTCGGATAGAGTTAGAGCCATTAGGTTTTAGCCTATTAGTTTCTACTCCAGCTGGAGTAGAATTGTTAGCTC
GGGAGAGGGTTAAAACCTGTCAGGTATTGGTGTCAGATCGTCTGTTAGATGTGACACTAATTGTACTGGAGATGAGAGATTTCGATGTTATCCTGGGCATGGACTGGTTA
GCCGAGAATCACGCCATCATTGATTGCCGTCAAAAGGAAGTATTATTCAAGCCTCTAGAGGGAGCAAGCTTCAAGTTCAAGGGAATCAGATCAAAGTCTGTTCCTAAAGT
GGTATCTACGATAAAGGCTAGAAGGCTCAGGGATCGAGGAGCGTGGTGCTTTTTAGTCAGTGTGTCAGATACTCGAGTTGAAAAGGCGGGGATAAGCTCTGTACCAGTGG
TCAATGAGTTCATGGATGTATTCCCTGAAGACCTTCCAGGTTTGCTTCCGGTTCGAGAAGTGGATTTCAGCATAGAGCTTGAGCCAAGCACAACCTCTATTTCTAAGGCA
CCCTACAGAATGGCCCCAGCAGAGCTGAAAGAACTAAAGGTGCAGATACAGGAACTCCTGGACAAAGGTTTTATTCGTCCTAGCATTTCACCTTAG
mRNA sequenceShow/hide mRNA sequence
ATGGGATTCAATCCCTTCAATAAGTATATTATTGACCTTTTGAAGCGGGTTGAGATGCTTGACAGCAAGCCGAGCTCTACTCCTGTGGTTTCAGGCACTTCATTATCTCG
TCATGGTGGTGACCCACTTCCGAATCCAATGAAATATAGGAGTATTGTCGGCGCTCTTCAATATTGTACCATCACCCGACCAGATCTATCATTTGCAGTCAACAAAGTGT
ATCAATTTCTCCATGCTCCAACAACAATTTATTTGAATTTTGTCAAGCGTATTCTCCACTATTTGAAAGAAGACAATCTCTTGTTCAAGTATCATGACCTTGCTAATGCT
GCTACGGAAGTATTTTGGCTTGCGGACATCATGACCAAAGGTCTTTCATCTTCTCATTTTCAGAATCTTCGTGTCAAGCTCACTATTGTGAATCACTCAGAATGTCCATT
TGCCCTGTGGGGACCCGCCCCTAATGGGGCGGGGAATCCCCGTTTAAGCGGGGAATGGGGGAGGGAGCGGGGGAAAAAAACTCCCCGAGGTGGGGATGGGGATGGGGAAG
CCGTCCCCGCCCCGCCCCGCCCCGTGGACATCCCTATTAGCCACGTGAATGGAAAACAATATCTAGAATCTGAAGATGAAAAAGATTTGGATTCTCATGACCTCAAAGCT
TTGGAATTATCAAGGAAAATAAGGGAAGTTGATAGCAATGCGGTGGCCAAAGACAAAAATGAGTATGGTAACGTGGTCAGGTCGCAGGGGAATGTCGGGGCCTTAAGTGC
TGAATCCAGATTTCGAATCCTGGGCTTGGGGCGTTACAAATGGTATCAGAGCCCAGTTTCTAGATTCTATAGACTCGCTTACGTCGTAAGCATCAGAATTCCAATAGCCA
AGTACAATATGCATCCACGTATGCGAGGTCGAGGATTGGGTCGTGGACATGGTAGGGGTCAGGTACCTCTAGAGCAAGTGGTTCCTCCAGTGGGTCACCAGGCTGACAAT
GGACCTGAGAATCAGATACCTGCACCCCCAGTGCCAGCACAGATAGATCGAGCAGGGTCGACTGAGGGCTCTAAGCATCTGAAGGACTTCAAAAGATATGACCCACCGGC
GTTCAATGGGAAGACGGTGGACCCCACTGTCGCAGAGGCTTGGATAGCCAAGATGAAGGAGATTTTTTGTTACATGGGATGCCCCGAGCCTCAGCAAGTCCCATGTGCCG
TGTTCATGCTAAGGGAGGACGCGCTGATGTGGTGGCAGTCAGCAGAGAGATCTATCAGTACCACCGCAGGTCCCGTCACGTGGGTCCAGTTCAAGGGAACCTTCTTCCAA
AAGTATTACTCCACGATCATCCAGTACTGGAAGGAGGAAGAATTCTTGGCTTTAAGTCAAGGTAACAAGTCAGTGGAGGAATACGAACTGGAATTCCCGCGCTTGTCTCG
TTTCGCCCAAGAAATGGTAGACACTGAAGCAAAGAAGATGAAGAGGTTCATCTCGGGCCTCAGGGACGATGTGCAGAGAGTGGTAGGGGCACTTGACCCAACTGATTATG
CGGCAGCCCTTCGAGCAGCCACATTTATGGGTATGTCGGCTGCGAATGCAATCCCGATGACTAAAGAACCAGAATCCAGTACAGGTCAATGCTTAGCAGGCTTAGGAGTG
TGCTTCAGATGCGGCAAGGGTGGACACATGGCCGCAGAGTGCCCACGAGGAAAAAACGTAGATTCTAGGCGACCTTTCGGTTCTAATCAAGCCAGACCAGCACCCAGAGA
GGTTCAACACATGCCTCTACCAGCAAGAAACCTGCATGCTCGGATAGAGTTAGAGCCATTAGGTTTTAGCCTATTAGTTTCTACTCCAGCTGGAGTAGAATTGTTAGCTC
GGGAGAGGGTTAAAACCTGTCAGGTATTGGTGTCAGATCGTCTGTTAGATGTGACACTAATTGTACTGGAGATGAGAGATTTCGATGTTATCCTGGGCATGGACTGGTTA
GCCGAGAATCACGCCATCATTGATTGCCGTCAAAAGGAAGTATTATTCAAGCCTCTAGAGGGAGCAAGCTTCAAGTTCAAGGGAATCAGATCAAAGTCTGTTCCTAAAGT
GGTATCTACGATAAAGGCTAGAAGGCTCAGGGATCGAGGAGCGTGGTGCTTTTTAGTCAGTGTGTCAGATACTCGAGTTGAAAAGGCGGGGATAAGCTCTGTACCAGTGG
TCAATGAGTTCATGGATGTATTCCCTGAAGACCTTCCAGGTTTGCTTCCGGTTCGAGAAGTGGATTTCAGCATAGAGCTTGAGCCAAGCACAACCTCTATTTCTAAGGCA
CCCTACAGAATGGCCCCAGCAGAGCTGAAAGAACTAAAGGTGCAGATACAGGAACTCCTGGACAAAGGTTTTATTCGTCCTAGCATTTCACCTTAG
Protein sequenceShow/hide protein sequence
MGFNPFNKYIIDLLKRVEMLDSKPSSTPVVSGTSLSRHGGDPLPNPMKYRSIVGALQYCTITRPDLSFAVNKVYQFLHAPTTIYLNFVKRILHYLKEDNLLFKYHDLANA
ATEVFWLADIMTKGLSSSHFQNLRVKLTIVNHSECPFALWGPAPNGAGNPRLSGEWGRERGKKTPRGGDGDGEAVPAPPRPVDIPISHVNGKQYLESEDEKDLDSHDLKA
LELSRKIREVDSNAVAKDKNEYGNVVRSQGNVGALSAESRFRILGLGRYKWYQSPVSRFYRLAYVVSIRIPIAKYNMHPRMRGRGLGRGHGRGQVPLEQVVPPVGHQADN
GPENQIPAPPVPAQIDRAGSTEGSKHLKDFKRYDPPAFNGKTVDPTVAEAWIAKMKEIFCYMGCPEPQQVPCAVFMLREDALMWWQSAERSISTTAGPVTWVQFKGTFFQ
KYYSTIIQYWKEEEFLALSQGNKSVEEYELEFPRLSRFAQEMVDTEAKKMKRFISGLRDDVQRVVGALDPTDYAAALRAATFMGMSAANAIPMTKEPESSTGQCLAGLGV
CFRCGKGGHMAAECPRGKNVDSRRPFGSNQARPAPREVQHMPLPARNLHARIELEPLGFSLLVSTPAGVELLARERVKTCQVLVSDRLLDVTLIVLEMRDFDVILGMDWL
AENHAIIDCRQKEVLFKPLEGASFKFKGIRSKSVPKVVSTIKARRLRDRGAWCFLVSVSDTRVEKAGISSVPVVNEFMDVFPEDLPGLLPVREVDFSIELEPSTTSISKA
PYRMAPAELKELKVQIQELLDKGFIRPSISP