; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001246 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001246
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase
Genome locationchr4:27671025..27676592
RNA-Seq ExpressionLag0001246
SyntenyLag0001246
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6734747.1 hypothetical protein I3842_01G285500 [Carya illinoinensis]3.2e-10437.16Show/hide
Query:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG
        M +   + GSP++DPN HL  FL+IC TVK+N V+ED IRLRLFPFSL+DKA  WLQS+ PGSI +W  + + FL KFFPPAKT +LR++IG F+Q    
Subjt:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG

Query:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS
         L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SK  E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ++
Subjt:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS

Query:  SLANAFLKF----------------------------------------------------------------------------SESSNRSNKLQEAVI
        +L++                                                                                 S+ S +   L++A++
Subjt:  SLANAFLKF----------------------------------------------------------------------------SESSNRSNKLQEAVI

Query:  AINNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVA--KEKETTEPKEPTGEA----
        +     N  F                  A +K +E Q+GQL + +N   +G  P+  E +  E CKAIT+   ++   A  KE ++T      G++    
Subjt:  AINNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVA--KEKETTEPKEPTGEA----

Query:  -EEGTTSNEAEKLNPEPTI---PSPTVLVPK-------SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------
         EE   ++  E+ +  PTI    +P +L P         K+    +F KFL  F  +H+NIPF+DALEQMP+Y KF+K+ ++KK++              
Subjt:  -EEGTTSNEAEKLNPEPTI---PSPTVLVPK-------SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------

Query:  ----------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKEN
                                    ++ LCDLGASIN++   +C+KL +G +K T++ LQLAD+S+  P  I+E++L KV +F+ P DF VLD++E+
Subjt:  ----------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKEN

Query:  PAMPIILGRPFLATGRLKI
          +P+ILGRPFLATGR  I
Subjt:  PAMPIILGRPFLATGRLKI

KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]6.5e-10537.32Show/hide
Query:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG
        M +   + GSP++DPN HL  FL+IC TVK+N V+ED IRLRLFPFSL+DKA  WLQS+ PGSI +W  + + FL KFFPPAKT +LR++IG F+Q    
Subjt:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG

Query:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS
         L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SK  E A  LLE+MA+N+YQWP+ER+  KK+ AG+ E++ ++AL AQ++
Subjt:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS

Query:  SLANAFLKF----------------------------------------------------------------------------SESSNRSNKLQEAVI
        +L++                                                                                 S+ S +   L++A++
Subjt:  SLANAFLKF----------------------------------------------------------------------------SESSNRSNKLQEAVI

Query:  AINNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVA--KEKETTEPKEPTGEA----
        +     N  F                  A +K +E Q+GQL + +N   +G  P+  E +  E CKAIT+   ++   A  KE ++T      G++    
Subjt:  AINNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVA--KEKETTEPKEPTGEA----

Query:  -EEGTTSNEAEKLNPEPTI---PSPTVLVPK-------SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------
         EE   ++  E+ +  PTI    +P +L P         K+    +F KFL  F  +H+NIPF+DALEQMP+Y KF+K+ ++KK++              
Subjt:  -EEGTTSNEAEKLNPEPTI---PSPTVLVPK-------SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------

Query:  ----------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKEN
                                    ++ LCDLGASIN++  S+C+KL +G +K T++ LQLAD+S+  P  I+E++L KV +F+ P DF VLD++E+
Subjt:  ----------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKEN

Query:  PAMPIILGRPFLATGRLKI
          +P+ILGRPFLATGR  I
Subjt:  PAMPIILGRPFLATGRLKI

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]5.0e-10537.8Show/hide
Query:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG
        M +   + GSP++DPN HL  FL+IC TVK+N V+ED IRLRLFPFSL+DKA  WLQS+ PGSI +W  + + FL KFFPPAKT +LR++IG F+Q    
Subjt:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG

Query:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS
         L+EAWER+K+L+R+CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SK  E A  LLE+MA+N+YQWP+ER+  KK+ AG+ +++ ++AL AQ++
Subjt:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS

Query:  SLANAFLKF----------------------------------------------------------------------------SESSNRSNKLQEAVI
        +L++                                                                                 S+ S R   L++A++
Subjt:  SLANAFLKF----------------------------------------------------------------------------SESSNRSNKLQEAVI

Query:  AINNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVH--HEEKTRVAKEKETTEPKEPTGEA----
        +     N  F                  AAIK IE Q+GQL + +N   +G  P+  E +  E CKAIT+    E +    KE ++T      G++    
Subjt:  AINNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVH--HEEKTRVAKEKETTEPKEPTGEA----

Query:  -EEGTTSNEAEKLNPEPTI---PSPTVLVPK-------SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------
         E+   ++  E+ +  PTI    +P +L P         K+    +F KFL  F  +H+NIPF+DALEQMP+Y KF+K+ ++KK++              
Subjt:  -EEGTTSNEAEKLNPEPTI---PSPTVLVPK-------SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------

Query:  ----------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKEN
                                    +K LCDLGASIN++ LS+C+KL +  +K T++ LQLAD+S+  P  I+E++L KV +F+ P DF VLD++E+
Subjt:  ----------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKEN

Query:  PAMPIILGRPFLATGRLKI
          +P+ILGRPFLATGR  I
Subjt:  PAMPIILGRPFLATGRLKI

XP_022843226.1 uncharacterized protein LOC111366761 [Olea europaea var. sylvestris]2.3e-10238.11Show/hide
Query:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG
        M +   + G+ +EDPN+HL SFL+IC TVK+N V+EDAIRLRLF FSL+DKA  W QS+P GSITTWD L Q FL K+FPP+K+ +LR +I  F+Q    
Subjt:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG

Query:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS
          +EAWERFK+LLR+CPQHG+  W+Q+++FYNGL   T+T+VDAAAGG L++K  E+A  LL+D+ATNSYQWPSERSG KK+ AG+ E+D ++AL AQ++
Subjt:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS

Query:  SLANAFLK--------------------------------------------------------------------------------------------
        SL N  +                                                                                             
Subjt:  SLANAFLK--------------------------------------------------------------------------------------------

Query:  FSESSNRSNKLQEAVIAINNTVNGHFAAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVAKEKETTEPKEP---TGEAEEGTT
         SE+ +R NK +  +  I   V+   A +K +E Q+GQL +++ +  KGK P++ E +  E+C AIT+   +    +K K+   P      T E +    
Subjt:  FSESSNRSNKLQEAVIAINNTVNGHFAAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVAKEKETTEPKEP---TGEAEEGTT

Query:  SNEAE----------KLNPEPTIPSPTVLVPK--SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK------------------
          EAE               P I  P +  P+   KK    +F KFL  F  +H+NIPF++ L QMP+Y KF+KE ++ KKK                  
Subjt:  SNEAE----------KLNPEPTIPSPTVLVPK--SKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK------------------

Query:  -----------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPI
                               ++ALCD GASIN++ LS+ KKL +G +K T++ LQLAD+S+  P  ++E++L KV +F+LPVDF VLD++EN  +P+
Subjt:  -----------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPI

Query:  ILGRPFLATGRLKI
        ILGRPFLATGR  I
Subjt:  ILGRPFLATGRLKI

XP_023874613.1 uncharacterized protein LOC111987139 [Quercus suber]2.7e-10337.13Show/hide
Query:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG
        M +   + GSP++DPN HL  FL+IC T+K+N V+ED IRLRLFPFSL+DKA  WLQS+ PGSIT+W  + + FL KFFPPAKT +LR++IG F+Q    
Subjt:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG

Query:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS
         L+EAWER+K+L+R CPQHG PDWLQVQ+FYNGL   T+TIVDAA+GGTL+SK  E A +LLE+MA+N+YQWP+ER+  KK+ AG+ E++  +AL AQ++
Subjt:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS

Query:  SLANAFLKF--------------------------------------------------------------------------SESSNRSNKLQEAVIAI
        SL++                                                                               S+ S +   L++A+++ 
Subjt:  SLANAFLKF--------------------------------------------------------------------------SESSNRSNKLQEAVIAI

Query:  NNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVH--HEEKTRVAKEKETTEPKEPTGEAEEGTTS
               F                  A +K +E Q+GQL + +N   +G  P+  E +  E CKAIT+    E +   +KE ETT      G+++     
Subjt:  NNTVNGHF------------------AAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVH--HEEKTRVAKEKETTEPKEPTGEAEEGTTS

Query:  NEAEKLNPEPTIPSPTVLVPKS---------------KKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK----------------
         E  +     T   P++  P +               K+    +F KFL  F  +H+NIPF+DALEQMP+Y KF+K+ ++KK++                
Subjt:  NEAEKLNPEPTIPSPTVLVPKS---------------KKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK----------------

Query:  --------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPA
                                  +K LCDLGASIN++ LS+ +KL +G +K T++ LQLAD+S+  P  I+E++L KV +F+ P DF VLD++E+  
Subjt:  --------------------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPA

Query:  MPIILGRPFLATGR
        +P+ILGRPFLATGR
Subjt:  MPIILGRPFLATGR

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129451.1e-9740.37Show/hide
Query:  YRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDGQ-LFEA
        + G P +DPNSHL +FL+IC T K N V++DAIRLRLFPFSL+DKA  WL S+P GSITTW+ L Q FL KFFPPAKT K+R  I +F  QFDG+ L+EA
Subjt:  YRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDGQ-LFEA

Query:  WERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA--
        WERFKELLR+CP HG PDWLQVQ FYNGL  S KTI+DAAAGG L+SK    A  LLE+MA+N+YQWPSERSG +K A G +EID +  L  Q+++L+  
Subjt:  WERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMSSLA--

Query:  ----------NAFL-----------------------------------------------KFSESSN------------------------RSNKLQEA
                  N+ +                                                FS S+N                        + ++L+E 
Subjt:  ----------NAFL-----------------------------------------------KFSESSN------------------------RSNKLQEA

Query:  VIA-INNT---VNGHFAAIKTIETQLGQLVSVVNTMNKGKVPAEQE--KSSLEYCKAITVHHEEKTRVAKEKETTEPKEPTGEAE----EGTTSNE----
        ++  I+ T   +    A+++ +ETQ+GQL + +N   +G +P++ +      E C+AIT+      R  KE E    K    E E    EG   NE    
Subjt:  VIA-INNT---VNGHFAAIKTIETQLGQLVSVVNTMNKGKVPAEQE--KSSLEYCKAITVHHEEKTRVAKEKETTEPKEPTGEAE----EGTTSNE----

Query:  ------AEKLNPEPTI-PSPTVLVPKSKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------------------
              AE       I P P       K+  + +F KFL  F  LH+NIPF++ALEQMP Y KF+K+ L+KK+K                          
Subjt:  ------AEKLNPEPTI-PSPTVLVPKSKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK--------------------------

Query:  ----------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFL
                         KAL DLGASIN++  S+ +KL +G  K TSV LQLAD+S V P  I+E++L KV +F+ PVDF +LD++E+  +PIILGRPFL
Subjt:  ----------------EKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFL

Query:  AT
        AT
Subjt:  AT

A0A6J1DU19 uncharacterized protein LOC1110243613.4e-8339.05Show/hide
Query:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG
        M R+  +RG+  EDPN+HL  FLD+CGTVK+N V +DAIRLRLFP SLQDK                  +VQAFL  FFPPAKT +LRT+I +F++    
Subjt:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG

Query:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS
        QLFE WER+KELLRKCPQHG  +WLQ+Q+FYNGL   T+TI+DAAAGGTLLS+  E+A  LL+DMA NS+QWPSERS  KK+ AG++EID +S+L+AQ+ 
Subjt:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS

Query:  SLANAFLKFS-------------------------------------------------ESSNRSNKLQEAVIAINNTVNGHFAAIKTIETQLGQLVSVV
        +L NA  K S                                                 E  +R+++++  V  +   + G+  +IK +E Q+GQ+   +
Subjt:  SLANAFLKFS-------------------------------------------------ESSNRSNKLQEAVIAINNTVNGHFAAIKTIETQLGQLVSVV

Query:  NTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVAKEKETTEPKEPTGEAEEGTTSNEAEKLNPEPTIPSPTVLVPKSKKNSKAKFDKFLYAFMGLHVNIP
        NTM KGK P++ E    E+CKA+T+   ++ +  ++K+  EP   T E E        ++  P      PT  +  S  NS                 +P
Subjt:  NTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVAKEKETTEPKEPTGEAEEGTTSNEAEKLNPEPTIPSPTVLVPKSKKNSKAKFDKFLYAFMGLHVNIP

Query:  F-SDALEQMPHYRKFMKEWLNKKKKEKALCDLGASIN---IIHLSLCKKLNIGG-------IKSTSVRLQLADQSV---VSPYVIVENILNKVGRFLLPV
        +   ALEQMP+Y +FMK+ +  K+K +A   +  +     I+   L +KL   G       I S+S    L D      + P  ++E++L KV R + P 
Subjt:  F-SDALEQMPHYRKFMKEWLNKKKKEKALCDLGASIN---IIHLSLCKKLNIGG-------IKSTSVRLQLADQSV---VSPYVIVENILNKVGRFLLPV

Query:  DFFVLDIKENPAMPIILGRPFLATG
        DF VL  +E+  +PIILGR FLATG
Subjt:  DFFVLDIKENPAMPIILGRPFLATG

A0A6P6GGL5 LOW QUALITY PROTEIN: uncharacterized protein LOC1124928782.4e-8136.7Show/hide
Query:  HLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDGQLFEAWERFKELLRKCP
        H+ S L +C   K N VS+D  RLRLFP+SL+DKA  WL S+P  +ITTWD +   FL K FPP+KT KL++ I  F Q     L++AWERFKELLR+ P
Subjt:  HLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDGQLFEAWERFKELLRKCP

Query:  QHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLK-FSESSNRS
         HG+P W+QVQ+FYNGL    K +VDAAAGG+L+ K    A  L+E+MA N+YQ+PSER    +      E  +++ L AQ   LA  F+K  +    + 
Subjt:  QHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLK-FSESSNRS

Query:  NKLQEAVIAINNTVNGHFAAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVAKE-----------------------------
         + ++ +   +   N   AAIK++E Q+GQL +     ++G +P++ EK+  E  +AIT+   ++    KE                             
Subjt:  NKLQEAVIAINNTVNGHFAAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVAKE-----------------------------

Query:  KETTEPKEPTGEAEEGTTS--------------NEAE-------KLNPEPTIPSPTVLVPKSKKNSKA-----------------------KFDKFLYAF
        K   E  E     +EG +                EAE       KL P P +  P   + K K+N +                        +F KFL  F
Subjt:  KETTEPKEPTGEAEEGTTS--------------NEAE-------KLNPEPTIPSPTVLVPKSKKNSKA-----------------------KFDKFLYAF

Query:  MGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEK-----------------------------------------ALCDLGASINIIHLSLCKKLNIGGI
          LHVNIPF DALEQMP Y KF+KE L+ K++ +                                         ALCDLGASIN++  S+ +KL +G +
Subjt:  MGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEK-----------------------------------------ALCDLGASINIIHLSLCKKLNIGGI

Query:  KSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFLATGRLKI
        K T+V LQ+AD+S+  P  I+E++L KV +F+ P DF +LD++E+  +PIILGRPFLATGR  I
Subjt:  KSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFLATGRLKI

A0A6P6X9H2 Reverse transcriptase2.2e-8235.69Show/hide
Query:  MEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDGQLFEAWERFKE
        +EDPNSHL +FL+IC T+K+N VS++AIRLRLFPFSL+DKA  WL S  P + TTWD L +AFL K+FPP KT KLR  I  F Q     L+E WERF++
Subjt:  MEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDGQLFEAWERFKE

Query:  LLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVF---------EIDNVSALQAQMSSL
        LLRKCP HG P+WL +Q FYNGL+ STKT++DAAAGG L+  I      +L     N  +  S + G    A+            E D    + ++    
Subjt:  LLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVF---------EIDNVSALQAQMSSL

Query:  ANAFLKFSESSNRSN-----------------------------------------------KLQEAVIAINNTVNGHFAAIKT----IETQLGQLVSVV
         N + + ++++  SN                                               KL +        V G    + T    +E Q+GQ+ S +
Subjt:  ANAFLKFSESSNRSN-----------------------------------------------KLQEAVIAINNTVNGHFAAIKT----IETQLGQLVSVV

Query:  NTMNKGKVPAEQEKSSLEYCKAITVHH----EEKTRVAKEKETTEPKEPTGEAEEGTTSNEAEKLNPEPTIPS--------PTVLVPKSKKNSK--AKFD
        N  N+G++P++ E +  E+ KAIT+      E+      +K+ +E        +E T    + + NP  T PS        P V  P+  K +K    F+
Subjt:  NTMNKGKVPAEQEKSSLEYCKAITVHH----EEKTRVAKEKETTEPKEPTGEAEEGTTSNEAEKLNPEPTIPS--------PTVLVPKSKKNSK--AKFD

Query:  KFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK------------------------------------------EKALCDLGASINIIHLSLCK
        KF+  F  LH+NIPF+DA+ Q+P Y KF+KE + +K+K                                           KALCDLGAS+++I L++ +
Subjt:  KFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKK------------------------------------------EKALCDLGASINIIHLSLCK

Query:  KLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFLAT
        +L +  +K T++ LQLAD+S+  P  ++EN+L KV +F++PVDF VLD++E+ +MPIILGRPFLAT
Subjt:  KLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFLAT

A0A6P6XAQ1 Reverse transcriptase7.5e-8335.47Show/hide
Query:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG
        M +   Y G+  EDPNSHL +FL+IC T+K N VSEDAI+LRLFPFSL+DKA  WLQS PP + TTWD L +AFL KFFPP KT KLR  I +F QQ   
Subjt:  MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDG

Query:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS
         L+EAWER++EL R+CP HG PDWL VQ FYNGLT  TKT VDAAAGG L+ K  E A+ L+E+MA N+YQW +ER G  +  AG+ E+D ++ L A+M 
Subjt:  QLFEAWERFKELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMS

Query:  SLAN------------------------------------------------------------------------------------------------
        ++                                                                                                  
Subjt:  SLAN------------------------------------------------------------------------------------------------

Query:  --AFLKFSESSN-RSNKLQEAVIAINNTVNGHFAAI----KTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKT------------RVA
          A  K + +SN +  KL  A       + G    +    + +E QLGQ+ + VN  N+G +P++ E +  E+ KAIT+   ++                
Subjt:  --AFLKFSESSN-RSNKLQEAVIAINNTVNGHFAAI----KTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKT------------RVA

Query:  KEKETTEPKEPTGEAEEGTTSNEAEKLNPEPTIPSPTVLVPKSKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKE---------
        + K+ +E KE + E E+G    E  +L  E   P P  +           + KFL   M     +  S+ +         ++  L  K K+         
Subjt:  KEKETTEPKEPTGEAEEGTTSNEAEKLNPEPTIPSPTVLVPKSKKNSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKE---------

Query:  -------KALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFLAT
               KALCDLGAS+++I L++ ++L +  +K T++ LQLAD+S+  P  I+EN+L KV +F++PVDF VLD++E+  +PIILGRPFLAT
Subjt:  -------KALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFFVLDIKENPAMPIILGRPFLAT

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGAGATTGTGCTTACAGAGGATCACCCATGGAGGATCCAAATTCCCACCTGAAATCCTTCTTAGACATATGCGGGACGGTAAAGTTAAATGATGTTTCTGAAGA
TGCCATTCGCTTACGTTTATTTCCTTTTTCTCTGCAGGATAAAGCGAGTGACTGGTTACAATCTATTCCACCTGGGAGCATCACCACTTGGGATGCTTTGGTCCAAGCAT
TCCTAAAGAAGTTCTTCCCTCCTGCTAAGACGGTCAAGCTGAGGACCAAAATTGGGACATTCCAGCAACAATTCGATGGGCAGCTGTTCGAGGCTTGGGAGAGATTTAAA
GAGCTGCTAAGGAAGTGCCCTCAGCATGGCTACCCCGACTGGCTCCAGGTTCAATTATTTTATAATGGCTTAACTCCTAGCACAAAAACTATCGTTGATGCAGCTGCAGG
TGGGACTCTGTTGTCCAAAATTGTTGAGAGTGCTAGGACTTTGTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCGGGACCCAAAAAGATTGCTG
CCGGAGTGTTTGAGATCGATAACGTAAGTGCACTGCAGGCCCAGATGTCTTCCTTGGCTAATGCGTTTTTAAAGTTTTCAGAATCAAGTAACAGGTCAAATAAGCTCCAG
GAGGCCGTGATCGCCATAAACAACACTGTCAATGGTCATTTTGCAGCCATTAAGACTATAGAGACTCAGTTGGGACAACTGGTAAGTGTTGTCAACACCATGAACAAAGG
TAAAGTCCCAGCTGAGCAAGAGAAATCTTCGTTGGAGTACTGCAAGGCCATCACTGTGCATCACGAGGAGAAGACTCGAGTAGCTAAGGAAAAGGAGACTACTGAACCAA
AGGAACCCACTGGAGAAGCTGAAGAAGGCACCACCTCAAACGAAGCTGAAAAGCTTAATCCTGAGCCTACTATCCCTTCTCCTACTGTTTTAGTTCCTAAATCAAAGAAA
AATTCTAAGGCTAAGTTTGACAAGTTTTTATATGCTTTCATGGGATTGCATGTTAATATTCCTTTTTCAGATGCCTTGGAGCAGATGCCGCACTACAGAAAATTCATGAA
GGAATGGCTCAACAAGAAGAAAAAGGAAAAAGCATTATGTGACTTAGGGGCTAGCATAAACATAATTCATCTATCATTATGCAAAAAGCTTAACATAGGAGGAATAAAAT
CTACTTCTGTTAGACTTCAGTTGGCTGACCAATCTGTGGTTAGTCCATATGTGATTGTTGAGAATATTCTGAACAAAGTAGGTAGATTTCTCCTTCCTGTTGATTTCTTT
GTGTTGGATATTAAAGAAAATCCTGCCATGCCTATTATATTAGGGAGACCATTCCTTGCTACAGGGAGGCTCAAAATTGTTATGTACTACTGGTCTGGGACATATCTCAA
ATCTTTCAAGAGTCTGACAGAAGTCGTGGAGATCCATGCCATTCCGAACGAAGACGAAGAAGAAGAAGAAAGGAGGACTGAGGATGAGAATAATGAGAAGAAAAAGCAGT
GTCGTCCGTCTTGTTGCAAACTTGCGAGACATTTGTTGGGAATCTGGGATGTGAAGAAAGAAGAACTAAGAAGACAAAGGGTTGCGAGACATTTGTTGGGAATCTGGGAT
GTGAAGAAAGAAGAACTAAGAAGACAAAGGGTTGCGAGACATTTGTTGGGAATCTGGGATGTCAAGAAAGAAGAACTAAGAAGACAAAGGGGAAGGCGAAGCGACAGGGT
TGATGAGAGAGAAAAAAGTGGCTGGAATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGAGATTGTGCTTACAGAGGATCACCCATGGAGGATCCAAATTCCCACCTGAAATCCTTCTTAGACATATGCGGGACGGTAAAGTTAAATGATGTTTCTGAAGA
TGCCATTCGCTTACGTTTATTTCCTTTTTCTCTGCAGGATAAAGCGAGTGACTGGTTACAATCTATTCCACCTGGGAGCATCACCACTTGGGATGCTTTGGTCCAAGCAT
TCCTAAAGAAGTTCTTCCCTCCTGCTAAGACGGTCAAGCTGAGGACCAAAATTGGGACATTCCAGCAACAATTCGATGGGCAGCTGTTCGAGGCTTGGGAGAGATTTAAA
GAGCTGCTAAGGAAGTGCCCTCAGCATGGCTACCCCGACTGGCTCCAGGTTCAATTATTTTATAATGGCTTAACTCCTAGCACAAAAACTATCGTTGATGCAGCTGCAGG
TGGGACTCTGTTGTCCAAAATTGTTGAGAGTGCTAGGACTTTGTTAGAGGATATGGCCACCAACAGCTATCAGTGGCCATCTGAGCGGTCGGGACCCAAAAAGATTGCTG
CCGGAGTGTTTGAGATCGATAACGTAAGTGCACTGCAGGCCCAGATGTCTTCCTTGGCTAATGCGTTTTTAAAGTTTTCAGAATCAAGTAACAGGTCAAATAAGCTCCAG
GAGGCCGTGATCGCCATAAACAACACTGTCAATGGTCATTTTGCAGCCATTAAGACTATAGAGACTCAGTTGGGACAACTGGTAAGTGTTGTCAACACCATGAACAAAGG
TAAAGTCCCAGCTGAGCAAGAGAAATCTTCGTTGGAGTACTGCAAGGCCATCACTGTGCATCACGAGGAGAAGACTCGAGTAGCTAAGGAAAAGGAGACTACTGAACCAA
AGGAACCCACTGGAGAAGCTGAAGAAGGCACCACCTCAAACGAAGCTGAAAAGCTTAATCCTGAGCCTACTATCCCTTCTCCTACTGTTTTAGTTCCTAAATCAAAGAAA
AATTCTAAGGCTAAGTTTGACAAGTTTTTATATGCTTTCATGGGATTGCATGTTAATATTCCTTTTTCAGATGCCTTGGAGCAGATGCCGCACTACAGAAAATTCATGAA
GGAATGGCTCAACAAGAAGAAAAAGGAAAAAGCATTATGTGACTTAGGGGCTAGCATAAACATAATTCATCTATCATTATGCAAAAAGCTTAACATAGGAGGAATAAAAT
CTACTTCTGTTAGACTTCAGTTGGCTGACCAATCTGTGGTTAGTCCATATGTGATTGTTGAGAATATTCTGAACAAAGTAGGTAGATTTCTCCTTCCTGTTGATTTCTTT
GTGTTGGATATTAAAGAAAATCCTGCCATGCCTATTATATTAGGGAGACCATTCCTTGCTACAGGGAGGCTCAAAATTGTTATGTACTACTGGTCTGGGACATATCTCAA
ATCTTTCAAGAGTCTGACAGAAGTCGTGGAGATCCATGCCATTCCGAACGAAGACGAAGAAGAAGAAGAAAGGAGGACTGAGGATGAGAATAATGAGAAGAAAAAGCAGT
GTCGTCCGTCTTGTTGCAAACTTGCGAGACATTTGTTGGGAATCTGGGATGTGAAGAAAGAAGAACTAAGAAGACAAAGGGTTGCGAGACATTTGTTGGGAATCTGGGAT
GTGAAGAAAGAAGAACTAAGAAGACAAAGGGTTGCGAGACATTTGTTGGGAATCTGGGATGTCAAGAAAGAAGAACTAAGAAGACAAAGGGGAAGGCGAAGCGACAGGGT
TGATGAGAGAGAAAAAAGTGGCTGGAATTAA
Protein sequenceShow/hide protein sequence
MARDCAYRGSPMEDPNSHLKSFLDICGTVKLNDVSEDAIRLRLFPFSLQDKASDWLQSIPPGSITTWDALVQAFLKKFFPPAKTVKLRTKIGTFQQQFDGQLFEAWERFK
ELLRKCPQHGYPDWLQVQLFYNGLTPSTKTIVDAAAGGTLLSKIVESARTLLEDMATNSYQWPSERSGPKKIAAGVFEIDNVSALQAQMSSLANAFLKFSESSNRSNKLQ
EAVIAINNTVNGHFAAIKTIETQLGQLVSVVNTMNKGKVPAEQEKSSLEYCKAITVHHEEKTRVAKEKETTEPKEPTGEAEEGTTSNEAEKLNPEPTIPSPTVLVPKSKK
NSKAKFDKFLYAFMGLHVNIPFSDALEQMPHYRKFMKEWLNKKKKEKALCDLGASINIIHLSLCKKLNIGGIKSTSVRLQLADQSVVSPYVIVENILNKVGRFLLPVDFF
VLDIKENPAMPIILGRPFLATGRLKIVMYYWSGTYLKSFKSLTEVVEIHAIPNEDEEEEERRTEDENNEKKKQCRPSCCKLARHLLGIWDVKKEELRRQRVARHLLGIWD
VKKEELRRQRVARHLLGIWDVKKEELRRQRGRRSDRVDEREKSGWN