; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0035351 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0035351
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrotrans_gag domain-containing protein
Genome locationchr3:20144527..20149051
RNA-Seq ExpressionLag0035351
SyntenyLag0035351
Gene Ontology termsNA
InterPro domainsIPR005162 - Retrotransposon gag domain
IPR021109 - Aspartic peptidase domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7947748.1 hypothetical protein I3843_14G109500 [Carya illinoinensis]3.3e-8333.38Show/hide
Query:  VFQGQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQA
        V  G  S I+  PI ANNFELK  LI  ++       P      +  +F  + D  K NGV+ED IRL LFPF L+DKAR WLQS+ PGSI +W  + + 
Subjt:  VFQGQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQA

Query:  FLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWP
        FL K FPPAK  +LR+EIG F+Q   E L+E WER+K+L+R+C    +P                                              +YQWP
Subjt:  FLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWP

Query:  SERSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIE--SAAALASRSQEETTEQ------------------------------------
        +ER+  K  VAG+ E++ + AL  Q+ +L++     +     QS E  ++ ++   S E + EQ                                    
Subjt:  SERSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIE--SAAALASRSQEETTEQ------------------------------------

Query:  -------------------------------------------------------CAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQ-
                                                                 +KN+E Q+GQL + ++   +   P+  E    E CKAIT+   
Subjt:  -------------------------------------------------------CAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQ-

Query:  EEAEEEPESEDYDTPTGEAGEDTSSDEAQKPE------------------PEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALE
        +E E  P  E   TPT  A    S D+ ++ E                    PPI +P L  P+  +++K  K    QF KF+++F  ++INIPFA+ALE
Subjt:  EEAEEEPESEDYDTPTGEAGEDTSSDEAQKPE------------------PEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALE

Query:  -MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQS
         MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ D GSF++PC+ G   F R LCDLGASIN++P S+C+KL +GE+K T + LQLAD+S
Subjt:  -MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQS

Query:  VVKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRV
        +  P  IIE+VL++V +F  P D  V+DM E+  +P+ILGR FL TG  +ID+++ ELT+RV
Subjt:  VVKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRV

KAG7990634.1 hypothetical protein I3843_02G035100 [Carya illinoinensis]9.5e-8333.59Show/hide
Query:  VFQGQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQA
        V  G  S I+  PI ANNFELK  LI  ++       P      +  +F  + D  K NGV+ED IRL LFPF L+DKAR WLQS+ PGSI +W  + + 
Subjt:  VFQGQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQA

Query:  FLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWP
        FL K FPPAK  +LR+EIG F+Q   E L+E WER+K+L+R+C    +P                                              +YQWP
Subjt:  FLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWP

Query:  SERSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIE--SAAALASRSQEETTEQ------------------------------------
        +ER+  K  VAG+ +++ + AL  Q+ +L++     +     QS E  ++ ++   S E + EQ                                    
Subjt:  SERSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIE--SAAALASRSQEETTEQ------------------------------------

Query:  -------------------------------------------------------CAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQ-
                                                                AIKNIE Q+GQL + ++   +   P+  E    E CKAIT+   
Subjt:  -------------------------------------------------------CAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQ-

Query:  EEAEEEPESEDYDTPT----GEAGEDTSSDEAQKPEPE-------------PPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALE-
        +E E  P  E   TPT    G++      DE      E             PPI +P L  P+  +++K  K    QF KF+++F  ++INIPFA+ALE 
Subjt:  EEAEEEPESEDYDTPT----GEAGEDTSSDEAQKPEPE-------------PPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALE-

Query:  MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSV
        MP Y +F+K+ ++KKR+ ++ +TV L+  CS  +Q+K+P+K+ D GSF++PC+ G   F + LCDLGASIN++PLS+C+KL + E+K T + LQLAD+S+
Subjt:  MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSV

Query:  VKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRV
          P  IIE+VL++V +F  P D  V+DM E+  +P+ILGR FL TG  +ID+++ ELT+RV
Subjt:  VKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRV

XP_038973113.1 uncharacterized protein LOC120105094 [Phoenix dactylifera]5.0e-8438.97Show/hide
Query:  GQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLK
        G Q  IV   + ANNFE+K GLIQ ++       P     ++   F  + D  K NGVS+DAIRL LFPF L+DKA+ WL S  P S T W+AL QAFL 
Subjt:  GQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLK

Query:  KIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKC-----LSMDIPTG--------------------FSYQWPSERSAPKNIVAGVFEVDKVNA
        K FPP K  KLR +I +F Q   E L+E WERFK+L RKC     +++D   G                     +YQW +ER  PK  V G+++VD +N 
Subjt:  KIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKC-----LSMDIPTG--------------------FSYQWPSERSAPKNIVAGVFEVDKVNA

Query:  LQPQMTSLANAFMKFSGTGSAQSIESAAALASRSQEETTEQCAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEEPESE-----
        L  ++ SL   F   S T    +  + A     +     +   +  ++  +  + +      K   P++ E    E+CKA+T+   +   +   E     
Subjt:  LQPQMTSLANAFMKFSGTGSAQSIESAAALASRSQEETTEQCAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEEPESE-----

Query:  --DYDTPTGEAGEDTSSDEAQKPEP-------EPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKE
          DYD  + +  E+   D A+   P        PPIP P        +R K+ K +Q QF+KF+ VF  L+INIPFA+AL ++P Y +F+KE ++KKRK 
Subjt:  --DYDTPTGEAGEDTSSDEAQKPEP-------EPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKRKE

Query:  KKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVGRFF
        +  +T+ L   CS  +Q K+P K+ D GSFS+PC+ G   F RALCDLGAS+ ++PLS+ +KL + E+K T + LQLAD+SV  P+ ++ENVLI+V +F 
Subjt:  KKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVGRFF

Query:  LPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG
        +P+D  V++M E+  +P+ILGR FL T G IIDI+   LT++VG
Subjt:  LPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG

XP_038976300.1 uncharacterized protein LOC120107204 [Phoenix dactylifera]2.8e-8234.21Show/hide
Query:  GQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLK
        G Q  IV   + ANNFE+K GLIQ ++       P     ++   F  + D  K NGVS+DAIRL LFPF L+DKA+ WL S  P S TTW+AL QAFL 
Subjt:  GQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLK

Query:  KIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKC-----------------------LSMDIPTG--------------------FSYQWPSER
        K FPP K  KLR +I +F Q   E L+E WERFK+L RKC                       +++D   G                     +YQW +ER
Subjt:  KIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKC-----------------------LSMDIPTG--------------------FSYQWPSER

Query:  SAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSA-------------------------------------------------------------
          PK  V G+++VD +N L  ++ SL   F K     S                                                              
Subjt:  SAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSA-------------------------------------------------------------

Query:  ---------------------QSIESAAALASRSQEETTEQCAIK---------NIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEE
                             QS E A    + +  E  E+   K         N+E QLGQL + +++  +   P++ E    E+CKA+T+   +   +
Subjt:  ---------------------QSIESAAALASRSQEETTEQCAIK---------NIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEE

Query:  PESE-------DYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKR
           E       DY+    +  E+   D A+ P P PP+      +P  ++ K+ K +   QF+KF+ VF  L+INIPFA+AL ++P Y +F+KE ++KKR
Subjt:  PESE-------DYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKR

Query:  KEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVGR
        K +  +T+ L   CS  +Q K+P K+ D GSFS+PC+ G   F RALCDLGAS++++PLS+ +KL + E+K T + LQLAD+SV  P+ I+ENVLI+V +
Subjt:  KEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVGR

Query:  FFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG
        F +P+D  V++M E+  +P+ILGR FL T G IID++   LT++VG
Subjt:  FFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG

XP_038976409.1 uncharacterized protein LOC113461320 [Phoenix dactylifera]1.6e-8234.37Show/hide
Query:  GQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLK
        G Q  IV   + ANNFE+K GLIQ ++       P     ++   F  + D  K NGVS+DAIRL LFPF L+DKA+ WL S  P S TTW+AL QAFL 
Subjt:  GQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLK

Query:  KIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKC-----------------------LSMDIPTG--------------------FSYQWPSER
        K FPP K  KLR +I +F Q   E L+E WERFK+L RKC                       +++D   G                     +YQW +ER
Subjt:  KIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKC-----------------------LSMDIPTG--------------------FSYQWPSER

Query:  SAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSA-------------------------------------------------------------
          PK  V G+++VD +N L  ++ SL   F K     S                                                              
Subjt:  SAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSA-------------------------------------------------------------

Query:  ---------------------QSIESAAALASRSQEETTEQCAIK---------NIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEE
                             QS E A    + +  E  E+   K         N+E QLGQL + +++  +   P++ E    E+CKA+T+   +   +
Subjt:  ---------------------QSIESAAALASRSQEETTEQCAIK---------NIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEE

Query:  PESE-------DYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKR
          SE       DY+    +  E+   D A+ P P PP+      +P  ++ K+ K +   QF+KF+ VF  L+INIPFA+AL ++P Y +F+KE ++KKR
Subjt:  PESE-------DYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEAL-EMPQYNRFMKEWLAKKR

Query:  KEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVGR
        K +  +T+ L   CS  +Q K+P K+ D GSFS+PC+ G   F RALCDLGAS++++PLS+ +KL + E+K T + LQLAD+SV  P+ I+ENVLI+V +
Subjt:  KEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVGR

Query:  FFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG
        F +P+D  V++M E+  +P+ILGR FL T G IID++   LT++VG
Subjt:  FFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG

TrEMBL top hitse value%identityAlignment
A0A6J0ZX64 LOW QUALITY PROTEIN: uncharacterized protein LOC1104129455.1e-7433.08Show/hide
Query:  QGQQSGIVYAPIIANNFELKTGLIQWLE-IVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAF
        QG    I    I ANNFE+K   IQ ++  V     P     S+ + F  + D  K NGV++DAIRL LFPF L+DKA+ WL S+  GSITTW+ L Q F
Subjt:  QGQQSGIVYAPIIANNFELKTGLIQWLE-IVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAF

Query:  LKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWPS
        L K FPPAK  K+R +I +F Q   E L+E WERFKELLR+C    IP                                              +YQWPS
Subjt:  LKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWPS

Query:  ERSAPKNIVAGVFEVDKVNALQPQMTSLA------------NAFMKFSGTGSAQSI-------ESAAALASRSQEET-----------------------
        ERS  +  V G +E+D +  L  Q+ +L+            N+ +     G + S        ES   + + ++++                        
Subjt:  ERSAPKNIVAGVFEVDKVNALQPQMTSLA------------NAFMKFSGTGSAQSI-------ESAAALASRSQEET-----------------------

Query:  --------------------------------------------TEQCAIKNIETQLGQLVSVVSTMNKSKAPAEQE--KTQMEYCKAIT---------V
                                                    ++  +++N+ETQ+GQL + ++   +   P++ +      E C+AIT         V
Subjt:  --------------------------------------------TEQCAIKNIETQLGQLVSVVSTMNKSKAPAEQE--KTQMEYCKAIT---------V

Query:  HQEEAEEEPESEDYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALE-MPQYNRFMKEWLAKK
        +Q+  E E E  D +   G    +    +    + E    S  +  P    ++ +K+  + QF KF+NVF  L+INIPFAEALE MP Y +F+K+ L+KK
Subjt:  HQEEAEEEPESEDYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALE-MPQYNRFMKEWLAKK

Query:  RKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVG
        RK  + +TV+L   CS  +Q K+P K+ D GSF++PC+ G   F +AL DLGASIN++P S+ +KL +GE K T V LQLAD+S V P  IIE+VL++V 
Subjt:  RKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLIRVG

Query:  RFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG
        +F  P+D  ++DM E+  +P+ILGR FL T G IID+   +++ +VG
Subjt:  RFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG

A0A6P6GGL5 LOW QUALITY PROTEIN: uncharacterized protein LOC1124928787.1e-6833.69Show/hide
Query:  KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIPTGF----
        K NGVS+D  RL LFP+ L+DKA+ WL S+   +ITTWD +   FL K+FPP+K  KL+++I  F Q   + L++ WERFKELLR+      PT      
Subjt:  KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIPTGF----

Query:  ---------------------------------------SYQWPSERSA-PKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAALASRS
                                               +YQ+PSER    ++ V  V +VD +N L  Q   LA  FMK   T   Q   +   L + S
Subjt:  ---------------------------------------SYQWPSERSA-PKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAALASRS

Query:  QEETTEQCAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEEPE----------SEDYDTPTGEAG-------------------
        Q    +  AIK++E Q+GQL +     ++   P++ EK   E  +AIT+   +    P+           ED ++P                        
Subjt:  QEETTEQCAIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEEPE----------SEDYDTPTGEAG-------------------

Query:  -EDTSSDEAQKPEPE---PP---------------IPSPTLMVPKEKKRKKKKKNNQV-------------------QFDKFMNVFMNLNINIPFAEALE
         +D    +A K  P    PP                P+P L  P +K ++ ++KN                      QF KF++VF  L++NIPF +ALE
Subjt:  -EDTSSDEAQKPEPE---PP---------------IPSPTLMVPKEKKRKKKKKNNQV-------------------QFDKFMNVFMNLNINIPFAEALE

Query:  -MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSV
         MP Y +F+KE L+ KR+ +  + V L+   S R+  ++P K+ D GSF +PC+   Y+F ALCDLGASIN++P S+ +KL +G++K T V LQ+AD+S+
Subjt:  -MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSV

Query:  VKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRV
         +P  I+E+VL++V +F  P D  ++DM E+ ++P+ILGR FL TG  +ID+++R++T+RV
Subjt:  VKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRV

A0A6P6TF62 Reverse transcriptase6.9e-7133.1Show/hide
Query:  NGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP---------
        NGVS++AIRL LFPF L+DKA+ WL S  P + TTWD L +AFL K FPP K  KLR +I  F Q   E L+E WERF++LLRKC    +P         
Subjt:  NGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP---------

Query:  TGFSYQWPSE-RSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAALASRSQEETTEQCA---------------------------
         G S+   +   +A    + G+ E+D +N L  QM ++     +  G G + S    A  +    E  T +C                            
Subjt:  TGFSYQWPSE-RSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAALASRSQEETTEQCA---------------------------

Query:  -----------------------------------------------------------------IKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYC
                                                                          +N+E Q+GQ+ S ++  N+ + P++ E    E+ 
Subjt:  -----------------------------------------------------------------IKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYC

Query:  KAITVHQ-EEAEEEPESEDYDTPTGEAGEDTSSDEA-------QKPEPEPPIPSPTLMVPKEKKRKKKKKNNQV--QFDKFMNVFMNLNINIPFAEA-LE
        KAIT+   ++ E+ P S      + E  E   + EA       Q P    P  S  + +P      ++ K N+    F+KF+ +F  L+INIPFA+A L+
Subjt:  KAITVHQ-EEAEEEPESEDYDTPTGEAGEDTSSDEA-------QKPEPEPPIPSPTLMVPKEKKRKKKKKNNQV--QFDKFMNVFMNLNINIPFAEA-LE

Query:  MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSV
        +P Y +F+KE + +KRK +  +T+ L   CS  +Q K+P K+ D GSFS+PC+ G+ +F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+
Subjt:  MPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSV

Query:  VKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG
          P+ ++ENVLI+V +F +P+D  V+DM E+ SMP+ILGR FL T G IID++  +L  ++G
Subjt:  VKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG

A0A6P6X9H2 Reverse transcriptase8.1e-7233.33Show/hide
Query:  KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------
        K NGVS++AIRL LFPF L+DKA+ WL S  P + TTWD L +AFL K FPP K  KLR +I  F Q   E L+ETWERF++LLRKC    +P       
Subjt:  KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLKKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------

Query:  --TGFSYQWPSE-RSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAALASRSQEETTEQCA-------------------------
           G S+   +   +A    + G+ E+D +N L  QM ++     +  G G + S    A  +    E  T +C                          
Subjt:  --TGFSYQWPSE-RSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAALASRSQEETTEQCA-------------------------

Query:  -------------------------------------------------------------------IKNIETQLGQLVSVVSTMNKSKAPAEQEKTQME
                                                                            +N+E Q+GQ+ S ++  N+ + P++ E    E
Subjt:  -------------------------------------------------------------------IKNIETQLGQLVSVVSTMNKSKAPAEQEKTQME

Query:  YCKAITVHQ-EEAEEEPESEDYDTPTGEAGEDTSSDEA-------QKPEPEPPIPSPTLMVPKEKKRKKKKKNNQV--QFDKFMNVFMNLNINIPFAEA-
        + KAIT+   ++ E+ P S      + E  E   + EA       Q P    P  S  + +P      ++ K N+    F+KF+ +F  L+INIPFA+A 
Subjt:  YCKAITVHQ-EEAEEEPESEDYDTPTGEAGEDTSSDEA-------QKPEPEPPIPSPTLMVPKEKKRKKKKKNNQV--QFDKFMNVFMNLNINIPFAEA-

Query:  LEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQ
        L++P Y +F+KE + +KRK +  +T+ L   CS  +Q K+P K+ D GSFS+PC+ G+ +F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+
Subjt:  LEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQ

Query:  SVVKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG
        S+  P+ ++ENVLI+V +F +P+D  V+DM E+ SMP+ILGR FL T G IID++  +L  ++G
Subjt:  SVVKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG

A0A6P6XAQ1 Reverse transcriptase6.7e-6630.46Show/hide
Query:  QGQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFL
        QG Q+ IV   + ANNFE+K  LIQ ++             S+   F  + D  K NGVSEDAI+L LFPF L+DKA+ WLQS  P + TTWD L +AFL
Subjt:  QGQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDG-KNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFL

Query:  KKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWPSE
         K FPP K  KLR +I +F QQ  E L+E WER++EL R+C    +P                                              +YQW +E
Subjt:  KKIFPPAKKVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIP-------------------------------------------TGFSYQWPSE

Query:  RSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAA----------------------------------------------------
        R   +   AG+ EVD +N L  +M ++     +  G+ S Q +  A+                                                     
Subjt:  RSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAA----------------------------------------------------

Query:  -------------------------------LASRSQEE-------TTEQC------------AIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCK
                                       LA+ S ++       TT++               +N+E QLGQ+ + V+  N+   P++ E    E+ K
Subjt:  -------------------------------LASRSQEE-------TTEQC------------AIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCK

Query:  AITVHQEEAEEEPESEDYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALEMPQYNRFMKEWL
        AIT+   +   EP       P   +G +    E +K        S      KE+K K+K + N++Q +    +               +P Y +F+KE +
Subjt:  AITVHQEEAEEEPESEDYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNVFMNLNINIPFAEALEMPQYNRFMKEWL

Query:  AKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLI
         KKRK    +T+ L   CS  +Q K+P K+ D GSF+VPC+ G   F +ALCDLGAS+++IPL++ ++L + E+K T + LQLAD+S+  P+ I+ENVLI
Subjt:  AKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSF-RALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLADQSVVKPVCIIENVLI

Query:  RVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG
        +V +F +P+D  V+DM E+ ++P+ILGR FL T G IID++R +   ++G
Subjt:  RVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAGAAGCAATCCAGACAGAATTAAACTCACTTTCAAACGTGAGGTTTTCGGACCAGTAGTCCAGACACCAGAAGATGTCAAGCCTGTGGGATACAAATGGTCAGG
ATTTGCTATCATTACTGTATATGTTGATGATTTGAATATAATTGGAACTCCTGAAGAGCTTCGAAAGGCAATAGAATATCTTAAGAAAGAATTTGAGATGAAAGATCTCG
GAAACAAAGTTTTGGGGGATTCGCTTTTGAGACTTCTTGGAGCCGTAAACAGGGCAGAAACAGAGACTGTTGGAGCCAAAGCAAAGGGAGGAAGTTGGAAATCAACCCAT
TCCGTGTTTCAGGGACAACAATCGGGGATTGTCTATGCCCCGATCATTGCCAACAACTTTGAGTTGAAGACCGGTCTCATTCAATGGCTCGAGATTGTGCTTATCGAGGA
TCGCCCACGAGGATCCAAATTCTCATATAAAATCATTTTTAGACATTTGTGGGACGGTAAAAATAATGGAGTTTCTGAGGATGCTATTCGCTTATGCTTATTTCCTTTTC
CTTTGCAGGATAAAGCACGAGATTGGTTGCAATCTATCACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCTTTTCTAAAGAAAATTTTCCCTCCTGCAAAG
AAGGTCAAGCTGAGGACTGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAGACCTGGGAGCGATTTAAAGAATTGCTGAGGAAGTGCCTCAGCATGGA
TATCCCGACTGGCTTTAGCTATCAGTGGCCATCTGAGCGGTCTGCACCTAAAAACATTGTTGCTGGAGTGTTTGAGGTTGACAAGGTAAATGCACTCCAGCCCCAGATGA
CCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGATCTCAGGAGGAGACCACAGAACAGTGT
GCCATTAAGAACATTGAGACTCAGCTGGGACAGTTGGTAAGTGTTGTAAGCACCATGAATAAAAGTAAGGCCCCAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAA
AGCCATCACTGTACACCAGGAGGAAGCTGAAGAGGAACCTGAGTCTGAGGATTATGACACGCCTACAGGGGAAGCTGGGGAGGACACATCATCAGATGAGGCTCAAAAGC
CTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAAGAAAAGAAAAGGAAGAAAAAGAAAAAGAACAATCAGGTTCAGTTTGACAAGTTTATGAATGTC
TTTATGAATCTGAACATTAATATTCCTTTTGCAGAAGCATTAGAAATGCCCCAGTATAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGA
CACGGTATATCTCGCTTCCACATGCAGCACCAGAGTACAACAGAAGGTACCTGAAAAAGTAGCAGATTCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTATTCAT
TTAGAGCTTTATGTGATTTAGGTGCTAGCATTAATATTATTCCTCTATCTCTGTGCAAAAAGTTAGTTATAGGTGAGATTAAAACTACTCCTGTAAAGCTCCAATTGGCT
GATCAGTCTGTGGTTAAACCAGTTTGCATTATAGAAAATGTTTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGATTTGTATGTTATGGATATGATGGAAAATCCTTC
AATGCCTGTTATATTAGGAAGATCATTCCTCGATACTGGGGGAGTGATTATTGATATTGAGCGTAGGGAGCTCACTATTAGAGTCGGAATGAAAAGAAATATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAGAAGCAATCCAGACAGAATTAAACTCACTTTCAAACGTGAGGTTTTCGGACCAGTAGTCCAGACACCAGAAGATGTCAAGCCTGTGGGATACAAATGGTCAGG
ATTTGCTATCATTACTGTATATGTTGATGATTTGAATATAATTGGAACTCCTGAAGAGCTTCGAAAGGCAATAGAATATCTTAAGAAAGAATTTGAGATGAAAGATCTCG
GAAACAAAGTTTTGGGGGATTCGCTTTTGAGACTTCTTGGAGCCGTAAACAGGGCAGAAACAGAGACTGTTGGAGCCAAAGCAAAGGGAGGAAGTTGGAAATCAACCCAT
TCCGTGTTTCAGGGACAACAATCGGGGATTGTCTATGCCCCGATCATTGCCAACAACTTTGAGTTGAAGACCGGTCTCATTCAATGGCTCGAGATTGTGCTTATCGAGGA
TCGCCCACGAGGATCCAAATTCTCATATAAAATCATTTTTAGACATTTGTGGGACGGTAAAAATAATGGAGTTTCTGAGGATGCTATTCGCTTATGCTTATTTCCTTTTC
CTTTGCAGGATAAAGCACGAGATTGGTTGCAATCTATCACCCCTGGGAGCATCACCACCTGGGATGCTTTGGTCCAGGCTTTTCTAAAGAAAATTTTCCCTCCTGCAAAG
AAGGTCAAGCTGAGGACTGAGATTGGGACATTCCAACAACAATATGATGAGCAGCTGTTCGAGACCTGGGAGCGATTTAAAGAATTGCTGAGGAAGTGCCTCAGCATGGA
TATCCCGACTGGCTTTAGCTATCAGTGGCCATCTGAGCGGTCTGCACCTAAAAACATTGTTGCTGGAGTGTTTGAGGTTGACAAGGTAAATGCACTCCAGCCCCAGATGA
CCTCCCTTGCTAATGCTTTTATGAAATTTTCAGGTACAGGGAGTGCACAGTCAATTGAATCAGCTGCTGCTTTAGCATCTAGATCTCAGGAGGAGACCACAGAACAGTGT
GCCATTAAGAACATTGAGACTCAGCTGGGACAGTTGGTAAGTGTTGTAAGCACCATGAATAAAAGTAAGGCCCCAGCTGAGCAAGAGAAAACCCAGATGGAGTACTGTAA
AGCCATCACTGTACACCAGGAGGAAGCTGAAGAGGAACCTGAGTCTGAGGATTATGACACGCCTACAGGGGAAGCTGGGGAGGACACATCATCAGATGAGGCTCAAAAGC
CTGAACCTGAGCCTCCTATTCCTTCTCCCACACTGATGGTTCCCAAAGAAAAGAAAAGGAAGAAAAAGAAAAAGAACAATCAGGTTCAGTTTGACAAGTTTATGAATGTC
TTTATGAATCTGAACATTAATATTCCTTTTGCAGAAGCATTAGAAATGCCCCAGTATAACAGGTTCATGAAGGAGTGGTTAGCAAAGAAGCGAAAGGAAAAGAAGGTTGA
CACGGTATATCTCGCTTCCACATGCAGCACCAGAGTACAACAGAAGGTACCTGAAAAAGTAGCAGATTCAGGGAGTTTTTCTGTTCCTTGCAGTTTTGGTACTTATTCAT
TTAGAGCTTTATGTGATTTAGGTGCTAGCATTAATATTATTCCTCTATCTCTGTGCAAAAAGTTAGTTATAGGTGAGATTAAAACTACTCCTGTAAAGCTCCAATTGGCT
GATCAGTCTGTGGTTAAACCAGTTTGCATTATAGAAAATGTTTTAATCAGAGTAGGTAGATTTTTCCTCCCTATTGATTTGTATGTTATGGATATGATGGAAAATCCTTC
AATGCCTGTTATATTAGGAAGATCATTCCTCGATACTGGGGGAGTGATTATTGATATTGAGCGTAGGGAGCTCACTATTAGAGTCGGAATGAAAAGAAATATTTAA
Protein sequenceShow/hide protein sequence
MERSNPDRIKLTFKREVFGPVVQTPEDVKPVGYKWSGFAIITVYVDDLNIIGTPEELRKAIEYLKKEFEMKDLGNKVLGDSLLRLLGAVNRAETETVGAKAKGGSWKSTH
SVFQGQQSGIVYAPIIANNFELKTGLIQWLEIVLIEDRPRGSKFSYKIIFRHLWDGKNNGVSEDAIRLCLFPFPLQDKARDWLQSITPGSITTWDALVQAFLKKIFPPAK
KVKLRTEIGTFQQQYDEQLFETWERFKELLRKCLSMDIPTGFSYQWPSERSAPKNIVAGVFEVDKVNALQPQMTSLANAFMKFSGTGSAQSIESAAALASRSQEETTEQC
AIKNIETQLGQLVSVVSTMNKSKAPAEQEKTQMEYCKAITVHQEEAEEEPESEDYDTPTGEAGEDTSSDEAQKPEPEPPIPSPTLMVPKEKKRKKKKKNNQVQFDKFMNV
FMNLNINIPFAEALEMPQYNRFMKEWLAKKRKEKKVDTVYLASTCSTRVQQKVPEKVADSGSFSVPCSFGTYSFRALCDLGASINIIPLSLCKKLVIGEIKTTPVKLQLA
DQSVVKPVCIIENVLIRVGRFFLPIDLYVMDMMENPSMPVILGRSFLDTGGVIIDIERRELTIRVGMKRNI