; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0026623 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0026623
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr10:39741837..39750704
RNA-Seq ExpressionLag0026623
SyntenyLag0026623
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
OMO57964.1 reverse transcriptase [Corchorus capsularis]2.3e-4928.52Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLST----------GMGVQ-------------------------------TQIQG
        +FFADD+LLF R    +  A+  +L+ +E ASGQ IN DKS I FS +T           +GVQ                                ++QG
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLST----------GMGVQ-------------------------------TQIQG

Query:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT
        W+GK+    G+ VL++++ QAIP Y M+CF+ PK  + +++  M RFWW   K   RIHW SW  +C  K  GG+GFRD E FN ALLAKQCWR+     
Subjt:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT

Query:  SFISRVLKGRYFPSGDFLDA-----------------GWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSS
        S   RV + +YF  G F++A                  W+ +LL+  F   +V  IL+IPL      D +IW+    G YSV+SGY + +  LL   P  
Subjt:  SFISRVLKGRYFPSGDFLDA-----------------GWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSS

Query:  SSSEALSMWSAWGIMYALLLAMQIFREL--LMGSEWEFLLQSVQANSMLNLLRDVKDKV--------------------------DWA------------
           + L+  S WG +    +   ++R +  ++ S+ +  ++ VQ +S+ ++  + +  V                          +W+            
Subjt:  SSSEALSMWSAWGIMYALLLAMQIFREL--LMGSEWEFLLQSVQANSMLNLLRDVKDKV--------------------------DWA------------

Query:  -----KFEELVVVLWVVWCCRNQQTFRG--RVPSV-----------------------------------------NVDAAYCEGLSRVGAGVVIWDEVG
               E +  +LW++W  RN+  + G  + PSV                                         N DAAYC      G GVVI D  G
Subjt:  -----KFEELVVVLWVVWCCRNQQTFRG--RVPSV-----------------------------------------NVDAAYCEGLSRVGAGVVIWDEVG

Query:  LVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDLALVILETDSMR
         V+ +A      V +S  AE  A++ G+ +     L  V  E+DS++
Subjt:  LVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDLALVILETDSMR

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]1.8e-5730.28Show/hide
Query:  VQTQIQGWKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCW
        V   +QGWK K+F   G+EVL+K++ QAIPCYTM+CFRLPK+LI++      RFWW   KED++IHWV+W ++  PKC GGMGFRDLE FN+ALLAKQCW
Subjt:  VQTQIQGWKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCW

Query:  RIDQRPTSFISRVLKGRYFPSGDFLDA-----------------------------------------------------------------------GW
        RI   P S +SRVLKGRYF    F++A                                                                       GW
Subjt:  RIDQRPTSFISRVLKGRYFPSGDFLDA-----------------------------------------------------------------------GW

Query:  NENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALL----AQTPSSSSSEALSMW--SAWGIMYALLLAMQIFRELL----
          +++R  F+  E   IL+IP+   + ED++IW++EK GVYSV+SGY+   +ALL     Q PSSSSSE +  W    W +     + + ++R  L    
Subjt:  NENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALL----AQTPSSSSSEALSMW--SAWGIMYALLLAMQIFRELL----

Query:  ------------------MGSEWEFLL---------QSVQANSMLN------LLRDVKDKVDWAKFEELVVVLWVVWCCRNQQTFR--------------
                           G   E  +         +++  NS         +LR+  + +  A FEEL VV+W +W  RN + F               
Subjt:  ------------------MGSEWEFLL---------QSVQANSMLN------LLRDVKDKVDWAKFEELVVVLWVVWCCRNQQTFR--------------

Query:  -------------------GRVPS---------------VNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDL
                           GRV +               +N DA++       G G++I ++ G VM +A    +++++  +AE +A V+G++L  E+ +
Subjt:  -------------------GRVPS---------------VNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDL

Query:  ALVILETDSMRVYSLLHDSAMVDLSEFGVLVLEAR
                           A+ DLSE G +VL+A+
Subjt:  ALVILETDSMRVYSLLHDSAMVDLSEFGVLVLEAR

XP_024042628.1 uncharacterized protein LOC112099434 [Citrus clementina]9.0e-5428.49Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFS-------LST----------------------------------GMGVQTQIQG
        + FADDSL+F R  + +   +  I  CY   SGQ  NFDKS I FS       +ST                                   + + ++I  
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFS-------LST----------------------------------GMGVQTQIQG

Query:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT
        W+ K F   G+EVL+K++ QA+P Y M+ F+LP     DI + + R+WW   ++ R IHW SW  + + K  GG+GFRDL +FNQAL+AKQ WRI Q   
Subjt:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT

Query:  SFISRVLKGRYFPSGDFL------------------------------------------DAGWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFE
        S +++VL+ +YF  G+F                                           D  W E LL   F   +  +I+ IPL    S D+VIWH++
Subjt:  SFISRVLKGRYFPSGDFL------------------------------------------DAGWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFE

Query:  KCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWSAWGIMYALLLAMQIFRELLMGSEWEFLLQSVQANSMLNLLRDVKDKVDWAKFEELVVVLWVVWCC
        + G YSVKSGY   Q+AL  + P + SS   +   +  I +A L+  +  R+    +++    + +    ML + +++  K+    FE L V  W +W  
Subjt:  KCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWSAWGIMYALLLAMQIFRELLMGSEWEFLLQSVQANSMLNLLRDVKDKVDWAKFEELVVVLWVVWCC

Query:  RNQQTFRGRVP----------------------------------------------SVNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSY
        RN+  F  + P                                               VNVDAA  +  +  G G VI D  G ++ +A  +     N  
Subjt:  RNQQTFRGRVP----------------------------------------------SVNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSY

Query:  LAEGLAMVDGMRLVVEMDLALVILETDSMRVYSLLHD
         AE  A+  G+++ V+  L  +I+ETDS  V  L+++
Subjt:  LAEGLAMVDGMRLVVEMDLALVILETDSMRVYSLLHD

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]2.3e-4934.04Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQIQ-----------------------------------------G
        +FFADDSLLF +  +    A+   L  Y RASGQ +N DKS++SFS +T + VQ   Q                                          
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQIQ-----------------------------------------G

Query:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT
        W  KIF   G+EVLLK++VQ+IP Y M+CFRLP KL  +I   M +FWW    ++++IHW  W+ +CK K  GGMGFR    FNQALLAKQ WRI Q PT
Subjt:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT

Query:  SFISRVLKGRYFPSGDFLDA---------------------------------------------------------------------GWNENLLRHHF
        S +SRVLKG YF   DF+ A                                                                      WN   L+  F
Subjt:  SFISRVLKGRYFPSGDFLDA---------------------------------------------------------------------GWNENLLRHHF

Query:  SSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWSAWGIMYALLLAMQI
        S+ +V +IL IPL  +   D+ IWH+E  G YSV SGY L   + L +   SS S     W  W   + L L  ++
Subjt:  SSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWSAWGIMYALLLAMQI

XP_030483481.1 uncharacterized protein LOC115700065 [Cannabis sativa]2.3e-4932.98Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQ-----------------------------------------IQG
        +FFADDSLLF    E    A+   L  Y +ASGQ +N DKS++SFS +T +  Q Q                                         ++ 
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQ-----------------------------------------IQG

Query:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT
        W  KIF   G+E+LLK++VQ+IP Y M+CF+LP  L   +   M+ FWW  ++   +IHW SWK +CK K  GGMGFR    +NQALLAKQ WR+   P+
Subjt:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT

Query:  SFISRVLKGRYFPSGDFLDA---------------------------------------------------------------------GWNENLLRHHF
        S +SR+LK RYFP   FL+A                                                                      WN+ LL   F
Subjt:  SFISRVLKGRYFPSGDFLDA---------------------------------------------------------------------GWNENLLRHHF

Query:  SSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWSAWGIMYALLLAMQI
        SS +V  ILTIPL + S+ D +IWH+   G+Y V SGY    +A L  +  SS+S + + W  W   + L L  +I
Subjt:  SSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWSAWGIMYALLLAMQI

TrEMBL top hitse value%identityAlignment
A0A1R3GIN3 Reverse transcriptase1.1e-4928.52Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLST----------GMGVQ-------------------------------TQIQG
        +FFADD+LLF R    +  A+  +L+ +E ASGQ IN DKS I FS +T           +GVQ                                ++QG
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLST----------GMGVQ-------------------------------TQIQG

Query:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT
        W+GK+    G+ VL++++ QAIP Y M+CF+ PK  + +++  M RFWW   K   RIHW SW  +C  K  GG+GFRD E FN ALLAKQCWR+     
Subjt:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT

Query:  SFISRVLKGRYFPSGDFLDA-----------------GWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSS
        S   RV + +YF  G F++A                  W+ +LL+  F   +V  IL+IPL      D +IW+    G YSV+SGY + +  LL   P  
Subjt:  SFISRVLKGRYFPSGDFLDA-----------------GWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSS

Query:  SSSEALSMWSAWGIMYALLLAMQIFREL--LMGSEWEFLLQSVQANSMLNLLRDVKDKV--------------------------DWA------------
           + L+  S WG +    +   ++R +  ++ S+ +  ++ VQ +S+ ++  + +  V                          +W+            
Subjt:  SSSEALSMWSAWGIMYALLLAMQIFREL--LMGSEWEFLLQSVQANSMLNLLRDVKDKV--------------------------DWA------------

Query:  -----KFEELVVVLWVVWCCRNQQTFRG--RVPSV-----------------------------------------NVDAAYCEGLSRVGAGVVIWDEVG
               E +  +LW++W  RN+  + G  + PSV                                         N DAAYC      G GVVI D  G
Subjt:  -----KFEELVVVLWVVWCCRNQQTFRG--RVPSV-----------------------------------------NVDAAYCEGLSRVGAGVVIWDEVG

Query:  LVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDLALVILETDSMR
         V+ +A      V +S  AE  A++ G+ +     L  V  E+DS++
Subjt:  LVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDLALVILETDSMR

A0A2N9I833 Uncharacterized protein6.5e-5039.12Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQIQ--------------GWKGKIFLCRGREVLLKSIVQAIPCYTM
        +FFADDSLLF++    E R +  IL  YE ASGQ IN  K+ + FS ST   ++  IQ              GWK K+    GRE+L+KS+ QAIP Y M
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQIQ--------------GWKGKIFLCRGREVLLKSIVQAIPCYTM

Query:  NCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLDAGWNEN--
        +CFRLP +LI++I   + RFWW    E  ++HW+SW + C+ K  GG+GFR+L++FN+ALLAKQ WR+    TS   +V K +YFP    LDA  N    
Subjt:  NCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLDAGWNEN--

Query:  -------------------------LLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYR--LGQMA------LLAQTPSSSSS
                                 +++ +F S E   IL IPL +  ++D +IW   K G Y+V+SGY   LG+ A      L   TP+S  +
Subjt:  -------------------------LLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYR--LGQMA------LLAQTPSSSSS

A0A6J1DAR4 uncharacterized protein LOC1110189548.5e-5830.28Show/hide
Query:  VQTQIQGWKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCW
        V   +QGWK K+F   G+EVL+K++ QAIPCYTM+CFRLPK+LI++      RFWW   KED++IHWV+W ++  PKC GGMGFRDLE FN+ALLAKQCW
Subjt:  VQTQIQGWKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCW

Query:  RIDQRPTSFISRVLKGRYFPSGDFLDA-----------------------------------------------------------------------GW
        RI   P S +SRVLKGRYF    F++A                                                                       GW
Subjt:  RIDQRPTSFISRVLKGRYFPSGDFLDA-----------------------------------------------------------------------GW

Query:  NENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALL----AQTPSSSSSEALSMW--SAWGIMYALLLAMQIFRELL----
          +++R  F+  E   IL+IP+   + ED++IW++EK GVYSV+SGY+   +ALL     Q PSSSSSE +  W    W +     + + ++R  L    
Subjt:  NENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALL----AQTPSSSSSEALSMW--SAWGIMYALLLAMQIFRELL----

Query:  ------------------MGSEWEFLL---------QSVQANSMLN------LLRDVKDKVDWAKFEELVVVLWVVWCCRNQQTFR--------------
                           G   E  +         +++  NS         +LR+  + +  A FEEL VV+W +W  RN + F               
Subjt:  ------------------MGSEWEFLL---------QSVQANSMLN------LLRDVKDKVDWAKFEELVVVLWVVWCCRNQQTFR--------------

Query:  -------------------GRVPS---------------VNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDL
                           GRV +               +N DA++       G G++I ++ G VM +A    +++++  +AE +A V+G++L  E+ +
Subjt:  -------------------GRVPS---------------VNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDL

Query:  ALVILETDSMRVYSLLHDSAMVDLSEFGVLVLEAR
                           A+ DLSE G +VL+A+
Subjt:  ALVILETDSMRVYSLLHDSAMVDLSEFGVLVLEAR

A0A803PIN0 Uncharacterized protein3.3e-5438.12Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQ-----------------------------------------IQG
        +FFADDSLLF    E    A+   L  Y +ASGQ +N DKSI+SFS +T +  Q Q                                         ++ 
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQ-----------------------------------------IQG

Query:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT
        W  K+F   G+E+LLK++VQ+IP Y  +CF+LP  L   +   M++FWW  ++   +IHW SWK +CK K  GGMGFR    +NQALLAKQ WR    P+
Subjt:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT

Query:  SFISRVLKGRYFPSGDFLDA-------------GWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSE
        S +SR+LK RYFP   FL+A              WN+ LL  +FSS +V  ILTIPL +  + D +IWH    G+Y+V SGY    +A L  +  SS+S 
Subjt:  SFISRVLKGRYFPSGDFLDA-------------GWNENLLRHHFSSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSE

Query:  ALSMWSAWGIMYALLLAMQI
        + + W  W   + L L  +I
Subjt:  ALSMWSAWGIMYALLLAMQI

A0A803QHU6 Uncharacterized protein2.2e-5026.89Show/hide
Query:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQT-----------------------------------------QIQG
        + FADDSLLF +  E  AR++   L  Y +ASGQ +N DKS++SFS +T    QT                                          +  
Subjt:  VFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQT-----------------------------------------QIQG

Query:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT
        W  KIF   G+EVLLK++VQ+IP Y M+CFRL KK    +   M  FWW+ ++   +IHW  WK++CK K  GGMGFR    FNQALLAKQ WRI   P 
Subjt:  WKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPT

Query:  SFISRVLKGRYFPSGDFLDAG---------------------------------------------------------------------WNENLLRHHF
        S +SR+LK R+F +  FLDA                                                                      WN  +L  +F
Subjt:  SFISRVLKGRYFPSGDFLDAG---------------------------------------------------------------------WNENLLRHHF

Query:  SSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWS--------------AWGIMYALL-LAMQIFRE------
           +   ILTIPL      D++IWHF   G+Y+V+SG+ L   +L  Q  SS+S+     W                W +++ +L  A  +F++      
Subjt:  SSCEVCSILTIPLRHISSEDKVIWHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWS--------------AWGIMYALL-LAMQIFRE------

Query:  --LLMGSEWE--------------------FLLQSVQANSMLN--LLRDVKDKVDWAKFEELVVVLWVVWCCRNQQTFRGR-------------------
           L  S WE                    F L   +A +M N   L+++   +    FE L+ +LW +W  RN+    G                    
Subjt:  --LLMGSEWE--------------------FLLQSVQANSMLN--LLRDVKDKVDWAKFEELVVVLWVVWCCRNQQTFRGR-------------------

Query:  -----VP--------------------------------SVNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMD
             VP                                 +NVDAA      ++G G +I D  G+V+ + +        S   E  A+   +   ++  
Subjt:  -----VP--------------------------------SVNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMD

Query:  LALVILETDSMRVYSLLHDSAMVDLSEFGVLVLEAR
        LA+  +ETD++RV + L +S   DLS F  ++++ R
Subjt:  LALVILETDSMRVYSLLHDSAMVDLSEFGVLVLEAR

SwissProt top hitse value%identityAlignment
P93295 Uncharacterized mitochondrial protein AtMg003104.5e-2450Show/hide
Query:  AIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPK-CLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLD
        A+P Y M+CFRL K L + ++ AMT FWW+  +  R+I WV+W+ +CK K   GG+GFRDL  FNQALLAKQ +RI  +P + +SR+L+ RYFP    ++
Subjt:  AIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPK-CLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLD

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein8.8e-2345Show/hide
Query:  AIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLDA
        A+P YTM CF LPK + + I   +  FWW   +E + +HW +W  +   K  GG+GF+D+E FN ALL KQ WR+  RP S +++V K RYF   D L+A
Subjt:  AIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLDA

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.2e-2550Show/hide
Query:  AIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPK-CLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLD
        A+P Y M+CFRL K L + ++ AMT FWW+  +  R+I WV+W+ +CK K   GG+GFRDL  FNQALLAKQ +RI  +P + +SR+L+ RYFP    ++
Subjt:  AIPCYTMNCFRLPKKLIQDISRAMTRFWWNGDKEDRRIHWVSWKTMCKPK-CLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCCTGCTATCTCCCACTTGTTTTTTTTGCTGATGACAGCCTCCTCTTTTTTCGGGTTAAAGAGGGTGAAGCTCGAGCTGTGCATAGCATCCTCCAGTGTTATGAGCG
AGCATCCGGACAAACTATAAATTTTGATAAGTCTATCATCTCCTTTAGTCTGAGCACTGGAATGGGTGTTCAAACTCAGATTCAGGGTTGGAAGGGTAAGATTTTTCTCT
GTAGGGGCAGGGAGGTGTTGCTAAAATCTATTGTGCAGGCTATCCCGTGTTACACTATGAATTGTTTCCGCTTGCCTAAAAAACTGATCCAGGACATTAGTAGGGCAATG
ACGCGATTCTGGTGGAATGGGGATAAGGAAGATAGAAGGATCCATTGGGTGAGTTGGAAGACTATGTGCAAGCCAAAATGTTTGGGTGGAATGGGCTTCAGAGATTTGGA
AACCTTCAACCAAGCTCTCCTAGCCAAACAGTGTTGGAGGATTGATCAGCGTCCTACCTCGTTTATCTCCCGTGTGTTGAAGGGGCGGTATTTTCCTAGTGGAGACTTCC
TCGATGCAGGGTGGAATGAAAATCTTCTCCGACATCACTTTAGTTCTTGCGAGGTATGTTCTATCCTTACTATTCCCTTGCGGCATATTTCGTCTGAGGATAAAGTCATT
TGGCATTTCGAGAAGTGTGGGGTCTATTCGGTCAAGAGTGGGTACCGACTTGGCCAGATGGCTTTGCTTGCTCAAACCCCATCTTCGTCTTCGAGTGAGGCACTGTCTAT
GTGGTCGGCATGGGGAATCATGTATGCACTTCTTTTGGCAATGCAAATTTTTCGTGAATTATTGATGGGATCTGAGTGGGAGTTCCTACTGCAGAGTGTCCAGGCGAACT
CTATGCTTAATCTGCTTAGGGATGTGAAGGACAAGGTCGACTGGGCTAAGTTTGAAGAGCTTGTTGTGGTGCTATGGGTCGTGTGGTGTTGCCGAAACCAACAGACGTTT
AGAGGGCGAGTTCCTTCAGTGAACGTGGATGCTGCATATTGTGAGGGCCTGTCTCGAGTGGGTGCTGGAGTTGTTATTTGGGATGAGGTAGGGCTAGTCATGCTGTCAGC
TGCTGTTAGCCATGATCATGTGGAGAATTCATATTTGGCAGAAGGTCTGGCAATGGTCGATGGTATGAGACTTGTGGTGGAGATGGATTTAGCTTTGGTAATCCTTGAGA
CTGACTCTATGCGGGTTTATTCCCTGCTGCATGATTCTGCGATGGTGGATCTGTCTGAGTTCGGTGTACTGGTTTTAGAGGCACGAAAGGAGGTGGTGGTTGAAGTTTTC
AAAGATTCCACACTTATCCTCAAGAATCCTCTACATCAGGAAAGATCCGACGCGCCGCCGCCTGCAACCCACTCACGAGCTTCCGTCGATCTGCAAGTCGCCGTCGCCAC
TGTAGGCGCGGACAGCAGCGAGGAGGGACCATTTTTCTGCGTTTTTGGCTGTTTTAGCAAGCGTACCCATTCAATTCCGGCGATTCAAGCAGTGGGTCTTCAAGTTTCAG
CGTGTTTGAGCAATTTCGGCTTTGGGTGGATTCGAGAGCTCGGACAGCAAGCTAAGTTCTGCTTTGGTTGCTTGAAGTCTTGGAATTTAAAGGAAAATTGGTTAGAAGCC
GATCAAGGATTTGTTGGGCTAACTTATTTTTCTTTGGAGTTTATCATGCTTGCGTGGACAAGATTTAAGCATCAGTCTTTACTTGTGGAATGTAAATGGGTAGAGGTAAA
GGTGTACAGTACGCGTGACGGAGAGCTGCTGAGCCATGGATCTCCTTTAGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCCTGCTATCTCCCACTTGTTTTTTTTGCTGATGACAGCCTCCTCTTTTTTCGGGTTAAAGAGGGTGAAGCTCGAGCTGTGCATAGCATCCTCCAGTGTTATGAGCG
AGCATCCGGACAAACTATAAATTTTGATAAGTCTATCATCTCCTTTAGTCTGAGCACTGGAATGGGTGTTCAAACTCAGATTCAGGGTTGGAAGGGTAAGATTTTTCTCT
GTAGGGGCAGGGAGGTGTTGCTAAAATCTATTGTGCAGGCTATCCCGTGTTACACTATGAATTGTTTCCGCTTGCCTAAAAAACTGATCCAGGACATTAGTAGGGCAATG
ACGCGATTCTGGTGGAATGGGGATAAGGAAGATAGAAGGATCCATTGGGTGAGTTGGAAGACTATGTGCAAGCCAAAATGTTTGGGTGGAATGGGCTTCAGAGATTTGGA
AACCTTCAACCAAGCTCTCCTAGCCAAACAGTGTTGGAGGATTGATCAGCGTCCTACCTCGTTTATCTCCCGTGTGTTGAAGGGGCGGTATTTTCCTAGTGGAGACTTCC
TCGATGCAGGGTGGAATGAAAATCTTCTCCGACATCACTTTAGTTCTTGCGAGGTATGTTCTATCCTTACTATTCCCTTGCGGCATATTTCGTCTGAGGATAAAGTCATT
TGGCATTTCGAGAAGTGTGGGGTCTATTCGGTCAAGAGTGGGTACCGACTTGGCCAGATGGCTTTGCTTGCTCAAACCCCATCTTCGTCTTCGAGTGAGGCACTGTCTAT
GTGGTCGGCATGGGGAATCATGTATGCACTTCTTTTGGCAATGCAAATTTTTCGTGAATTATTGATGGGATCTGAGTGGGAGTTCCTACTGCAGAGTGTCCAGGCGAACT
CTATGCTTAATCTGCTTAGGGATGTGAAGGACAAGGTCGACTGGGCTAAGTTTGAAGAGCTTGTTGTGGTGCTATGGGTCGTGTGGTGTTGCCGAAACCAACAGACGTTT
AGAGGGCGAGTTCCTTCAGTGAACGTGGATGCTGCATATTGTGAGGGCCTGTCTCGAGTGGGTGCTGGAGTTGTTATTTGGGATGAGGTAGGGCTAGTCATGCTGTCAGC
TGCTGTTAGCCATGATCATGTGGAGAATTCATATTTGGCAGAAGGTCTGGCAATGGTCGATGGTATGAGACTTGTGGTGGAGATGGATTTAGCTTTGGTAATCCTTGAGA
CTGACTCTATGCGGGTTTATTCCCTGCTGCATGATTCTGCGATGGTGGATCTGTCTGAGTTCGGTGTACTGGTTTTAGAGGCACGAAAGGAGGTGGTGGTTGAAGTTTTC
AAAGATTCCACACTTATCCTCAAGAATCCTCTACATCAGGAAAGATCCGACGCGCCGCCGCCTGCAACCCACTCACGAGCTTCCGTCGATCTGCAAGTCGCCGTCGCCAC
TGTAGGCGCGGACAGCAGCGAGGAGGGACCATTTTTCTGCGTTTTTGGCTGTTTTAGCAAGCGTACCCATTCAATTCCGGCGATTCAAGCAGTGGGTCTTCAAGTTTCAG
CGTGTTTGAGCAATTTCGGCTTTGGGTGGATTCGAGAGCTCGGACAGCAAGCTAAGTTCTGCTTTGGTTGCTTGAAGTCTTGGAATTTAAAGGAAAATTGGTTAGAAGCC
GATCAAGGATTTGTTGGGCTAACTTATTTTTCTTTGGAGTTTATCATGCTTGCGTGGACAAGATTTAAGCATCAGTCTTTACTTGTGGAATGTAAATGGGTAGAGGTAAA
GGTGTACAGTACGCGTGACGGAGAGCTGCTGAGCCATGGATCTCCTTTAGGATAG
Protein sequenceShow/hide protein sequence
MSCYLPLVFFADDSLLFFRVKEGEARAVHSILQCYERASGQTINFDKSIISFSLSTGMGVQTQIQGWKGKIFLCRGREVLLKSIVQAIPCYTMNCFRLPKKLIQDISRAM
TRFWWNGDKEDRRIHWVSWKTMCKPKCLGGMGFRDLETFNQALLAKQCWRIDQRPTSFISRVLKGRYFPSGDFLDAGWNENLLRHHFSSCEVCSILTIPLRHISSEDKVI
WHFEKCGVYSVKSGYRLGQMALLAQTPSSSSSEALSMWSAWGIMYALLLAMQIFRELLMGSEWEFLLQSVQANSMLNLLRDVKDKVDWAKFEELVVVLWVVWCCRNQQTF
RGRVPSVNVDAAYCEGLSRVGAGVVIWDEVGLVMLSAAVSHDHVENSYLAEGLAMVDGMRLVVEMDLALVILETDSMRVYSLLHDSAMVDLSEFGVLVLEARKEVVVEVF
KDSTLILKNPLHQERSDAPPPATHSRASVDLQVAVATVGADSSEEGPFFCVFGCFSKRTHSIPAIQAVGLQVSACLSNFGFGWIRELGQQAKFCFGCLKSWNLKENWLEA
DQGFVGLTYFSLEFIMLAWTRFKHQSLLVECKWVEVKVYSTRDGELLSHGSPLG