; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0036530 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0036530
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionRetrovirus-related Pol polyprotein from transposon RE1
Genome locationchr3:47971013..47979751
RNA-Seq ExpressionLag0036530
SyntenyLag0036530
Gene Ontology termsNA
InterPro domainsIPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG8491907.1 hypothetical protein CXB51_015260 [Gossypium anomalum]2.8e-13032.28Show/hide
Query:  EVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQ
        ++LL ++G+ LE  +     IP+  LP          I  PL+    +QD+ L+SWLL ++++++L  +    T  + W  I   + +++  K+   +  
Subjt:  EVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQ

Query:  LQNLKKGGMSMKDYLAKVKNLINSLSAVGHKILDQDHISRPNTP--------------------------------PQNIPSSVFSNSTTNI--------
        L +LKKG +++K+YL KVK L +SL+  G  +  Q+ +    TP                                 Q   SSV SN  T+         
Subjt:  LQNLKKGGMSMKDYLAKVKNLINSLSAVGHKILDQDHISRPNTP--------------------------------PQNIPSSVFSNSTTNI--------

Query:  -----------------------------------------------ATLAPTSGDFSSYNSYYN----------------------------NDANWYP
                                                        TL+  + + +   +Y+                             ND  WYP
Subjt:  -----------------------------------------------ATLAPTSGDFSSYNSYYN----------------------------NDANWYP

Query:  DTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANG
        D+G TNH+T D  N++  S Y G Q + MGNG+ VSI + GS+ + +     +R   L+++L+VP + KNL+SV QFAKDN V+FEFHP  CFVKD    
Subjt:  DTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANG

Query:  QLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNT---------------------WCSD
        + LL G + NGLYKF  S     KS ++S     S    T Q+       V S  +WH+RLGHP +N++  +L +                     W  +
Subjt:  QLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNT---------------------WCSD

Query:  -------------------------------QHKGFKCLDIT--------------------------------------SKRMYVSRHVLFDEFNFPFA
                                       +H+    + +T                                        R+++SRH+ FDE  FPF 
Subjt:  -------------------------------QHKGFKCLDIT--------------------------------------SKRMYVSRHVLFDEFNFPFA

Query:  SSTKLSDVDLSKSS---IVTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSS
        + +       S +    +  LPL+  S PA              + +  +SP + S P S+      SP +  S   +S        ++T++   P  S 
Subjt:  SSTKLSDVDLSKSS---IVTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSS

Query:  NTHPMTTRAKAGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQT
        N HPM TR+K+GIFKP+V  +   + EP +   AL+ P W  A   +Y+A L N T DL+P P+D+K++GCKWI+KVKR++DGS+ARYK RLV KG+ Q 
Subjt:  NTHPMTTRAKAGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQT

Query:  ADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGF
        A +++ ETFSPV+KPTTIR++L L+++ +W + Q+DINNAFL+G L EE++M QPPGF         PLVC+L+KALYGLKQAPRAW+ +L++FL  + F
Subjt:  ADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGF

Query:  VNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIFW----MPTVF
          SKAD+SLF+    +  IY+L+YVDD+II G+ + AI+  +  L   FALKDLG L++FLGIE+  TP  G+F+SQ KYI +L  RA        PT  
Subjt:  VNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIFW----MPTVF

Query:  QLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
          + +   ++   G   +D   YRSIVG LQY+ +TRPDI+
Subjt:  QLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

KYP46257.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan]1.4e-12631.43Show/hide
Query:  IKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLK
        IKGH L  H    P IP +F      D+ +  I +  Y  W QQDQLL SWL  SMS+D+LT+++ C ++ ++WD I   + S   AK  Q + +L++  
Subjt:  IKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLK

Query:  KGGMSMKDYLAKVKNLINSLSAVGHKILDQDHI--------------------------------------SRPNTPPQNIPSSVFSNSTTNIATLAPTS
           +S+ +Y+ +++ L+++L+A+G  +  ++H+                                      SR +   +   +S+  N TT +    P++
Subjt:  KGGMSMKDYLAKVKNLINSLSAVGHKILDQDHI--------------------------------------SRPNTPPQNIPSSVFSNSTTNIATLAPTS

Query:  ---------------------------------------GDFSSYN-------------SYYNNDA---------------------------NWYPDTG
                                               G F+ Y               YY  D                            NWYPD+G
Subjt:  ---------------------------------------GDFSSYN-------------SYYNNDA---------------------------NWYPDTG

Query:  TTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINR--SFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQ
         +NHVTN   N+   + + G  QI +GNG G   LH  ST + +  S IN    F L +LL+VP ITKNLISVSQF+KDN+V+FEFHPH+C VK Q   +
Subjt:  TTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINR--SFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQ

Query:  LLLQGHL-SNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPT-----------------------------------
        +LLQG + S+GLYKF    P      S+ +V       I       + T   SF  WH+RLGHP                                    
Subjt:  LLLQGHL-SNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPT-----------------------------------

Query:  -SNIVFSILN-----------------------------------TWC----------------------------------------------------
         SN+  +I N                                   TW                                                     
Subjt:  -SNIVFSILN-----------------------------------TWC----------------------------------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ---------SDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFA---SSTKLSDVDLSKSSIVTLPLINNSLPASISA----------NSSITLSSSSISNT
                 S  HKG+KCL     R+Y+S+ V+F+E  FP+    SS+K SD  L     +++PL  N  P    A           S   +S S   NT
Subjt:  ---------SDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFA---SSTKLSDVDLSKSSIVTLPLINNSLPASISA----------NSSITLSSSSISNT

Query:  S--TSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFKPRVLLTEYLDN-EPPNFKLALKCPHWIKAMK
        +  TSPS  SIP SS  E  SSP                       + +P    N HPMTTRAK GI KPR+  T  L   EP   K AL  P W  AM+
Subjt:  S--TSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFKPRVLLTEYLDN-EPPNFKLALKCPHWIKAMK

Query:  EDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGL
         +Y+A L+N TW LVP P  +  IGCKW+++VK N +GS+ +YKARLVAKGF+Q    DY+ETFSPVIKP T+R+ILTL+LT+ W + QLD+NNAFL+G 
Subjt:  EDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGL

Query:  LTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTL
        L EEV+M QPPGF     +S   LVC+L KA+YGLKQAPRAW+D+LK+ L    F  SK D SLF+ S  N  IY+L+YVDD+II G++++ ++ L+S L
Subjt:  LTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTL

Query:  HRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
        H  F+LKDLG L+FFLGIEV+  P+G L ++QSKYIR+LL+R  +     +   + S   +S++G E F D   YRS+VG LQY T+TRP+IS
Subjt:  HRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

RVW60229.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]5.5e-12629.28Show/hide
Query:  IKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLK
        ++G+GLE  +     +P    P   TDK    +P P +  + +QD LL SWLL S+    L Q++ C++A E+W+ I+  + S+++AKVM +K+Q+Q LK
Subjt:  IKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLK

Query:  KGGMSMKDYLAKVKNLINSLSAVGHKILDQDHI-------------------SRPNTP------------------------------------------
        K G++M+DYL K+KN  + L+  GHKI D DHI                   S+ ++P                                          
Subjt:  KGGMSMKDYLAKVKNLINSLSAVGHKILDQDHI-------------------SRPNTP------------------------------------------

Query:  -PQNIPSSVFSNSTT-----------------------------------------------------NIATLAPTSG--------------------DF
             PSS F N                                                        N+    PT G                    + 
Subjt:  -PQNIPSSVFSNSTT-----------------------------------------------------NIATLAPTSG--------------------DF

Query:  SSYNSYYNNDAN----------------WYPDTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITK
        + Y++  N D +                W+PD+G TNHVT+DLGN+  G++Y G  +I MGNG+G+ I H G +   SS S  N+   L+++L VP I K
Subjt:  SSYNSYYNNDAN----------------WYPDTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITK

Query:  NLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKF-------------TMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDL
        NL+SVSQFA+DNNV+FEFHP VCFVKD++N  LLLQG+L  GLY+F             ++SN K + +  ++++ H+   +  E+   NS  HV  FDL
Subjt:  NLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKF-------------TMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDL

Query:  WHARLGHPTSNIVFSILN-----------------------------------------------------------------------TWC--------
        WH RLGHP S IV  +LN                                                                       TW         
Subjt:  WHARLGHPTSNIVFSILN-----------------------------------------------------------------------TWC--------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  -----------------------------------------------------SDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTK--LSDVDLSK
                                                             S +HKG+KCL+    RM++SR V+FDE  FPFA   +  +  V  S 
Subjt:  -----------------------------------------------------SDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTK--LSDVDLSK

Query:  SSIVTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVA----------NPRDSSNTHPM
          +  +PL+ N  P S+S   S++L +SS  ++          I S  + +S+   S ++  +++S + P++S    +            P +S NT P+
Subjt:  SSIVTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVA----------NPRDSSNTHPM

Query:  T---------TRAKAGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKG
        T         TR+K GIFKP+V   +    EP  F+ A+  P W +AM E++ A + N TW LV  P ++  +GC+W++K+KRN DGS++RYKARLVAKG
Subjt:  T---------TRAKAGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKG

Query:  FHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLY
        + Q    D+ ETFSPV+KPTTIR++L ++++ +W + QLD+NNAFL+G L EEV+MDQPPGF     +    LVC+L KALYGLKQAPRAW+D+LK  L 
Subjt:  FHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLY

Query:  NSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIE--VRRTPNGGLFMSQSKYIRELLDRAIFWMPT
          GF ++K+D SLF++      ++VL+YVDD+++ GSS+  I++LIS L   F+LKDLG L++FLGIE  +++T   G   ++S             +PT
Subjt:  NSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIE--VRRTPNGGLFMSQSKYIRELLDRAIFWMPT

Query:  VFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
            + S   +S   G+   +   YRS+VG LQY+T+TRP+I+
Subjt:  VFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

RVX03305.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Vitis vinifera]7.2e-12631.38Show/hide
Query:  EVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQ
        ++L T++GH L+  + E   +P+EFL +D  D+T   +  P +  W QQDQL+ SWLL S+++ +LT+M+NC T+ ++W  + + + ++  AKV QFKTQ
Subjt:  EVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQ

Query:  LQNLKKGGMSMKDYLAKVKNLINSLSAVGHKILDQDHI-----------------------------------SRPNTPPQNIPSSVFSN----------
        L N KKG +S+ DYL K++N+++ L+ VGHKI  +DHI                                   ++ +   +NI  + FS           
Subjt:  LQNLKKGGMSMKDYLAKVKNLINSLSAVGHKILDQDHI-----------------------------------SRPNTPPQNIPSSVFSN----------

Query:  -----------STTNIATLAPT---------SGDFSS-----------------------------YNSYYN----------------------------
                   ST N     PT          G+F+                                 YY                             
Subjt:  -----------STTNIATLAPT---------SGDFSS-----------------------------YNSYYN----------------------------

Query:  ------------------NDANWYPDTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVS
                           D NWY D+G T+H+T +L N+   S +    ++ +GNG G+ I H G T   SS    +++  L+ LL+VP+ITKNL+SVS
Subjt:  ------------------NDANWYPDTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVS

Query:  QFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILN-
        +FA DN+VFFEFHP  CFVKD +   +L+   L  GLY F   N +    L +S+   S+ L   E     S T    F LWH RLGHP+S+IV  +LN 
Subjt:  QFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILN-

Query:  ----------------------TWC-SDQHK------------------GFKCLDITS------------------------------------KRMYVS
                              TW    +HK                  G K   + S                                    K  ++ 
Subjt:  ----------------------TWC-SDQHK------------------GFKCLDITS------------------------------------KRMYVS

Query:  RH--VLFDEFNFPFASSTKLSDVDLSKSSIVTLPLINNSLPASI----------------------------------------------------SANS
         H   L  + + PF    +     +   + +  P++ N  P  +                                                    S N 
Subjt:  RH--VLFDEFNFPFASSTKLSDVDLSKSSIVTLPLINNSLPASI----------------------------------------------------SANS

Query:  SITLS---------------------SSSISNTSTS-PSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSS----NTHPMTTRAK
        +I +S                     +SS S++STS P + S+P+     +V     S S +  +     P TS  +V + P  SS     +H M TR+K
Subjt:  SITLS---------------------SSSISNTSTS-PSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSS----NTHPMTTRAK

Query:  AGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFS
         GIFKP+  L   +   P +   AL+  HW + M ++Y A L N TWDLVP P D+K+IGCKW++KVK N +G++ +YKARLVAKGFHQ    D+NETFS
Subjt:  AGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFS

Query:  PVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLF
        PV+KPTTI I+LT++L   WK+ QLD+NNAFL+G L E++FM QP GFI      +   VC+L K+LYGLKQAPRAW+++L   L   GF ++K+D SLF
Subjt:  PVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLF

Query:  MKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLG
        +        Y+L+YVDD++I G++   +  +I+ L+  FALKDLG +++FLGI+V+ T + G+ +SQ+KYI  LL +  +  +  V   + S   +S  G
Subjt:  MKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLG

Query:  GEVFSDPKFYRSIVGGLQYLTLTRPDIS
           FS+ + YRS VG LQY T+TRPDI+
Subjt:  GEVFSDPKFYRSIVGGLQYLTLTRPDIS

XP_012491075.1 PREDICTED: uncharacterized protein LOC105803429 [Gossypium raimondii]4.2e-12631.36Show/hide
Query:  GRASLNRRHFSVFSPGVLGAHYKRLMKNKHLLEVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQ
        G   L  ++F       L  H   L K     ++LL ++G+GLE  +    ++P    P   T      +  P +    +QD+ L+SWLL +++++VL  
Subjt:  GRASLNRRHFSVFSPGVLGAHYKRLMKNKHLLEVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQ

Query:  MLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLINSLSAVGH-----------------KILDQDHISR------PNTPP
        +    T+  IW  I   + + +  K+   +  L ++KK G+++K+YL+KVK L +SL+AVG                  K+  Q ++++       + P 
Subjt:  MLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLINSLSAVGH-----------------KILDQDHISR------PNTPP

Query:  QNIPSSVFSNSTTNIATLAPTSGDFSSY------------------NSYYNNDAN------------------------------------------WYP
        ++  S   + S+    +   T G   S+                  N Y+  D N                                          WYP
Subjt:  QNIPSSVFSNSTTNIATLAPTSGDFSSY------------------NSYYNNDAN------------------------------------------WYP

Query:  DTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANG
        D+G TNHVT D  N+   S Y G  Q+  GNG   SI + GS+ +++     +R   L+ +L+VP + KNL+SV QFAKDN V+FEFHP++CFVKD   G
Subjt:  DTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANG

Query:  QLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIV-------------------------------
        ++LL+GH+ + LY+F  S  K    +  S +  SS    + Q S        +  LWH RLGHP S  +                               
Subjt:  QLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIV-------------------------------

Query:  ------------------------------FSILNTW---------------------------------------------------------------
                                      FS   +W                                                               
Subjt:  ------------------------------FSILNTW---------------------------------------------------------------

Query:  -------------------------CS--------DQH------------------KGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSI
                                 C+         QH                  KGF+CLD    R+Y++RHV FDE  FPF  S        S    
Subjt:  -------------------------CS--------DQH------------------KGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSI

Query:  VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIP--ISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDS-------SNTHPMTTRA
           P +   LP  +S NSS++   + +  TS  PS A+ P    S      SP+ S+        H +P T T S   +  DS       +  HPM TR+
Subjt:  VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIP--ISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDS-------SNTHPMTTRA

Query:  KAGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETF
        K+GIFKP++L +  ++NEP     A + P W +A + +YDA + N TWDL+P PE ++ +GCKWI+K+K+N+DGS+ARYK RLV KG+ Q A ID+ ETF
Subjt:  KAGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETF

Query:  SPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSL
        SPV+KPTT+R +L L+++ NW + Q+DINNAFL+G L+EE++M QPPGF     ++   LVC+L+KALYGLKQAP AW+ +L+ FL +  F  SK D+SL
Subjt:  SPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSL

Query:  FMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIF----WMPTVFQLLWSAPPV
        F+    +  +YVL+YVDD+II G+   AI + +  LH  F LKDLG L +FLG+EVR T + GLF+SQ KYI +LL+RA       +PT   L+ +   +
Subjt:  FMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIF----WMPTVFQLLWSAPPV

Query:  SQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
        S   G +  D   YRSIVG LQY+ +TRPDI+
Subjt:  SQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

TrEMBL top hitse value%identityAlignment
A0A2N9F105 Uncharacterized protein7.7e-13436.49Show/hide
Query:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN
        LPT S  TD  +T  P P+++ W  QDQL+ S L+ S+SE+VL  ++ CTTA+E+W  +A  +TS++ A+ MQ   QL  L+KG +S+ D+  +   L +
Subjt:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN

Query:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS
        +L+A+   + D + +S                 +    P                  QN PS   S +T + A     T     G +S        S N+
Subjt:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS

Query:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG
        +  N                                                           D NWY DTG T+H+T+D GN+ + S +Y G +QIR+G
Subjt:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG

Query:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN
        NG G+SI H G T L +  S    +F+LR++L+VP+ITKNLISV +F KD N   EFHP    VKD+  G+ LLQG   +GLY F           +++N
Subjt:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN

Query:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW----CSDQHKGF--KCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSS
           SS  N +       RT   S   WH+RLGHP   IV ++++ +     S++++ F   CL   SK++            FP +S+   S ++L  S 
Subjt:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW----CSDQHKGF--KCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSS

Query:  I---------------VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRD----
        +               V  P+   +  A +     +T SS + + +S SP+ +  P  S   I+ SP   L    V+    +P     S + NP      
Subjt:  I---------------VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRD----

Query:  -----SSNTHPMTTRAKAGIFK-------------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYK
             S + HPMTTR++  I K             PR LL E     L  EP  F  A++ PHW  AM  ++DA L N TW LVP    + IIGCKW+++
Subjt:  -----SSNTHPMTTRAKAGIFK-------------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYK

Query:  VKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKA
        +KR+++GS+ RYKARLVAKGFHQ   +DY+ETFSPVIKPTT+R++L+++++  W + Q+DI NAFLHG L+EEVFM QPPG+  SHP      +C+L+KA
Subjt:  VKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKA

Query:  LYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMS
        LYGLKQAPRAW+ +L   L   GF  S++DSSLF+    +  +YVLIYVDD+II  S  +AI++L+  L   FA+KDLG LNFFLGIEV R  +G L +S
Subjt:  LYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMS

Query:  QSKYIRELLDR-AIFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
        Q +YI +LL R  +     V   + +   +S L GE  +DP  YRS VG LQYL +TRPDI+
Subjt:  QSKYIRELLDR-AIFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

A0A2N9G872 Uncharacterized protein5.9e-13436.49Show/hide
Query:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN
        LPT S  TD  +T  P P+++ W  QDQL+ S L+ S+SE+VL  ++ CTTA+E+W  +A  +TS++ A+ MQ   QL  L+KG +S+ D+  +   L +
Subjt:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN

Query:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS
        +L+A+   + D + +S                 +    P                  QN PS   S +T + A     T     G +S        S N+
Subjt:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS

Query:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG
        +  N                                                           D NWY DTG T+H+T+D GN+ + S +Y G +QIR+G
Subjt:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG

Query:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN
        NG G+SI H G T L +  S    +F+LR++L+VP+ITKNLISV +F KD N   EFHP    VKD+  G+ LLQG   +GLY F           +++N
Subjt:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN

Query:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW----CSDQHKGF--KCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSS
           SS  N +       RT   S   WH+RLGHP   IV ++++ +     S++++ F   CL   SK++            FP +S+   S ++L  S 
Subjt:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW----CSDQHKGF--KCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSS

Query:  I---------------VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRD----
        +               V  P+   +  A +     +T SS + + +S SP+ +  P  S   I+ SP   L    V+    +P     S + NP      
Subjt:  I---------------VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRD----

Query:  -----SSNTHPMTTRAKAGIFK-------------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYK
             S + HPMTTR++  I K             PR LL E     L  EP  F  A++ PHW  AM  ++DA L N TW LVP    + IIGCKW+++
Subjt:  -----SSNTHPMTTRAKAGIFK-------------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYK

Query:  VKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKA
        +KR+++GS+ RYKARLVAKGFHQ   +DY+ETFSPVIKPTT+R++L+++++  W + Q+DI NAFLHG L+EEVFM QPPG+  SHP      +C+L+KA
Subjt:  VKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKA

Query:  LYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMS
        LYGLKQAPRAW+ +L   L   GF  S++DSSLF+    +  +YVLIYVDD+II  S  +AI++L+  L   FA+KDLG LNFFLGIEV R  +G L +S
Subjt:  LYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMS

Query:  QSKYIRELLDR-AIFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
        Q +YI +LL R  +     V   + +   +S L GE  +DP  YRS VG LQYL +TRPDI+
Subjt:  QSKYIRELLDR-AIFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

A0A2N9H4Q7 Reverse transcriptase Ty1/copia-type domain-containing protein5.7e-13737.11Show/hide
Query:  TEGRASLNRRHFSVFSPGVLGAHYKRLMKNKHLLEVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVL
        T+   S  +  F+  +  + G    RL  +K  +   +T +   + E +K     P +FLPT S+  ++  +P P ++ W QQDQL+ S ++ S++E ++
Subjt:  TEGRASLNRRHFSVFSPGVLGAHYKRLMKNKHLLEVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVL

Query:  TQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLINSLSAVGHKILDQDHISRPNTPPQN-------------------
         Q++  +T++++W A+   ++S + A++MQ   QL  LKK G S+ ++  K KNL ++LSA G  + + + +S   +   +                   
Subjt:  TQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLINSLSAVGHKILDQDHISRPNTPPQN-------------------

Query:  -------IPSSVFSNSTTNIATLAP------TSGDFSSYNSYYNN--DANWYPDTGTTNHVTNDLGNMTIGSD-YAGGQQIRMGNGSGVSILHTGSTYLL
               I      + T+NI  L P      T+    ++ +  ++  D NWYPDT  T+H+T +  N+ + +D Y G  Q+ +GNG G+ I H GS  L 
Subjt:  -------IPSSVFSNSTTNIATLAP------TSGDFSSYNSYYNN--DANWYPDTGTTNHVTNDLGNMTIGSD-YAGGQQIRMGNGSGVSILHTGSTYLL

Query:  SSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKN
           S  + SF L D+L+VPQITKNL+SVS+F  DN+V+FEFH    +VKD  +G +LL+G  ++GL+               S+   ++ L +       
Subjt:  SSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKN

Query:  SRTHVDSFDLWHARLGHPTSNIVFSILNTWCSDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSIVTLPLINNSLPASISANSSIT
         RT   S D WH+RL                   HKG++CL I+S R+Y++ +V+FDE  FPFA+                   ++NS            
Subjt:  SRTHVDSFDLWHARLGHPTSNIVFSILNTWCSDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSIVTLPLINNSLPASISANSSIT

Query:  LSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFKPR-------VLLTE--YLDNEPPN
         SSSSIS++   PS   + + S   + S P   L+L   S    +PT   T     P+ +  +HPM TR+K  +  P+       VLLTE      EP  
Subjt:  LSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFKPR-------VLLTE--YLDNEPPN

Query:  FKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNW
        F LA K P W +AM  ++ A + N TW LVP      ++GCKW++K+K+  DG + RYKARLVAKGFHQ   IDY +TF+P++KPTTIR IL+++++ NW
Subjt:  FKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNW

Query:  KMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMII
         + QLD+ NAFLHG L E VFM QPPGFI  HP+     VC LKKALY LKQA RAW+ RL + L   GFV S++DSSL++ S  +  IY LIYVDD+I+
Subjt:  KMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMII

Query:  RGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYL
           S  AI  LI +L   FALKDLG  ++FLG+E     + GLF++Q KYI +LL RA +    ++   + S+  +SQ  GE FS+P+ Y +IVG LQYL
Subjt:  RGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYL

Query:  TLTRPDIS
        +LTRPD+S
Subjt:  TLTRPDIS

A0A2N9HTS2 Uncharacterized protein5.9e-13436.49Show/hide
Query:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN
        LPT S  TD  +T  P P+++ W  QDQL+ S L+ S+SE+VL  ++ CTTA+E+W  +A  +TS++ A+ MQ   QL  L+KG +S+ D+  +   L +
Subjt:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN

Query:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS
        +L+A+   + D + +S                 +    P                  QN PS   S +T + A     T     G +S        S N+
Subjt:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS

Query:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG
        +  N                                                           D NWY DTG T+H+T+D GN+ + S +Y G +QIR+G
Subjt:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG

Query:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN
        NG G+SI H G T L +  S    +F+LR++L+VP+ITKNLISV +F KD N   EFHP    VKD+  G+ LLQG   +GLY F           +++N
Subjt:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN

Query:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW----CSDQHKGF--KCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSS
           SS  N +       RT   S   WH+RLGHP   IV ++++ +     S++++ F   CL   SK++            FP +S+   S ++L  S 
Subjt:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW----CSDQHKGF--KCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSS

Query:  I---------------VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRD----
        +               V  P+   +  A +     +T SS + + +S SP+ +  P  S   I+ SP   L    V+    +P     S + NP      
Subjt:  I---------------VTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRD----

Query:  -----SSNTHPMTTRAKAGIFK-------------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYK
             S + HPMTTR++  I K             PR LL E     L  EP  F  A++ PHW  AM  ++DA L N TW LVP    + IIGCKW+++
Subjt:  -----SSNTHPMTTRAKAGIFK-------------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYK

Query:  VKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKA
        +KR+++GS+ RYKARLVAKGFHQ   +DY+ETFSPVIKPTT+R++L+++++  W + Q+DI NAFLHG L+EEVFM QPPG+  SHP      +C+L+KA
Subjt:  VKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKA

Query:  LYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMS
        LYGLKQAPRAW+ +L   L   GF  S++DSSLF+    +  +YVLIYVDD+II  S  +AI++L+  L   FA+KDLG LNFFLGIEV R  +G L +S
Subjt:  LYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMS

Query:  QSKYIRELLDR-AIFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
        Q +YI +LL R  +     V   + +   +S L GE  +DP  YRS VG LQYL +TRPDI+
Subjt:  QSKYIRELLDR-AIFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

A0A2N9HX01 Uncharacterized protein6.5e-13336.61Show/hide
Query:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN
        LPT S  TD  +T  P PL++ W  QDQL+ S L+ S+SE+VL  ++ CTTA+E+W  +A  +TS++ A+ MQ   QL  L+KG +S+ D+  +   L +
Subjt:  LPTDS--TDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLIN

Query:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS
        +L+A+   + D + +S                 +    P                  QN PS   S +T + A     T     G +S        S N+
Subjt:  SLSAVGHKILDQDHIS-----------------RPNTPP------------------QNIPSSVFSNSTTNIA-----TLAPTSGDFS--------SYNS

Query:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG
        +  N                                                           D NWY DTG T+H+T+D GN+ + S +Y G +QIR+G
Subjt:  YYNN-----------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGS-DYAGGQQIRMG

Query:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN
        NG G+SI H G T L +  S    +F+LR++L+VP+ITKNLISV +F KD N   EFHP    VKD+  G+ LLQG   +GLY F           +++N
Subjt:  NGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSN

Query:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW--CSDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSIVTL
           SS  N +       RT   S   WH+RLGHP   IV ++++ +  CSD      C + +S                  +S      V    S+I+T 
Subjt:  VDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW--CSDQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSIVTL

Query:  PLINNSLPASISAN---SSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFK---
        P   N    ++S +      ++ S  +++ S SP     P+     I+                 NP+ + +   A P  S + HPMTTR++  I K   
Subjt:  PLINNSLPASISAN---SSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFK---

Query:  ----------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTA
                  PR LL E     L  EP  F  A++ PHW  AM  ++DA L N TW LVP    + IIGCKW++++KR+++GS+ RYKARLVAKGFHQ  
Subjt:  ----------PRVLLTE----YLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTA

Query:  DIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFV
         +DY+ETFSPVIKPTT+R++L+++++  W + Q+DI NAFLHG L+EEVFM QPPG+  SHP      +C+L+KALYGLKQAPRAW+ +L   L   GF 
Subjt:  DIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFV

Query:  NSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDR-AIFWMPTVFQLLW
         S++DSSLF+    +  +YVLIYVDD+II  S  +AI++L+  L   FA+KDLG LNFFLGIEV R  +G L +SQ +YI +LL R  +     V   + 
Subjt:  NSKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDR-AIFWMPTVFQLLW

Query:  SAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
        +   +S L GE  +DP  YRS VG LQYL +TRPDI+
Subjt:  SAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

SwissProt top hitse value%identityAlignment
P04146 Copia protein2.4e-4729.79Show/hide
Query:  GFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDV---DLSKSSIVTLP-----LINNSLPASISANSSIT-LSSSSISNTSTSPSEASIPISSEFEIV
        GFK  D  +++  V+R V+ DE N   + + K   V   D  +S     P     +I    P       +I  L  S  S     P+++   I +EF   
Subjt:  GFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDV---DLSKSSIVTLP-----LINNSLPASISANSSIT-LSSSSISNTSTSPSEASIPISSEFEIV

Query:  SSPQFSLSLAFVSQSHN-----------------------NPTTSTTSVVA---------NPRDSSNTHPMTTRAKAGIFKPR-------------VLLT
        S    ++     S+  N                       NP  S  S  A         NP  +     +  R++    KP+             VL  
Subjt:  SSPQFSLSLAFVSQSHN-----------------------NPTTSTTSVVA---------NPRDSSNTHPMTTRAKAGIFKPR-------------VLLT

Query:  EYLDNEPPN----FKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTT
          + N+ PN     +       W +A+  + +A   N TW +   PE+K I+  +W++ VK N  G+  RYKARLVA+GF Q   IDY ETF+PV + ++
Subjt:  EYLDNEPPN----FKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTT

Query:  IRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFM--KSFM
         R IL+L + YN K+HQ+D+  AFL+G L EE++M  P G      S +   VC+L KA+YGLKQA R W++  +  L    FVNS  D  +++  K  +
Subjt:  IRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFM--KSFM

Query:  NIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIFWMPTVFQLLWSAPPVSQLGGEVFSD
        N +IYVL+YVDD++I       +N+    L   F + DL  +  F+GI +    +  +++SQS Y++++L +  F M     +  S P  S++  E+ + 
Subjt:  NIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIFWMPTVFQLLWSAPPVSQLGGEVFSD

Query:  PKF----YRSIVGGLQYLTL-TRPDIS
         +      RS++G L Y+ L TRPD++
Subjt:  PKF----YRSIVGGLQYLTL-TRPDIS

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-943.1e-5532.24Show/hide
Query:  DQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSIVTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQF
        D+  G++  D   K++  SR V+F E     A+       +    + VT+P  +N+ P S  +      ++  +S     P E    +  + E +     
Subjt:  DQHKGFKCLDITSKRMYVSRHVLFDEFNFPFASSTKLSDVDLSKSSIVTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQF

Query:  SLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFKPRVLLTEYLDNEPPNFKLALKCP---HWIKAMKEDYDAFLNNATWDLVPCPEDKKI
         +      +  + P   +      PR  S  +P T           VL+++  D EP + K  L  P     +KAM+E+ ++   N T+ LV  P+ K+ 
Subjt:  SLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAKAGIFKPRVLLTEYLDNEPPNFKLALKCP---HWIKAMKEDYDAFLNNATWDLVPCPEDKKI

Query:  IGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVP
        + CKW++K+K++ D  L RYKARLV KGF Q   ID++E FSPV+K T+IR IL+L+ + + ++ QLD+  AFLHG L EE++M+QP GF +   +    
Subjt:  IGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVP

Query:  LVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFM-NIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIE-VR
        +VC+L K+LYGLKQAPR WY +  +F+ +  ++ + +D  ++ K F  N  I +L+YVDDM+I G     I  L   L ++F +KDLG     LG++ VR
Subjt:  LVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFM-NIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIE-VR

Query:  RTPNGGLFMSQSKYIRELLDRAIFWMPTVFQLLWSAPPVSQLGGEVFSDPKF---------------YRSIVGGLQY-LTLTRPDIS
           +  L++SQ KYI  +L+R        F +  + P  + L G +    K                Y S VG L Y +  TRPDI+
Subjt:  RTPNGGLFMSQSKYIRELLDRAIFWMPTVFQLLWSAPPVSQLGGEVFSDPKF---------------YRSIVGGLQY-LTLTRPDIS

P92520 Uncharacterized mitochondrial protein AtMg008201.2e-2747.52Show/hide
Query:  MTTRAKAGIFK--PR--VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQT
        M TR+KAGI K  P+  + +T  +  EP +   ALK P W +AM+E+ DA   N TW LVP P ++ I+GCKW++K K +SDG+L R KARLVAKGFHQ 
Subjt:  MTTRAKAGIFK--PR--VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQT

Query:  ADIDYNETFSPVIKPTTIRIILTL--------SLTYNWKMH
          I + ET+SPV++  TIR IL +        S+ + +KMH
Subjt:  ADIDYNETFSPVIKPTTIRIILTL--------SLTYNWKMH

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE11.3e-9026.78Show/hide
Query:  LPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLINSL
        +P  +    + P   P Y++W +QD+L+ S +LG++S  V   +   TTA +IW+ +   Y + +   V Q +TQL+   KG  ++ DY+  +    + L
Subjt:  LPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLINSL

Query:  SAVG---------HKILDQ---------DHISRPNTPPQ---------NIPSSVF-----------SNSTTNIATLAPTSGDFSSYNSYYNN--------
        + +G          ++L+          D I+  +TPP          N  S +            +N+ ++  T    + +  + N+ Y+N        
Subjt:  SAVG---------HKILDQ---------DHISRPNTPPQ---------NIPSSVF-----------SNSTTNIATLAPTSGDFSSYNSYYNN--------

Query:  -----------------------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGSDYAG
                                                                                 NW  D+G T+H+T+D  N+++   Y G
Subjt:  -----------------------------------------------------------------------DANWYPDTGTTNHVTNDLGNMTIGSDYAG

Query:  GQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTM--SNPK
        G  + + +GS + I HTGST L +    +N    L ++LYVP I KNLISV +    N V  EF P    VKD   G  LLQG   + LY++ +  S P 
Subjt:  GQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTM--SNPK

Query:  QDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW-------------CSD-----------------------------
           +  SS   HSS                     WHARLGHP  +I+ S+++ +             CSD                             
Subjt:  QDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW-------------CSD-----------------------------

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------QHK------------------GFKCLDI
                                                                                QHK                   + CL +
Subjt:  ------------------------------------------------------------------------QHK------------------GFKCLDI

Query:  TSKRMYVSRHVLFDEFNFPFAS-STKLSDVDLSK-------SSIVTLPLINNSLPA--------------SISA---NSSITLSS--SSISNTSTSPSEA
         + R+Y+SRHV FDE  FPF++    LS V   +       S   TLP     LPA              S SA   NS ++ S+  SS S++  S  E 
Subjt:  TSKRMYVSRHVLFDEFNFPFAS-STKLSDVDLSK-------SSIVTLPLINNSLPA--------------SISA---NSSITLSS--SSISNTSTSPSEA

Query:  SIPISSEFEIVSSP------------------------QFSLSLAFVSQSHN---NPTTSTTSVVANPRDSS---------------------NTHPMTT
        + P  +  +  + P                        Q + SL+  +QS +   +PTTS +S   +P   S                     NTH M T
Subjt:  SIPISSEFEIVSSP------------------------QFSLSLAFVSQSHN---NPTTSTTSVVANPRDSS---------------------NTHPMTT

Query:  RAKAGIFKPR----VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDK-KIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTAD
        RAKAGI KP     + ++   ++EP     ALK   W  AM  + +A + N TWDLVP P     I+GC+WI+  K NSDGSL RYKARLVAKG++Q   
Subjt:  RAKAGIFKPR----VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDK-KIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTAD

Query:  IDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVN
        +DY ETFSPVIK T+IRI+L +++  +W + QLD+NNAFL G LT++V+M QPPGFI          VC+L+KALYGLKQAPRAWY  L+N+L   GFVN
Subjt:  IDYNETFSPVIKPTTIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVN

Query:  SKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWS
        S +D+SLF+       +Y+L+YVDD++I G+    +++ +  L + F++KD   L++FLGIE +R P  GL +SQ +YI +LL R  +     V   +  
Subjt:  SKADSSLFMKSFMNIHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWS

Query:  APPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS
        +P +S   G   +DP  YR IVG LQYL  TRPDIS
Subjt:  APPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS

Q9ZT94 Retrovirus-related Pol polyprotein from transposon RE23.2e-8426.43Show/hide
Query:  LPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQ--FKTQLQNLKKGGMSMKDYLAKVKNLIN
        +P  +    + P   P Y++W +QD+L+ S +LG++S  V   +   TTA +IW+ +   Y + +   V Q  F T+   L   G  M D+  +V+ ++ 
Subjt:  LPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMSEDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQ--FKTQLQNLKKGGMSMKDYLAKVKNLIN

Query:  SLSAVGHKILDQDHISRPNTPPQ------------------------NIPSSVFSNSTTNIATLAPTSGDFSSYNSYYNNDANWYP--------------
        +L      ++DQ  I+  +TPP                          I ++V ++  TN        GD  +YN+  N   +W P              
Subjt:  SLSAVGHKILDQDHISRPNTPPQ------------------------NIPSSVFSNSTTNIATLAPTSGDFSSYNSYYNNDANWYP--------------

Query:  -----------------------------------------------------------DTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTG
                                                                   D+G T+H+T+D  N++    Y GG  + + +GS + I HTG
Subjt:  -----------------------------------------------------------DTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTG

Query:  STYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSS--SNVDHSSELNI
        S  L +S    +RS  L  +LYVP I KNLISV +    N V  EF P    VKD   G  LLQG   + LY++ +++ +     +S  S   HSS    
Subjt:  STYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQANGQLLLQGHLSNGLYKFTMSNPKQDKSLSS--SNVDHSSELNI

Query:  TEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW-------------CSD-----------------------------------------------
                         WH+RLGHP+  I+ S+++               CSD                                               
Subjt:  TEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTW-------------CSD-----------------------------------------------

Query:  ----------------------------------------------------------------------QHKG--------------------------
                                                                              +H G                          
Subjt:  ----------------------------------------------------------------------QHKG--------------------------

Query:  ----------------------------------------------------------------------------FKCLDITSKRMYVSRHVLFDEFNF
                                                                                    + CL I + R+Y SRHV FDE  F
Subjt:  ----------------------------------------------------------------------------FKCLDITSKRMYVSRHVLFDEFNF

Query:  PFA--------SSTKLSDVDLSKSSIVTLPLINNSLPA---------------------SISANSSITLSSSSISNTSTS---------PSEASIPISSE
        PF+        S  + SD   +  S  TLP     LPA                       +  SS  L SSSIS+ S+S         P   + P  ++
Subjt:  PFA--------SSTKLSDVDLSKSSIVTLPLINNSLPA---------------------SISANSSITLSSSSISNTSTS---------PSEASIPISSE

Query:  FEIVSSPQFS-----------------LSLAFVSQSH-----------NNPTTSTTS-------------VVANPRDSSNTHPMTTRAKAGIFKPR----
            +SP  +                 L  + +S  H           N+P++S+TS             +  N +   NTH M TRAK GI KP     
Subjt:  FEIVSSPQFS-----------------LSLAFVSQSH-----------NNPTTSTTS-------------VVANPRDSSNTHPMTTRAKAGIFKPR----

Query:  VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDK-KIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPT
           +   ++EP     A+K   W +AM  + +A + N TWDLVP P     I+GC+WI+  K NSDGSL RYKARLVAKG++Q   +DY ETFSPVIK T
Subjt:  VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDK-KIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPT

Query:  TIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMN
        +IRI+L +++  +W + QLD+NNAFL G LT+EV+M QPPGF+          VCRL+KA+YGLKQAPRAWY  L+ +L   GFVNS +D+SLF+     
Subjt:  TIRIILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMN

Query:  IHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLGGEVFSD
          IY+L+YVDD++I G+ T  +   +  L + F++K+   L++FLGIE +R P  GL +SQ +Y  +LL R  +     V   + ++P ++   G    D
Subjt:  IHIYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA-IFWMPTVFQLLWSAPPVSQLGGEVFSD

Query:  PKFYRSIVGGLQYLTLTRPDIS
        P  YR IVG LQYL  TRPD+S
Subjt:  PKFYRSIVGGLQYLTLTRPDIS

Arabidopsis top hitse value%identityAlignment
AT2G24100.1 unknown protein2.6e-2539.69Show/hide
Query:  PTLFLRIGSWQAIPKNEGDLVLKFDYRTKKMAWEIVREGPSKHKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPQKHSKWADESDFTEG
        P   LRIG W+   + EGDLV K  +   K+ WE++ +G  K KIEI WS+I+ ++A + +   G L + L + P F++E   +P+KH+ W   SDFT+G
Subjt:  PTLFLRIGSWQAIPKNEGDLVLKFDYRTKKMAWEIVREGPSKHKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIESKPQKHSKWADESDFTEG

Query:  RASLNRRHFSVFSPGVLGAHYKRLMKNKHLL
        +AS+NR+HF    PG++  H+++L++  H L
Subjt:  RASLNRRHFSVFSPGVLGAHYKRLMKNKHLL

AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 81.9e-6843.95Show/hide
Query:  EPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSL
        EP  +  A +   W  AM ++  A     TW++   P +KK IGCKW+YK+K NSDG++ RYKARLVAKG+ Q   ID+ ETFSPV K T++++IL +S 
Subjt:  EPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRIILTLSL

Query:  TYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVP-LVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYV
         YN+ +HQLDI+NAFL+G L EE++M  PPG+      S  P  VC LKK++YGLKQA R W+ +    L   GFV S +D + F+K    + + VL+YV
Subjt:  TYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVP-LVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYV

Query:  DDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA--IFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIV
        DD+II  ++  A+++L S L   F L+DLG L +FLG+E+ R+   G+ + Q KY  +LLD    +   P+   +  S    +  GG+ F D K YR ++
Subjt:  DDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRA--IFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIV

Query:  GGLQYLTLTRPDIS
        G L YL +TR DIS
Subjt:  GGLQYLTLTRPDIS

AT4G30780.1 unknown protein8.4e-2436.91Show/hide
Query:  GPSSSSSSELPNLVPNAAPTLFLRIGSWQAIPKNEGDLVLKFDYRTKKMAWEIVREGPSKHKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIE
        GP+ +  S +  L  +  P   L+IG W+   + EGDLV K  +   K+ WE++ +G  K KIEI WS+I+ ++A   +   G L L L + P F++E  
Subjt:  GPSSSSSSELPNLVPNAAPTLFLRIGSWQAIPKNEGDLVLKFDYRTKKMAWEIVREGPSKHKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIE

Query:  SKPQKHSKWADESDFTEGRASLNRRHFSVFSPGVLGAHYKRLMKNKHLL
         +P+KH+ W   SDFT+G+AS+NR+HF   + G++  H+++L++  H L
Subjt:  SKPQKHSKWADESDFTEGRASLNRRHFSVFSPGVLGAHYKRLMKNKHLL

ATMG00810.1 DNA/RNA polymerases superfamily protein3.0e-2145.53Show/hide
Query:  IYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIFW----MPTVFQLLWSAPPVSQLGGEVFS
        +Y+L+YVDD+++ GSS   +N LI  L  TF++KDLG +++FLGI+++  P+ GLF+SQ+KY  ++L+ A       M T   L  +    S +    + 
Subjt:  IYVLIYVDDMIIRGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIFW----MPTVFQLLWSAPPVSQLGGEVFS

Query:  DPKFYRSIVGGLQYLTLTRPDIS
        DP  +RSIVG LQYLTLTRPDIS
Subjt:  DPKFYRSIVGGLQYLTLTRPDIS

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase)8.7e-2947.52Show/hide
Query:  MTTRAKAGIFK--PR--VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQT
        M TR+KAGI K  P+  + +T  +  EP +   ALK P W +AM+E+ DA   N TW LVP P ++ I+GCKW++K K +SDG+L R KARLVAKGFHQ 
Subjt:  MTTRAKAGIFK--PR--VLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQT

Query:  ADIDYNETFSPVIKPTTIRIILTL--------SLTYNWKMH
          I + ET+SPV++  TIR IL +        S+ + +KMH
Subjt:  ADIDYNETFSPVIKPTTIRIILTL--------SLTYNWKMH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGATAAGAAAAATGAAGAAGAGAGTGAGAAGAAATTGTATATTACTTCAGAGAATGGTGAATCTTCCAGTTTCTCCATAATCTCCGGAGCCGATCAGCTTGATCATTT
TTCCAAAACTGGAATCCTCCCTCAGGGTGAAGAAACAATTGAAGTAGTTCTAAAACTTCCAGCACTTCGCCTTTCTTCTCTACTAGCAGCAATGGAATTAGAAGAGAATA
CAAACATTTCAATTCCAGCTCCATTCTCAAACCTCCCAAGTATCTTAACTGGTCGTTCACTGATCCCTCTCCCTCCCACAATTGAAGAACAACAACAGCAATACAATACT
CCATCAAATGAAAGAACAATTACCTCTGGACCAAGCAGCTCCTCTTCTTCCGAGCTCCCCAATCTCGTCCCTAATGCCGCTCCAACCCTTTTCCTTCGCATCGGCTCTTG
GCAGGCAATACCCAAAAATGAAGGTGATTTGGTGTTGAAATTTGATTATAGGACCAAGAAAATGGCTTGGGAGATTGTGAGGGAGGGGCCTTCCAAGCATAAGATTGAAA
TTGATTGGTCTAATATCATAGGGATTGAAGCTGCTATTGAAGATCATAGACAAGGAATCCTTCAACTAGAGCTGCAAAAACCACCAAGATTTTACAAGGAGATTGAATCC
AAACCACAGAAGCATTCCAAGTGGGCAGATGAATCAGATTTTACAGAAGGAAGAGCCTCTTTAAACAGGAGACACTTTTCTGTGTTTTCACCAGGAGTGCTTGGTGCACA
TTACAAGAGACTAATGAAAAACAAGCATTTGTTGGAAGTTCTTCTGACAATCAAGGGTCATGGATTGGAGGAACACATTAAAGAAGAACCCAATATTCCAAATGAATTTT
TGCCTACTGACTCTACCGATAAAACCTCCACTCCAATTCCAAAGCCCCTATATTCGAAATGGGTGCAGCAAGATCAACTTCTCTCTTCTTGGTTGCTCGGGTCAATGTCC
GAAGACGTTCTCACACAAATGTTAAATTGCACAACGGCTAAAGAGATTTGGGATGCCATAGCTATCACTTATACGTCAAGAAACACTGCCAAAGTAATGCAATTCAAAAC
GCAACTCCAAAACTTAAAAAAGGGAGGTATGTCTATGAAAGATTATTTGGCCAAGGTCAAGAATCTAATCAATTCTCTTTCTGCTGTGGGTCATAAAATCTTAGATCAAG
ATCATATTTCTCGCCCAAATACTCCCCCACAAAACATTCCATCATCTGTTTTTTCAAATAGTACGACCAACATTGCTACTTTAGCTCCTACTTCTGGTGATTTTTCTTCT
TACAATTCTTATTACAACAATGATGCAAATTGGTATCCGGATACTGGCACCACCAATCATGTGACGAATGATTTGGGTAACATGACTATTGGATCAGATTATGCAGGTGG
TCAACAAATCAGGATGGGTAATGGTTCAGGTGTGTCCATTCTCCATACTGGATCTACTTATCTTTTATCATCTGATTCTCATATTAATAGATCTTTTGTCTTACGTGATT
TACTATATGTTCCTCAAATTACTAAAAATTTAATTAGCGTTAGTCAATTCGCTAAAGATAATAACGTGTTTTTTGAATTTCATCCTCATGTTTGCTTTGTGAAGGATCAA
GCCAATGGTCAACTTCTTCTTCAAGGCCATCTCAGTAATGGACTATATAAGTTCACCATGAGTAATCCTAAGCAAGACAAATCCCTTTCTTCTAGCAATGTTGATCATTC
ATCTGAACTCAATATTACTGAACAGATTTCGAAGAATTCTAGAACTCATGTTGATTCTTTTGATTTATGGCATGCTAGATTAGGCCATCCTACTTCTAATATTGTGTTTT
CCATTCTTAATACTTGGTGCAGTGACCAACATAAGGGTTTTAAGTGTCTTGATATTACTTCTAAACGTATGTATGTATCTAGACACGTTTTATTTGATGAATTTAATTTT
CCTTTTGCAAGTTCTACTAAACTAAGTGATGTGGATTTATCTAAGTCTTCTATTGTGACTCTTCCTTTAATAAACAATTCCCTACCTGCTTCTATTAGTGCCAATTCTTC
TATTACCTTGTCTAGTTCTTCTATTTCCAACACATCTACTAGTCCTTCTGAAGCCTCTATTCCTATTTCCTCTGAATTTGAAATAGTTTCTTCACCTCAGTTTTCTTTAT
CTTTAGCTTTTGTCTCTCAATCTCATAACAACCCAACCACATCTACTACTTCTGTTGTTGCAAATCCTAGGGATTCTTCCAACACTCATCCAATGACAACAAGAGCAAAA
GCTGGAATTTTTAAACCTCGTGTTCTTCTTACAGAATATTTAGACAATGAGCCTCCAAATTTCAAATTAGCTTTAAAATGCCCTCATTGGATTAAAGCAATGAAGGAAGA
TTATGATGCTTTTTTAAATAATGCAACATGGGATCTTGTTCCATGTCCTGAAGATAAGAAGATTATAGGTTGTAAATGGATTTATAAGGTTAAAAGGAACTCTGATGGCT
CTTTAGCTAGGTACAAAGCTCGTTTAGTTGCCAAGGGATTTCACCAAACTGCTGACATTGATTATAATGAGACATTTAGTCCTGTTATAAAACCTACCACTATACGTATT
ATTCTCACTCTTTCTTTGACTTATAATTGGAAGATGCATCAACTTGACATTAATAATGCATTTCTTCATGGTTTGTTAACAGAGGAAGTATTCATGGATCAACCTCCTGG
GTTTATCATTTCTCATCCTTCTTCTTCTGTTCCCTTGGTTTGTAGATTAAAAAAGGCTTTGTATGGTCTTAAACAGGCTCCTCGTGCTTGGTATGATAGACTTAAGAACT
TTCTGTATAATTCTGGTTTTGTAAATTCCAAGGCTGATTCTTCTTTATTCATGAAATCTTTCATGAACATTCATATTTATGTTTTGATTTATGTAGATGATATGATTATA
CGTGGTTCTTCTACTAATGCCATTAATGATTTAATCTCTACTTTGCATAGGACTTTTGCTTTAAAGGACTTAGGTTCCTTAAACTTTTTTCTTGGTATAGAAGTTCGTCG
TACTCCTAATGGTGGATTATTTATGTCTCAATCTAAATATATTCGTGAATTACTTGATAGAGCTATTTTTTGGATGCCAACAGTATTTCAACTCCTATGGTCAGCTCCCC
CAGTCTCTCAGTTAGGGGGTGAAGTATTTTCTGATCCTAAATTTTATAGGAGTATAGTTGGTGGTCTTCAATATCTAACTCTCACTCGTCCTGATATTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGATAAGAAAAATGAAGAAGAGAGTGAGAAGAAATTGTATATTACTTCAGAGAATGGTGAATCTTCCAGTTTCTCCATAATCTCCGGAGCCGATCAGCTTGATCATTT
TTCCAAAACTGGAATCCTCCCTCAGGGTGAAGAAACAATTGAAGTAGTTCTAAAACTTCCAGCACTTCGCCTTTCTTCTCTACTAGCAGCAATGGAATTAGAAGAGAATA
CAAACATTTCAATTCCAGCTCCATTCTCAAACCTCCCAAGTATCTTAACTGGTCGTTCACTGATCCCTCTCCCTCCCACAATTGAAGAACAACAACAGCAATACAATACT
CCATCAAATGAAAGAACAATTACCTCTGGACCAAGCAGCTCCTCTTCTTCCGAGCTCCCCAATCTCGTCCCTAATGCCGCTCCAACCCTTTTCCTTCGCATCGGCTCTTG
GCAGGCAATACCCAAAAATGAAGGTGATTTGGTGTTGAAATTTGATTATAGGACCAAGAAAATGGCTTGGGAGATTGTGAGGGAGGGGCCTTCCAAGCATAAGATTGAAA
TTGATTGGTCTAATATCATAGGGATTGAAGCTGCTATTGAAGATCATAGACAAGGAATCCTTCAACTAGAGCTGCAAAAACCACCAAGATTTTACAAGGAGATTGAATCC
AAACCACAGAAGCATTCCAAGTGGGCAGATGAATCAGATTTTACAGAAGGAAGAGCCTCTTTAAACAGGAGACACTTTTCTGTGTTTTCACCAGGAGTGCTTGGTGCACA
TTACAAGAGACTAATGAAAAACAAGCATTTGTTGGAAGTTCTTCTGACAATCAAGGGTCATGGATTGGAGGAACACATTAAAGAAGAACCCAATATTCCAAATGAATTTT
TGCCTACTGACTCTACCGATAAAACCTCCACTCCAATTCCAAAGCCCCTATATTCGAAATGGGTGCAGCAAGATCAACTTCTCTCTTCTTGGTTGCTCGGGTCAATGTCC
GAAGACGTTCTCACACAAATGTTAAATTGCACAACGGCTAAAGAGATTTGGGATGCCATAGCTATCACTTATACGTCAAGAAACACTGCCAAAGTAATGCAATTCAAAAC
GCAACTCCAAAACTTAAAAAAGGGAGGTATGTCTATGAAAGATTATTTGGCCAAGGTCAAGAATCTAATCAATTCTCTTTCTGCTGTGGGTCATAAAATCTTAGATCAAG
ATCATATTTCTCGCCCAAATACTCCCCCACAAAACATTCCATCATCTGTTTTTTCAAATAGTACGACCAACATTGCTACTTTAGCTCCTACTTCTGGTGATTTTTCTTCT
TACAATTCTTATTACAACAATGATGCAAATTGGTATCCGGATACTGGCACCACCAATCATGTGACGAATGATTTGGGTAACATGACTATTGGATCAGATTATGCAGGTGG
TCAACAAATCAGGATGGGTAATGGTTCAGGTGTGTCCATTCTCCATACTGGATCTACTTATCTTTTATCATCTGATTCTCATATTAATAGATCTTTTGTCTTACGTGATT
TACTATATGTTCCTCAAATTACTAAAAATTTAATTAGCGTTAGTCAATTCGCTAAAGATAATAACGTGTTTTTTGAATTTCATCCTCATGTTTGCTTTGTGAAGGATCAA
GCCAATGGTCAACTTCTTCTTCAAGGCCATCTCAGTAATGGACTATATAAGTTCACCATGAGTAATCCTAAGCAAGACAAATCCCTTTCTTCTAGCAATGTTGATCATTC
ATCTGAACTCAATATTACTGAACAGATTTCGAAGAATTCTAGAACTCATGTTGATTCTTTTGATTTATGGCATGCTAGATTAGGCCATCCTACTTCTAATATTGTGTTTT
CCATTCTTAATACTTGGTGCAGTGACCAACATAAGGGTTTTAAGTGTCTTGATATTACTTCTAAACGTATGTATGTATCTAGACACGTTTTATTTGATGAATTTAATTTT
CCTTTTGCAAGTTCTACTAAACTAAGTGATGTGGATTTATCTAAGTCTTCTATTGTGACTCTTCCTTTAATAAACAATTCCCTACCTGCTTCTATTAGTGCCAATTCTTC
TATTACCTTGTCTAGTTCTTCTATTTCCAACACATCTACTAGTCCTTCTGAAGCCTCTATTCCTATTTCCTCTGAATTTGAAATAGTTTCTTCACCTCAGTTTTCTTTAT
CTTTAGCTTTTGTCTCTCAATCTCATAACAACCCAACCACATCTACTACTTCTGTTGTTGCAAATCCTAGGGATTCTTCCAACACTCATCCAATGACAACAAGAGCAAAA
GCTGGAATTTTTAAACCTCGTGTTCTTCTTACAGAATATTTAGACAATGAGCCTCCAAATTTCAAATTAGCTTTAAAATGCCCTCATTGGATTAAAGCAATGAAGGAAGA
TTATGATGCTTTTTTAAATAATGCAACATGGGATCTTGTTCCATGTCCTGAAGATAAGAAGATTATAGGTTGTAAATGGATTTATAAGGTTAAAAGGAACTCTGATGGCT
CTTTAGCTAGGTACAAAGCTCGTTTAGTTGCCAAGGGATTTCACCAAACTGCTGACATTGATTATAATGAGACATTTAGTCCTGTTATAAAACCTACCACTATACGTATT
ATTCTCACTCTTTCTTTGACTTATAATTGGAAGATGCATCAACTTGACATTAATAATGCATTTCTTCATGGTTTGTTAACAGAGGAAGTATTCATGGATCAACCTCCTGG
GTTTATCATTTCTCATCCTTCTTCTTCTGTTCCCTTGGTTTGTAGATTAAAAAAGGCTTTGTATGGTCTTAAACAGGCTCCTCGTGCTTGGTATGATAGACTTAAGAACT
TTCTGTATAATTCTGGTTTTGTAAATTCCAAGGCTGATTCTTCTTTATTCATGAAATCTTTCATGAACATTCATATTTATGTTTTGATTTATGTAGATGATATGATTATA
CGTGGTTCTTCTACTAATGCCATTAATGATTTAATCTCTACTTTGCATAGGACTTTTGCTTTAAAGGACTTAGGTTCCTTAAACTTTTTTCTTGGTATAGAAGTTCGTCG
TACTCCTAATGGTGGATTATTTATGTCTCAATCTAAATATATTCGTGAATTACTTGATAGAGCTATTTTTTGGATGCCAACAGTATTTCAACTCCTATGGTCAGCTCCCC
CAGTCTCTCAGTTAGGGGGTGAAGTATTTTCTGATCCTAAATTTTATAGGAGTATAGTTGGTGGTCTTCAATATCTAACTCTCACTCGTCCTGATATTTCTTAA
Protein sequenceShow/hide protein sequence
MDKKNEEESEKKLYITSENGESSSFSIISGADQLDHFSKTGILPQGEETIEVVLKLPALRLSSLLAAMELEENTNISIPAPFSNLPSILTGRSLIPLPPTIEEQQQQYNT
PSNERTITSGPSSSSSSELPNLVPNAAPTLFLRIGSWQAIPKNEGDLVLKFDYRTKKMAWEIVREGPSKHKIEIDWSNIIGIEAAIEDHRQGILQLELQKPPRFYKEIES
KPQKHSKWADESDFTEGRASLNRRHFSVFSPGVLGAHYKRLMKNKHLLEVLLTIKGHGLEEHIKEEPNIPNEFLPTDSTDKTSTPIPKPLYSKWVQQDQLLSSWLLGSMS
EDVLTQMLNCTTAKEIWDAIAITYTSRNTAKVMQFKTQLQNLKKGGMSMKDYLAKVKNLINSLSAVGHKILDQDHISRPNTPPQNIPSSVFSNSTTNIATLAPTSGDFSS
YNSYYNNDANWYPDTGTTNHVTNDLGNMTIGSDYAGGQQIRMGNGSGVSILHTGSTYLLSSDSHINRSFVLRDLLYVPQITKNLISVSQFAKDNNVFFEFHPHVCFVKDQ
ANGQLLLQGHLSNGLYKFTMSNPKQDKSLSSSNVDHSSELNITEQISKNSRTHVDSFDLWHARLGHPTSNIVFSILNTWCSDQHKGFKCLDITSKRMYVSRHVLFDEFNF
PFASSTKLSDVDLSKSSIVTLPLINNSLPASISANSSITLSSSSISNTSTSPSEASIPISSEFEIVSSPQFSLSLAFVSQSHNNPTTSTTSVVANPRDSSNTHPMTTRAK
AGIFKPRVLLTEYLDNEPPNFKLALKCPHWIKAMKEDYDAFLNNATWDLVPCPEDKKIIGCKWIYKVKRNSDGSLARYKARLVAKGFHQTADIDYNETFSPVIKPTTIRI
ILTLSLTYNWKMHQLDINNAFLHGLLTEEVFMDQPPGFIISHPSSSVPLVCRLKKALYGLKQAPRAWYDRLKNFLYNSGFVNSKADSSLFMKSFMNIHIYVLIYVDDMII
RGSSTNAINDLISTLHRTFALKDLGSLNFFLGIEVRRTPNGGLFMSQSKYIRELLDRAIFWMPTVFQLLWSAPPVSQLGGEVFSDPKFYRSIVGGLQYLTLTRPDIS