; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0014059 (gene) of Snake gourd v1 genome

Gene IDTan0014059
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionTAF RNA polymerase I subunit A
Genome locationLG01:8205587..8216182
RNA-Seq ExpressionTan0014059
SyntenyTan0014059
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6600392.1 hypothetical protein SDJN03_05625, partial [Cucurbita argyrosperma subsp. sororia]5.1e-18680.68Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        M D LV+EAEYGS R T RKRK DTAADG+NDG+R  AM++ITL+LTKPSFVLG+ PKM+RAENR TLRNVL KLMMQ NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI NRLKY+ SMELLKH+E DR RP+RIKH YDNWM KIGSMKRWP++D+FMV VEFILFCLEEG+TEDA+Q ALCLMQ+HESVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFSTLPEEIQW+ SLQ+HSPI+SDR+I N +GCSVSNS GDGASYQS+S TSVM+ KL+HVDS+G     FE DH IK+EN PQ FE  DFYA S EKDE
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEASFSDNG YQH VSIFSALEGLDPLLLPLHLPSS+ENWENA+SLC EFLN+YYKDAVKHL+LALNSNPP+L ALLP IQLLLIGGRVDKAL+EMENIC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
        RDSNA LPF
Subjt:  RDSNAALPF

KAG7031054.1 hypothetical protein SDJN02_05093 [Cucurbita argyrosperma subsp. argyrosperma]5.1e-18680.68Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        M D LV+EAEYGS R T RKRK DTAADG+NDG+R  AM++ITL+LTKPSFVLG+ PKM+RAENR TLRNVL KLMMQ NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI NRLKY+ SMELLKH+E DR RP+RIKH YDNWM KIGSMKRWP++D+FMV VEFILFCLEEG+TEDA+Q ALCLMQ+HESVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFSTLPEEIQW+ SLQ+HSPI+SDR+I N +GCSVSNS GDGASYQS+S TSVM+ KL+HVDS+G     FE DH IK+EN PQ FE  DFYA S EKDE
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEASFSDNG YQH VSIFSALEGLDPLLLPLHLPSS+ENWENA+SLC EFLN+YYKDAVKHL+LALNSNPP+L ALLP IQLLLIGGRVDKAL+EMENIC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
        RDSNA LPF
Subjt:  RDSNAALPF

XP_022142927.1 uncharacterized protein LOC111012919 [Momordica charantia]7.9e-18780.93Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        MAD  V+EAEYG N+P +RKRKAD  ADG +DG+RA  M+R+TLSLTKPSFV+GL PKMVR ENR+TLRNVL KL+ Q NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI+NRLKY ASMELLKH+E DR RP+RIKH YDNWM KIGSMK WPI+D+FMV VEFILFCLEEGNTEDA+Q ALCLMQ+H+SVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFST+PEEIQW+ SLQFHSPIQ DR+I N  GCSVSNSHGDGA YQSNS TSVMNDKLVHVDS+G      EVD ++K+ENHPQNFEA DFY  S EK+E
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEAS SDNGGYQHYVSIFSALEGLDPLLLPLHLP SI+NWENAISLC EFLN YYKDAVKHLDLALNSNPP+L ALLPLIQLLLIGGRVDKAL E+E IC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
         DSNAALPF
Subjt:  RDSNAALPF

XP_022941583.1 uncharacterized protein LOC111446895 [Cucurbita moschata]1.5e-18580.44Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        + D LV+EAEYGS R T RKRK DTAADG+NDG+R  AM++ITL+LTKPSFVLG+ PKM+RAENR TLRNVL KLMMQ NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI NRLKY+ SMELLKH+E DR RP+RIKH YDNWM KIGSMKRWP++D+FMV VEFILFCLEEG+TEDA+Q ALCLMQ+HESVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFSTLPEEIQW+ SLQ+HSPI+SDR+I N +GCSVSNS GDGASYQS+S TSVM+ KL+HVDS+G     FE DH IK+EN PQ FE  DFYA S EKDE
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEASFSDNG YQH VSIFSALEGLDPLLLPLHLPSS+ENWENA+SLC EFLN+YYKDAVKHL+LALNSNPP+L ALLP IQLLLIGGRVDKAL+EMENIC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
        RDSNA LPF
Subjt:  RDSNAALPF

XP_022979683.1 uncharacterized protein LOC111479331 [Cucurbita maxima]1.3e-18479.95Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        M D LV+EAE+GS R T RKRK DT ADG+NDG+R  AM++ITL+LTKPSFVLG+ PKM+RAENR TLRNVL KLMMQ NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI NRLKY+ SMELLKH+E DR RP+RIKH YDNWM KIGSMKRWP++D+FMV VEFILFCLEEG+TEDA+Q ALCLMQ+HESVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFSTLPEEIQW+ SLQ+HSPI+SDR+I N +GCSVSNS GDGASYQS+S TSVM+ KL+HVDS+G     FE DH IK+ENHPQ FE  DFY  S EKDE
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEASFSDNGGYQH VSIFSALEGLDPLLLPLHLP S+ENWENA+SLC EFLN+YYKDAVKHL+LALNSNPP+L ALLP IQLLLIGGRVDKAL+EMENIC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
         DSNA LPF
Subjt:  RDSNAALPF

TrEMBL top hitse value%identityAlignment
A0A1S3BS63 uncharacterized protein LOC1034929167.7e-17276.46Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        MAD  V+E EYGS  P SRKRKAD  ADGNND +RA  M+RI LSLTKPSFVLGLAPKMVRAENRITLRN LHKLM Q NWVEA GVLSMLL+GTLRD S
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI NRLKY+ASMELLKH+E DR RPDRI+H YD WM K GS+K WPI+D+FMV++E+ILFCLEEG  EDA+Q+ L LMQ  ES NDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVD---HNIKLENHPQNFEAQDFYAISEE
        WFST+PEEIQW+ SLQ  SPI SD +I N +GCS+SNSHG GA   SN+ +SVMNDK+VHVD +G      +VD   HNIK+ENHP NFEAQDF   S E
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVD---HNIKLENHPQNFEAQDFYAISEE

Query:  KDENEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEME
        KDENEASFSDNGGYQHYVSIFSALEGLDPLLLPL LP SIENWENAISLC EFLN+YYKDAVKHL LALNSNPP+L ALLPLIQLLLIGGR+DKAL+EME
Subjt:  KDENEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEME

Query:  NICRDSNAALPF
          C DSNAALPF
Subjt:  NICRDSNAALPF

A0A5D3D7T2 Uncharacterized protein1.2e-13552.98Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        MAD  V+E EYGS  P SRKRKAD  ADGNND +RA  M+RI LSLTKPSFVLGLAPKMVRAENRITLRN LHKLM Q NWVEA GVLSMLL+GTLRD S
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKY-----------------------------------------------AASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMV
        PI NRLKY                                               +ASMELLKH+E DR RPDRI+H YD WM K GS+K WPI+D+FMV
Subjt:  PIENRLKY-----------------------------------------------AASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMV

Query:  RVEFILFCLEEGNTEDAYQDAL------------------------------------------------------------------------------
        ++E+ILFCLEEG  EDA+Q+ L                                                                              
Subjt:  RVEFILFCLEEGNTEDAYQDAL------------------------------------------------------------------------------

Query:  ---------------------------C----------------LMQKHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNG
                                   C                LMQ  ES NDPMSNMIIGLTFRQLWFST+PEEIQW+ SLQ  SPI SD +I N +G
Subjt:  ---------------------------C----------------LMQKHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNG

Query:  CSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVD---HNIKLENHPQNFEAQDFYAISEEKDENEASFSDNGGYQHYVSIFSALEGLDPLLL
        CS+SNSHG GA   SN+ +SVMNDK+VHVD +G      +VD   HNIK+ENHP NFEAQDF   S EKDENEASFSDNGGYQHYVSIFSALEGLDPLLL
Subjt:  CSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVD---HNIKLENHPQNFEAQDFYAISEEKDENEASFSDNGGYQHYVSIFSALEGLDPLLL

Query:  PLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQL
        PL LP SIENWENAISLC EFLN+YYKDAVKHL LALNSNPP+L ALLPLIQ+
Subjt:  PLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQL

A0A6J1CPA4 uncharacterized protein LOC1110129193.8e-18780.93Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        MAD  V+EAEYG N+P +RKRKAD  ADG +DG+RA  M+R+TLSLTKPSFV+GL PKMVR ENR+TLRNVL KL+ Q NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRA--MRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI+NRLKY ASMELLKH+E DR RP+RIKH YDNWM KIGSMK WPI+D+FMV VEFILFCLEEGNTEDA+Q ALCLMQ+H+SVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFST+PEEIQW+ SLQFHSPIQ DR+I N  GCSVSNSHGDGA YQSNS TSVMNDKLVHVDS+G      EVD ++K+ENHPQNFEA DFY  S EK+E
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEAS SDNGGYQHYVSIFSALEGLDPLLLPLHLP SI+NWENAISLC EFLN YYKDAVKHLDLALNSNPP+L ALLPLIQLLLIGGRVDKAL E+E IC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
         DSNAALPF
Subjt:  RDSNAALPF

A0A6J1FSH8 uncharacterized protein LOC1114468957.2e-18680.44Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        + D LV+EAEYGS R T RKRK DTAADG+NDG+R  AM++ITL+LTKPSFVLG+ PKM+RAENR TLRNVL KLMMQ NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI NRLKY+ SMELLKH+E DR RP+RIKH YDNWM KIGSMKRWP++D+FMV VEFILFCLEEG+TEDA+Q ALCLMQ+HESVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFSTLPEEIQW+ SLQ+HSPI+SDR+I N +GCSVSNS GDGASYQS+S TSVM+ KL+HVDS+G     FE DH IK+EN PQ FE  DFYA S EKDE
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEASFSDNG YQH VSIFSALEGLDPLLLPLHLPSS+ENWENA+SLC EFLN+YYKDAVKHL+LALNSNPP+L ALLP IQLLLIGGRVDKAL+EMENIC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
        RDSNA LPF
Subjt:  RDSNAALPF

A0A6J1IWZ8 uncharacterized protein LOC1114793316.1e-18579.95Show/hide
Query:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS
        M D LV+EAE+GS R T RKRK DT ADG+NDG+R  AM++ITL+LTKPSFVLG+ PKM+RAENR TLRNVL KLMMQ NWVEA GVLSMLLKGTLRDRS
Subjt:  MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQR--AMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRS

Query:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL
        PI NRLKY+ SMELLKH+E DR RP+RIKH YDNWM KIGSMKRWP++D+FMV VEFILFCLEEG+TEDA+Q ALCLMQ+HESVNDPMSNMIIGLTFRQL
Subjt:  PIENRLKYAASMELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQL

Query:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE
        WFSTLPEEIQW+ SLQ+HSPI+SDR+I N +GCSVSNS GDGASYQS+S TSVM+ KL+HVDS+G     FE DH IK+ENHPQ FE  DFY  S EKDE
Subjt:  WFSTLPEEIQWKGSLQFHSPIQSDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKG-----FEVDHNIKLENHPQNFEAQDFYAISEEKDE

Query:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC
        NEASFSDNGGYQH VSIFSALEGLDPLLLPLHLP S+ENWENA+SLC EFLN+YYKDAVKHL+LALNSNPP+L ALLP IQLLLIGGRVDKAL+EMENIC
Subjt:  NEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENIC

Query:  RDSNAALPF
         DSNA LPF
Subjt:  RDSNAALPF

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G53200.1 unknown protein1.6e-4431.82Show/hide
Query:  KRKADTAADGNNDGQRAMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRSPIENRLKYAASMELLKHVECD
        KR+  ++++ ++D Q+  +RI     KPS++L + PK  R+E    L  +L +L+   +W +A  VLS+L+KGT+ D  P  NRLKY A ++++ H E +
Subjt:  KRKADTAADGNNDGQRAMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRSPIENRLKYAASMELLKHVECD

Query:  RKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQLWFSTL-----PEEIQWKGSLQ
        + + D I   YD W+ +IG   +   +++ +V  E I   +E     +AY   + LMQ  +    P +N+ IG++F ++W +       PE+    GS+ 
Subjt:  RKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQLWFSTL-----PEEIQWKGSLQ

Query:  FHSPIQSDRIIP----NVNGCSVSNSHGDGASYQSNSVTSVM-NDKLVHVDSKGFE--VDHNIKLENHPQNFEAQDFYAISEEKDENEASFSDNGGYQHY
          S   S  ++     + + CS+++      S + +S TSVM N K+ H+     E  +D  +K+ + P        YAISE   ENEAS  D G  +  
Subjt:  FHSPIQSDRIIP----NVNGCSVSNSHGDGASYQSNSVTSVM-NDKLVHVDSKGFE--VDHNIKLENHPQNFEAQDFYAISEEKDENEASFSDNGGYQHY

Query:  VSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPM-LAALLPLIQLLLIGGRVDKALNEMENICRDSNAALPF
         ++ + L  +DP LLP   P   + +   ++      ++YYK+AVK++   L S P + LAAL PL+Q+LLIGG VD+A+  +E +C   +   PF
Subjt:  VSIFSALEGLDPLLLPLHLPSSIENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPM-LAALLPLIQLLLIGGRVDKALNEMENICRDSNAALPF

AT1G53200.2 unknown protein1.7e-2230.86Show/hide
Query:  DKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQLWFSTL-----PEEIQWKGSLQFHSPIQSDRIIP----NVNGCSVSNSH
        ++ +V  E I   +E     +AY   + LMQ  +    P +N+ IG++F ++W +       PE+    GS+   S   S  ++     + + CS+++  
Subjt:  DKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQLWFSTL-----PEEIQWKGSLQFHSPIQSDRIIP----NVNGCSVSNSH

Query:  GDGASYQSNSVTSVM-NDKLVHVDSKGFE--VDHNIKLENHPQNFEAQDFYAISEEKDENEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWE
            S + +S TSVM N K+ H+     E  +D  +K+ + P        YAISE   ENEAS  D G  +   ++ + L  +DP LLP   P   + + 
Subjt:  GDGASYQSNSVTSVM-NDKLVHVDSKGFE--VDHNIKLENHPQNFEAQDFYAISEEKDENEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSIENWE

Query:  NAISLCDEFLNNYYKDAVKHLDLALNSNPPM-LAALLPLIQLLLIGGRVDKALNEMENICRDSNAALPF
          ++      ++YYK+AVK++   L S P + LAAL PL+Q+LLIGG VD+A+  +E +C   +   PF
Subjt:  NAISLCDEFLNNYYKDAVKHLDLALNSNPPM-LAALLPLIQLLLIGGRVDKALNEMENICRDSNAALPF


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGATGGGCTTGTATTGGAAGCTGAATATGGTTCTAACAGACCCACTAGTCGGAAAAGAAAGGCTGATACAGCAGCTGACGGTAATAATGATGGCCAACGAGCAAT
GAGAAGAATTACATTGTCTTTGACAAAACCATCGTTTGTTCTGGGGCTTGCACCGAAGATGGTGAGGGCAGAAAATCGAATTACATTGCGCAATGTCTTGCACAAACTTA
TGATGCAGCACAACTGGGTGGAAGCATGTGGCGTATTGAGCATGTTACTGAAAGGGACTTTGCGGGATAGATCTCCTATCGAGAATCGTTTGAAGTATGCGGCTTCAATG
GAGCTTCTTAAGCATGTAGAATGTGATCGTAAGAGACCAGATAGGATCAAACACACTTATGACAACTGGATGTGGAAGATTGGATCAATGAAGCGTTGGCCAATTAAGGA
CAAATTTATGGTCCGTGTGGAATTCATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGCATATCAAGATGCTTTATGCCTCATGCAAAAGCATGAATCTGTGAATG
ATCCAATGTCAAATATGATTATAGGACTGACATTTCGACAACTCTGGTTCTCTACCCTTCCAGAAGAGATTCAGTGGAAGGGCTCTCTACAATTTCACTCCCCAATTCAA
TCAGATAGGATAATTCCAAATGTAAATGGGTGTTCTGTCAGCAACTCTCATGGAGATGGTGCCTCATATCAGAGTAATTCAGTGACTTCCGTCATGAATGATAAATTAGT
TCATGTTGATAGCAAGGGTTTTGAGGTTGATCATAATATAAAATTGGAAAATCATCCCCAAAATTTTGAGGCACAAGATTTTTATGCGATTTCAGAAGAAAAAGATGAAA
ATGAAGCCTCTTTCTCAGATAATGGAGGTTATCAACACTATGTTTCAATTTTTTCTGCTCTCGAGGGTTTGGATCCACTATTGTTGCCTCTACATTTGCCATCTTCCATT
GAGAATTGGGAGAATGCCATTAGTTTATGCGACGAGTTTCTGAATAACTATTATAAGGACGCAGTGAAGCACCTAGACCTTGCTCTTAACTCAAATCCACCAATGTTGGC
TGCCTTACTTCCTCTTATACAGTTGTTGTTGATTGGAGGTCGAGTTGACAAAGCACTCAATGAAATGGAAAATATCTGTCGTGATTCAAATGCAGCACTTCCCTTCAGTT
ATGCAGATTGA
mRNA sequenceShow/hide mRNA sequence
TTAAATTTGGGTTAGGGTTTCAATTAATCCTCTCTCCATGCACGATTTTCGAGGAGTTCAACCTCTCTCTCCATCTTCTTCTTCACGAGGAGCCAACCCACGCACAAAAG
CTCCTTGCCGCCGACCACCTCAAACCACGCCTCCCTCCATCGCAACCGTGAGCCGCCGGACGTCCACCTGCCCTTGCCGCCGCTAGCGTTGCCGTGAAGTCGAGGTCGCC
GCAAGCTGCTCGCGAGCGAAGCAGCCACTGGACGTCTTCTCTCAACGCGTCGGCAACCTCCCTCAGCCACATCACACCGTGGGAGCTACACGACAGTCAGCCGCCGTTCG
AGTTGTTTGTCGCCGTCGGATCCTCGCCGTGTCGTCTGCTGTCACACGCGCAGCTTCGCCGGAGGACGTTGGATCTGCCAGATCTGATACCAAACGACCGGTTTCATTCG
TGAAGTAGCCCGTTTCGAGCAGTCGGCGGTCGGGCATCTCGAAGCCGAGCAGAACGCGCCGCCGCATTGCCGGAGTAGTCCGTTTCAGCCAATTTCAATCTCCCAGATCA
GATTCGAACCTTCTTTTCGGGACTCGAGTATTGATCACGATCGAGTCAGAGATTCAATTTAAGGATCTCGGAGGAACTAAAGCCGAGGATTTGAGTCTCAGTAAGTAGGG
ACGTTGCACTCTGTCTCCCTCAATCTCTCCTATTCAACAAAACATGGCCGATGAGCCCCAACCTAAAACTGGCTCACCGGTTGCGTCTCACTAACTGAGGAAGATTTCAA
GAAGAACGGAATCAGAATCCATGGCAGATGGGCTTGTATTGGAAGCTGAATATGGTTCTAACAGACCCACTAGTCGGAAAAGAAAGGCTGATACAGCAGCTGACGGTAAT
AATGATGGCCAACGAGCAATGAGAAGAATTACATTGTCTTTGACAAAACCATCGTTTGTTCTGGGGCTTGCACCGAAGATGGTGAGGGCAGAAAATCGAATTACATTGCG
CAATGTCTTGCACAAACTTATGATGCAGCACAACTGGGTGGAAGCATGTGGCGTATTGAGCATGTTACTGAAAGGGACTTTGCGGGATAGATCTCCTATCGAGAATCGTT
TGAAGTATGCGGCTTCAATGGAGCTTCTTAAGCATGTAGAATGTGATCGTAAGAGACCAGATAGGATCAAACACACTTATGACAACTGGATGTGGAAGATTGGATCAATG
AAGCGTTGGCCAATTAAGGACAAATTTATGGTCCGTGTGGAATTCATTCTTTTCTGCCTTGAAGAAGGAAACACGGAGGATGCATATCAAGATGCTTTATGCCTCATGCA
AAAGCATGAATCTGTGAATGATCCAATGTCAAATATGATTATAGGACTGACATTTCGACAACTCTGGTTCTCTACCCTTCCAGAAGAGATTCAGTGGAAGGGCTCTCTAC
AATTTCACTCCCCAATTCAATCAGATAGGATAATTCCAAATGTAAATGGGTGTTCTGTCAGCAACTCTCATGGAGATGGTGCCTCATATCAGAGTAATTCAGTGACTTCC
GTCATGAATGATAAATTAGTTCATGTTGATAGCAAGGGTTTTGAGGTTGATCATAATATAAAATTGGAAAATCATCCCCAAAATTTTGAGGCACAAGATTTTTATGCGAT
TTCAGAAGAAAAAGATGAAAATGAAGCCTCTTTCTCAGATAATGGAGGTTATCAACACTATGTTTCAATTTTTTCTGCTCTCGAGGGTTTGGATCCACTATTGTTGCCTC
TACATTTGCCATCTTCCATTGAGAATTGGGAGAATGCCATTAGTTTATGCGACGAGTTTCTGAATAACTATTATAAGGACGCAGTGAAGCACCTAGACCTTGCTCTTAAC
TCAAATCCACCAATGTTGGCTGCCTTACTTCCTCTTATACAGTTGTTGTTGATTGGAGGTCGAGTTGACAAAGCACTCAATGAAATGGAAAATATCTGTCGTGATTCAAA
TGCAGCACTTCCCTTCAGTTATGCAGATTGAGGGCTGCACTTGTGGAACATTTTGATCGTGGTAACGATGTCTTGCTTTCAACTTGTTATGAGCAAATATTGAAGAAGGA
TCCAACATGTTGTCATTCACTGGGAAAACTTGTTCACATGCATAGAAATGGCAATTACAGTGTTGAATCTCTATTGGAAATGATAGCTTTGCATTTAGATGGTACATGTG
CGGAATATGATACATGGAGAGAGTTGGCTGTGTGTTTTCTGAAACTTTTTCGATTTGAAGAGGATAGAGTATCAACAGCATGTTCAATTGGGACAGGTGGACATAAGCTG
AGGTCCTCATTGAATATTAACAGTAACCTTAAGTTGTTAACTGAAAGGAACCTGAGAAACACGTGGAGATTGCGTTGTCGATGGTGGCTGACACACCATTTCGGCCATAA
AATTACATCAGAAACTTTGGTTGGTAATTTGGAGCTCTTGACTTACAAAGCAGCGTGTGCATGCCATATGTATGGAAGCAAATACAAATATGTGGTAGAGGTTTACAACC
TTTTAGATAAGCAAATCAGTAAGGACTTGTTATTGTTTTTAAAGAGGCACATGAAGAATTCATTTGGACTCTATTCTAAATTATAATCAAGAGTTTTCCTTCTTCAAATC
TTTTATCATTTACACTTTTCTTGGTGCATACAATTATTCTTTTGCCTACTAAGTGCCCCCATTGGAGAAACAATTTCACCTCACAAATAATATACACACATATTTTGGAG
GGAATGTCTTGATCTAGTAGTATGGATTATATATGGACGATAGTTAGAATTAAATGACTATTATGTCTCTTATTCGGGATTTGTGGAAAAGTGGATCAGGTTGGGCCTTG
TAGATAGTTCTGTTGGTG
Protein sequenceShow/hide protein sequence
MADGLVLEAEYGSNRPTSRKRKADTAADGNNDGQRAMRRITLSLTKPSFVLGLAPKMVRAENRITLRNVLHKLMMQHNWVEACGVLSMLLKGTLRDRSPIENRLKYAASM
ELLKHVECDRKRPDRIKHTYDNWMWKIGSMKRWPIKDKFMVRVEFILFCLEEGNTEDAYQDALCLMQKHESVNDPMSNMIIGLTFRQLWFSTLPEEIQWKGSLQFHSPIQ
SDRIIPNVNGCSVSNSHGDGASYQSNSVTSVMNDKLVHVDSKGFEVDHNIKLENHPQNFEAQDFYAISEEKDENEASFSDNGGYQHYVSIFSALEGLDPLLLPLHLPSSI
ENWENAISLCDEFLNNYYKDAVKHLDLALNSNPPMLAALLPLIQLLLIGGRVDKALNEMENICRDSNAALPFSYAD