CuGenDBv2

Sgr025508 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr025508
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: ABC transporter ABCE
Genome location: tig00007724:992787..998893
RNA-Seq Expression: Sgr025508
Synteny: Sgr025508
Gene Ontology terms:
GO:0000054 - ribosomal subunit export from nucleus (biological process)
GO:0006413 - translational initiation (biological process)
GO:0006415 - translational termination (biological process)
GO:0016020 - membrane (cellular component)
GO:0005506 - iron ion binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0043024 - ribosomal small subunit binding (molecular function)
GO:0051536 - iron-sulfur cluster binding (molecular function)
InterPro domains:
IPR013283 - RLI1
IPR017896 - 4Fe-4S ferredoxin-type, iron-sulphur binding domain
IPR017900 - 4Fe-4S ferredoxin, iron-sulphur binding, conserved site
IPR021039 - Iron-sulphur binding protein LdpA, C-terminal
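
The genome location field follows a contig:start..end pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the record itself does not state this), the field can be split and the locus span computed as follows. The function name parse_location is hypothetical and not part of any CuGenDB tool.

```python
import re

def parse_location(location: str):
    """Split a 'contig:start..end' string into its parts.

    Assumes 1-based, inclusive coordinates (an assumption, not stated
    in the record), so the span is end - start + 1.
    """
    match = re.fullmatch(r"([^:\s]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    contig = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return contig, start, end, end - start + 1

# Location string taken from the record above.
print(parse_location("tig00007724:992787..998893"))
# -> ('tig00007724', 992787, 998893, 6107)
```

Under that assumption the locus spans 6107 bp on contig tig00007724.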


Homology
GenBank top hits (e value, % identity, alignment)
XP_008458354.1 PREDICTED: uncharacterized protein LOC103497790 isoform X1 [Cucumis melo] | e value: 7.2e-200 | % identity: 85.31
Query:  MALSLS-CHAALHV-QHQVAPKH---KNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
        MALSLS CHA LH+ QHQVA ++   KNLDNVR LV RIGIASVQSS L+SL+NG+WVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
Subjt:  MALSLS-CHAALHV-QHQVAPKH---KNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN

Query:  EGIQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKI
        EGIQAARGI  VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISLQEE + EL QVA VSGVLKGGVITERCYGCGRC PVCPYDKI
Subjt:  EGIQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKI

Query:  KLVTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIG-DLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRG
         LVTYVRDAATT KL+KRGDVDALEIHTNGRQTT FQELWD LGDSSKYLRLVAVSLPNIG DLTVSTMKTM+SIME QLHCLNLWQLDGRPMSGDIGRG
Subjt:  KLVTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIG-DLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRG

Query:  ATRETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED
        ATRETIAFAAQLA ++DRPPGFLQLAGGTNFHTVDGLKKE LFQSTS +  S NEEL     SSL+ALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED
Subjt:  ATRETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED

Query:  YPDYLLAALGEALALITIFDEVDSDAVSN
        YPD LLAAL EAL L+      D   +S+
Subjt:  YPDYLLAALGEALALITIFDEVDSDAVSN

XP_022138429.1 uncharacterized protein LOC111009602 [Momordica charantia] | e value: 8.7e-214 | % identity: 90.49
Query:  MALSLSCHAALHVQHQVAPKHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEGIQA
        MA+SLSCHAALH Q Q APKHKNLDNV+++VKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADAS++SAVNEGIQA
Subjt:  MALSLSCHAALHVQHQVAPKHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEGIQA

Query:  ARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKLVTY
        ARG+AAVRRPWVMISVNDDQDLHFRKAEFDPENCP DCSRPCE VCPANAISLQEETM +LPQVAS+SGVLKGGVI+ERCYGCGRCFPVCPYDKIKLVTY
Subjt:  ARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKLVTY

Query:  VRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATRETI
        VRDAATTA+L+KR DVDALEIHTNGRQTT FQE WD LGD+SKYLRLVAVSLPNIGDLTVSTMKTMYSIME +LHC NLWQLDGRPMSGDIG+GATRETI
Subjt:  VRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATRETI

Query:  AFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPDYLL
        AF+AQLALS+DRPPGFLQLAGGTN HTVDGLKKE+LFQSTST TT MNEEL AKS SS+HALIGGIAYGGYARKIVGRVLSSMQ QNGDANIE+YPDYLL
Subjt:  AFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPDYLL

Query:  AALGEALALI
        AALGEALAL+
Subjt:  AALGEALALI

XP_022958999.1 uncharacterized protein LOC111460121 [Cucurbita moschata] | e value: 4.7e-199 | % identity: 84.27
Query:  MALSLSCHAALHVQHQVAP---KHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        MALSLSCHAAL +QHQVA     +KNLDNVR LV RIGIASVQSSPLESLR+G+W+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MALSLSCHAALHVQHQVAP---KHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL
        IQAAR I  VRRPWVMISVND QDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL+EE M E  +VAS+SG LKGGVITERCYGCGRC PVCPYDKI L
Subjt:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL

Query:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR
        VTYVRDAATTAKL+KRGDVDALEIHTNGRQTT FQELWD LGDSSKYLRLVAVSLPNIGDLT+STMKTM+SIME QL C NLWQLDGRPMSGDIGRGATR
Subjt:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR

Query:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD
        ETIAFAAQLALS+DRPPGFLQLAGGTNF+TVDGLKK+ LFQSTST    MN+EL     SSLHALIGGIAYGGYARKIVGRVLSSMQ Q+GD+NIE+YPD
Subjt:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD

Query:  YLLAALGEALALITIFDEVDSDAVSN
        YLLAAL EALAL+      D   +S+
Subjt:  YLLAALGEALALITIFDEVDSDAVSN

XP_023547405.1 uncharacterized protein LOC111806365 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.2e-199 | % identity: 84.27
Query:  MALSLSCHAALHVQHQVAP---KHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        MALSLSCHAAL +QHQVA     +KNLDNVR LV RIGIASVQSSPLESLR+G+W+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MALSLSCHAALHVQHQVAP---KHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL
        IQAAR I  VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL+EE M E  +VAS+SG LKGGVITERCYGCGRC PVCPYDKI L
Subjt:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL

Query:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR
        VTYVRDAATTA+L+KRGDVDALEIHTNGRQTTSFQELWD LGDSSKYLRLVAVSLPNIGDLT+STMKTM+SIME QL C NLWQLDGRPMSGDIGRGATR
Subjt:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR

Query:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD
        ETIAFAA LALS+DRPPGFLQLAGGTNF+TVDGLKK+ LFQSTST    MN+EL     SSLHALIGGIAYGGYARKIVGRVLSSMQ Q+GD+NIEDYPD
Subjt:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD

Query:  YLLAALGEALALITIFDEVDSDAVSN
        +LLAAL EALAL+      D   +S+
Subjt:  YLLAALGEALALITIFDEVDSDAVSN

XP_038906651.1 uncharacterized protein LOC120092590 [Benincasa hispida] | e value: 1.4e-208 | % identity: 87.56
Query:  MALSLSCHAALHVQHQVAPK---HKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        MALSLSCHA LH+QHQVA +   +KNL+NVR LV RIGIASVQSSPL+SLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MALSLSCHAALHVQHQVAPK---HKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL
        IQAARGI  VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISLQEETM E  QVASVSGVLKGGV+TERCYGCGRC PVCPYDKI L
Subjt:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL

Query:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR
        VTYVRDAATTAKL+KRGDVDALEIHTNGRQTT FQELWD LGDSSKYLRLVAVSLPNIGDLTVSTMKTM+SIME QLHCLNLWQLDGRPMSGDIGRG TR
Subjt:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR

Query:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD
        ETIAFAAQLALS+D PPGFLQLAGGTNFHTVDGLKKE LFQSTSTL  SMNEEL     SSLHALIGGIAYGGYARKIVGRVLSSM+TQNGDANIEDYPD
Subjt:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD

Query:  YLLAALGEALALITIFDEVDSDAVSN
        YLLAAL EA  L+      D   +S+
Subjt:  YLLAALGEALALITIFDEVDSDAVSN

TrEMBL top hits (e value, % identity, alignment)
A0A1S3C7S6 uncharacterized protein LOC103497790 isoform X1 | e value: 3.5e-200 | % identity: 85.31
Query:  MALSLS-CHAALHV-QHQVAPKH---KNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
        MALSLS CHA LH+ QHQVA ++   KNLDNVR LV RIGIASVQSS L+SL+NG+WVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
Subjt:  MALSLS-CHAALHV-QHQVAPKH---KNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN

Query:  EGIQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKI
        EGIQAARGI  VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISLQEE + EL QVA VSGVLKGGVITERCYGCGRC PVCPYDKI
Subjt:  EGIQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKI

Query:  KLVTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIG-DLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRG
         LVTYVRDAATT KL+KRGDVDALEIHTNGRQTT FQELWD LGDSSKYLRLVAVSLPNIG DLTVSTMKTM+SIME QLHCLNLWQLDGRPMSGDIGRG
Subjt:  KLVTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIG-DLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRG

Query:  ATRETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED
        ATRETIAFAAQLA ++DRPPGFLQLAGGTNFHTVDGLKKE LFQSTS +  S NEEL     SSL+ALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED
Subjt:  ATRETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED

Query:  YPDYLLAALGEALALITIFDEVDSDAVSN
        YPD LLAAL EAL L+      D   +S+
Subjt:  YPDYLLAALGEALALITIFDEVDSDAVSN

A0A5D3BV06 Uncharacterized protein | e value: 3.5e-200 | % identity: 85.31
Query:  MALSLS-CHAALHV-QHQVAPKH---KNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
        MALSLS CHA LH+ QHQVA ++   KNLDNVR LV RIGIASVQSS L+SL+NG+WVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN
Subjt:  MALSLS-CHAALHV-QHQVAPKH---KNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVN

Query:  EGIQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKI
        EGIQAARGI  VRRPWVMISVNDDQDLHFRKAEFDPENCP+DCSRPCEIVCPANAISLQEE + EL QVA VSGVLKGGVITERCYGCGRC PVCPYDKI
Subjt:  EGIQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKI

Query:  KLVTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIG-DLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRG
         LVTYVRDAATT KL+KRGDVDALEIHTNGRQTT FQELWD LGDSSKYLRLVAVSLPNIG DLTVSTMKTM+SIME QLHCLNLWQLDGRPMSGDIGRG
Subjt:  KLVTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIG-DLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRG

Query:  ATRETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED
        ATRETIAFAAQLA ++DRPPGFLQLAGGTNFHTVDGLKKE LFQSTS +  S NEEL     SSL+ALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED
Subjt:  ATRETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIED

Query:  YPDYLLAALGEALALITIFDEVDSDAVSN
        YPD LLAAL EAL L+      D   +S+
Subjt:  YPDYLLAALGEALALITIFDEVDSDAVSN

A0A6J1CA44 uncharacterized protein LOC111009602 | e value: 4.2e-214 | % identity: 90.49
Query:  MALSLSCHAALHVQHQVAPKHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEGIQA
        MA+SLSCHAALH Q Q APKHKNLDNV+++VKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADAS++SAVNEGIQA
Subjt:  MALSLSCHAALHVQHQVAPKHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEGIQA

Query:  ARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKLVTY
        ARG+AAVRRPWVMISVNDDQDLHFRKAEFDPENCP DCSRPCE VCPANAISLQEETM +LPQVAS+SGVLKGGVI+ERCYGCGRCFPVCPYDKIKLVTY
Subjt:  ARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKLVTY

Query:  VRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATRETI
        VRDAATTA+L+KR DVDALEIHTNGRQTT FQE WD LGD+SKYLRLVAVSLPNIGDLTVSTMKTMYSIME +LHC NLWQLDGRPMSGDIG+GATRETI
Subjt:  VRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATRETI

Query:  AFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPDYLL
        AF+AQLALS+DRPPGFLQLAGGTN HTVDGLKKE+LFQSTST TT MNEEL AKS SS+HALIGGIAYGGYARKIVGRVLSSMQ QNGDANIE+YPDYLL
Subjt:  AFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPDYLL

Query:  AALGEALALI
        AALGEALAL+
Subjt:  AALGEALALI

A0A6J1H6Q4 uncharacterized protein LOC111460121 | e value: 2.3e-199 | % identity: 84.27
Query:  MALSLSCHAALHVQHQVAP---KHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        MALSLSCHAAL +QHQVA     +KNLDNVR LV RIGIASVQSSPLESLR+G+W+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MALSLSCHAALHVQHQVAP---KHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL
        IQAAR I  VRRPWVMISVND QDLHFRKAEFDPENCP+DCSRPCEIVCPANAISL+EE M E  +VAS+SG LKGGVITERCYGCGRC PVCPYDKI L
Subjt:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL

Query:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR
        VTYVRDAATTAKL+KRGDVDALEIHTNGRQTT FQELWD LGDSSKYLRLVAVSLPNIGDLT+STMKTM+SIME QL C NLWQLDGRPMSGDIGRGATR
Subjt:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR

Query:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD
        ETIAFAAQLALS+DRPPGFLQLAGGTNF+TVDGLKK+ LFQSTST    MN+EL     SSLHALIGGIAYGGYARKIVGRVLSSMQ Q+GD+NIE+YPD
Subjt:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD

Query:  YLLAALGEALALITIFDEVDSDAVSN
        YLLAAL EALAL+      D   +S+
Subjt:  YLLAALGEALALITIFDEVDSDAVSN

A0A6J1KVU8 uncharacterized protein LOC111499161 isoform X1 | e value: 8.0e-197 | % identity: 83.1
Query:  MALSLSCHAALHVQHQVAPK---HKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
        MALSLSCHAAL +QHQVA +   +KNLDNVR LV RIGI+SVQSSPLESLR+G+W+KLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG
Subjt:  MALSLSCHAALHVQHQVAPK---HKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEG

Query:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL
        IQAAR I  VRRPWVMISVNDDQDLHFRKA FDPENCP+DCSRPCEIVCPANAISL++E M E  +VAS+SG LKGGVITERCYGCGRC PVCPYDKI L
Subjt:  IQAARGIAAVRRPWVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKL

Query:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR
        VTYVRDAATTAKL+KRGDVDALEIHTNGRQTT FQELW+ LGDSSKYLRLVAVSLPNIGDLT+STMKTM+SIME QL CLNLWQLDGRPMSGDIGRGATR
Subjt:  VTYVRDAATTAKLVKRGDVDALEIHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATR

Query:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD
        ETIAFAAQLALS+DRPPGFLQLAGGTNF+TVDGLKK+ LFQS   L   M++EL     SSLHALIGGIAYGGYARKIVGRVLSSMQ Q+GD+NIEDYPD
Subjt:  ETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDGLKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPD

Query:  YLLAALGEALALITIFDEVDSDAVSN
        YLLAAL EAL L+      D   +S+
Subjt:  YLLAALGEALALITIFDEVDSDAVSN

SwissProt top hits (e value, % identity, alignment)
Q1G3T1 TPD1 protein homolog 1 | e value: 2.3e-23 | % identity: 51.04
Query:  CSNRDISISQSKDS--TSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPLRFKS
        CS  DI + Q   +   SG+P Y V+I N+C+SDC  ++IH+ CGWF+S R+VNPR F+R+ YDDCLVN G+PL   +++ F Y+NSF YPL   S
Subjt:  CSNRDISISQSKDS--TSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPLRFKS

Q2QR54 TPD1 protein homolog 1A | e value: 1.1e-17 | % identity: 41.28
Query:  GRRRKLLMHGRCSNRDISISQSKDS--TSGIPQYIVQIANTCL------SDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFT
        G R + +  G     DI+I Q + +   SG+P Y V + N C        +CA + IH+ CGWF+S  +V+PR F+R+ +DDCL+N G+PL   ET+ F 
Subjt:  GRRRKLLMHGRCSNRDISISQSKDS--TSGIPQYIVQIANTCL------SDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFT

Query:  YSNSFMYPL
        Y+NSF Y L
Subjt:  YSNSFMYPL

Q6TLJ2 Protein TAPETUM DETERMINANT 1 | e value: 3.0e-23 | % identity: 50.54
Query:  RCSNRDISISQ--SKDSTSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPL
        +C + DI ++Q  ++   +GIP Y+V+I N C+S C  S IH++CGWF+SA+++NPR FKR+ YDDCLVN GKPL    T+ F Y+N+F Y L
Subjt:  RCSNRDISISQ--SKDSTSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPL

Q8S6P9 TPD1 protein homolog 1B | e value: 2.4e-17 | % identity: 38.85
Query:  ALLVPRLSSTSKTKSSFFADQNQIISASLEQKQAHGRR-RKLLMHGRCSNRDISISQ--SKDSTSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIV
        AL+V   SS+S  +S    + N++++ S +     GR   + +    CS +++ + Q  ++   SGIP Y V+I N C + C   D+H+ CG FASA +V
Subjt:  ALLVPRLSSTSKTKSSFFADQNQIISASLEQKQAHGRR-RKLLMHGRCSNRDISISQ--SKDSTSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIV

Query:  NPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPL
        +P  F+R+ ++DCLV GG  L  SE + F YSNSF YPL
Subjt:  NPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPL

Arabidopsis top hits (e value, % identity, alignment)
AT1G32583.1 FUNCTIONS IN: molecular_function unknown | e value: 1.6e-24 | % identity: 51.04
Query:  CSNRDISISQSKDS--TSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPLRFKS
        CS  DI + Q   +   SG+P Y V+I N+C+SDC  ++IH+ CGWF+S R+VNPR F+R+ YDDCLVN G+PL   +++ F Y+NSF YPL   S
Subjt:  CSNRDISISQSKDS--TSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPLRFKS

AT4G24972.1 tapetum determinant 1 | e value: 2.1e-24 | % identity: 50.54
Query:  RCSNRDISISQ--SKDSTSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPL
        +C + DI ++Q  ++   +GIP Y+V+I N C+S C  S IH++CGWF+SA+++NPR FKR+ YDDCLVN GKPL    T+ F Y+N+F Y L
Subjt:  RCSNRDISISQ--SKDSTSGIPQYIVQIANTCLSDCAPSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPL
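
The % identity values in the hit tables are conventionally the number of identical alignment columns divided by the alignment length. The sketch below illustrates that calculation from an alignment's middle line, assuming the usual BLAST-style convention in which a letter marks an identical residue, "+" a conservative substitution, and a space a mismatch or gap. The function name percent_identity is hypothetical, and whitespace in the alignments shown here may not have survived extraction exactly, so treat this as an illustration rather than a way to reproduce the reported figures.

```python
def percent_identity(midline: str, alignment_length: int) -> float:
    """Percent identity from a BLAST-style alignment midline.

    Letters in the midline are counted as identical columns; '+' and
    spaces are not. alignment_length is the total number of aligned
    columns, including gap columns.
    """
    if alignment_length <= 0:
        raise ValueError("alignment_length must be positive")
    identical = sum(1 for ch in midline if ch.isalpha())
    return round(100.0 * identical / alignment_length, 2)

# Toy example: 2 identical columns out of 4.
print(percent_identity("A+ E", 4))  # -> 50.0
```

For instance, 49 identical columns over a 96-column alignment gives 51.04, which would match the value reported for AT1G32583.1 if its midline contains 49 letters.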


Sequences
CDS sequence
ATGGCTTTGAGCTTGTCCTGCCATGCCGCTCTTCATGTTCAACATCAAGTGGCTCCCAAACATAAGAACCTCGACAATGTCAGAGACCTCGTTAAACGAATTGGGATTGC
GTCAGTTCAGTCTTCTCCTCTCGAATCCCTCCGAAATGGCCACTGGGTCAAGCTTATTTGTGGGGCAAGTTTCGAGGATGTGGTTGATATCAGGAATCTCTCTCTGGTTT
ACACCCTTGCTGGGGTTGATTGCATTGACTGCGCCGCTGATGCGTCGGTTGTTAGTGCGGTGAACGAGGGAATTCAAGCGGCGAGAGGGATTGCCGCCGTTCGCAGGCCT
TGGGTGATGATCAGTGTCAATGATGACCAAGATCTTCACTTCCGCAAAGCTGAGTTTGATCCAGAGAACTGTCCAGTTGACTGTTCTAGGCCTTGTGAGATTGTTTGCCC
TGCTAATGCAATCTCACTACAGGAAGAAACAATGTTAGAGCTTCCACAAGTGGCCAGTGTATCTGGAGTACTAAAGGGTGGAGTAATCACCGAGCGCTGTTATGGTTGTG
GCCGTTGCTTTCCAGTCTGCCCATATGATAAAATAAAGCTAGTCACATATGTAAGAGATGCAGCTACCACTGCTAAACTTGTAAAACGAGGCGACGTCGATGCGTTGGAG
ATTCACACCAATGGGAGGCAAACCACTTCTTTTCAAGAACTTTGGGATAATTTAGGGGACTCATCCAAATATCTAAGGCTAGTAGCAGTAAGCCTACCTAATATTGGGGA
TTTAACAGTATCTACAATGAAGACTATGTACTCGATCATGGAACCGCAGCTCCATTGTTTGAACTTATGGCAGTTAGATGGACGCCCGATGAGTGGAGATATCGGACGAG
GTGCCACAAGGGAAACGATTGCCTTTGCTGCTCAGTTAGCTCTTTCCAGTGATCGTCCCCCTGGCTTCCTTCAACTGGCTGGTGGCACAAATTTTCACACTGTTGATGGC
TTGAAGAAAGAGAGCCTTTTTCAATCAACATCTACTCTTACTACTTCGATGAACGAAGAGTTATTGGCAAAATCACCCAGTTCATTGCACGCTCTGATCGGTGGCATCGC
TTATGGGGGCTATGCTCGGAAGATTGTTGGAAGGGTCTTGAGTTCAATGCAGACACAAAATGGAGATGCCAATATCGAAGACTATCCCGACTATCTCCTGGCTGCACTTG
GGGAAGCCTTGGCTTTGATTACTATATTTGATGAGGTAGATTCCGATGCCGTGTCCAACAACGAAGAAGCTGCATCTGCATGTCAATGTGTTCGCTGCGCTGCGCCTCTC
TTTTTGCAGCAGACGACGAGTCATCAACAGTTCACAACCCAACACAACCTGCATCTTTCCTCGGTTAAAGCACTTCCACCCTCGGAGCTTGCTCTTTTGGTTCCCCGATT
GAGTTCAACTTCAAAGACGAAATCTTCATTTTTCGCCGATCAGAATCAGATTATAAGCGCAAGTTTAGAGCAAAAGCAAGCGCATGGCCGCCGTAGAAAGCTTTTGATGC
ACGGAAGATGCAGCAACAGAGATATAAGCATCTCGCAGAGCAAGGATTCAACATCGGGAATTCCGCAGTACATAGTTCAAATTGCAAACACTTGTCTCTCAGATTGCGCA
CCGTCCGATATTCATCTCCATTGCGGCTGGTTTGCTTCTGCGAGAATCGTAAACCCTAGAACTTTCAAGAGGATGTTTTACGACGATTGCTTGGTGAACGGAGGGAAGCC
ATTGAAGATCAGCGAAACCATCAGATTCACTTACTCCAACTCCTTCATGTATCCTCTCAGATTCAAGTCGGCCAAGTTCTGCTGA
mRNA sequence
ATGGCTTTGAGCTTGTCCTGCCATGCCGCTCTTCATGTTCAACATCAAGTGGCTCCCAAACATAAGAACCTCGACAATGTCAGAGACCTCGTTAAACGAATTGGGATTGC
GTCAGTTCAGTCTTCTCCTCTCGAATCCCTCCGAAATGGCCACTGGGTCAAGCTTATTTGTGGGGCAAGTTTCGAGGATGTGGTTGATATCAGGAATCTCTCTCTGGTTT
ACACCCTTGCTGGGGTTGATTGCATTGACTGCGCCGCTGATGCGTCGGTTGTTAGTGCGGTGAACGAGGGAATTCAAGCGGCGAGAGGGATTGCCGCCGTTCGCAGGCCT
TGGGTGATGATCAGTGTCAATGATGACCAAGATCTTCACTTCCGCAAAGCTGAGTTTGATCCAGAGAACTGTCCAGTTGACTGTTCTAGGCCTTGTGAGATTGTTTGCCC
TGCTAATGCAATCTCACTACAGGAAGAAACAATGTTAGAGCTTCCACAAGTGGCCAGTGTATCTGGAGTACTAAAGGGTGGAGTAATCACCGAGCGCTGTTATGGTTGTG
GCCGTTGCTTTCCAGTCTGCCCATATGATAAAATAAAGCTAGTCACATATGTAAGAGATGCAGCTACCACTGCTAAACTTGTAAAACGAGGCGACGTCGATGCGTTGGAG
ATTCACACCAATGGGAGGCAAACCACTTCTTTTCAAGAACTTTGGGATAATTTAGGGGACTCATCCAAATATCTAAGGCTAGTAGCAGTAAGCCTACCTAATATTGGGGA
TTTAACAGTATCTACAATGAAGACTATGTACTCGATCATGGAACCGCAGCTCCATTGTTTGAACTTATGGCAGTTAGATGGACGCCCGATGAGTGGAGATATCGGACGAG
GTGCCACAAGGGAAACGATTGCCTTTGCTGCTCAGTTAGCTCTTTCCAGTGATCGTCCCCCTGGCTTCCTTCAACTGGCTGGTGGCACAAATTTTCACACTGTTGATGGC
TTGAAGAAAGAGAGCCTTTTTCAATCAACATCTACTCTTACTACTTCGATGAACGAAGAGTTATTGGCAAAATCACCCAGTTCATTGCACGCTCTGATCGGTGGCATCGC
TTATGGGGGCTATGCTCGGAAGATTGTTGGAAGGGTCTTGAGTTCAATGCAGACACAAAATGGAGATGCCAATATCGAAGACTATCCCGACTATCTCCTGGCTGCACTTG
GGGAAGCCTTGGCTTTGATTACTATATTTGATGAGGTAGATTCCGATGCCGTGTCCAACAACGAAGAAGCTGCATCTGCATGTCAATGTGTTCGCTGCGCTGCGCCTCTC
TTTTTGCAGCAGACGACGAGTCATCAACAGTTCACAACCCAACACAACCTGCATCTTTCCTCGGTTAAAGCACTTCCACCCTCGGAGCTTGCTCTTTTGGTTCCCCGATT
GAGTTCAACTTCAAAGACGAAATCTTCATTTTTCGCCGATCAGAATCAGATTATAAGCGCAAGTTTAGAGCAAAAGCAAGCGCATGGCCGCCGTAGAAAGCTTTTGATGC
ACGGAAGATGCAGCAACAGAGATATAAGCATCTCGCAGAGCAAGGATTCAACATCGGGAATTCCGCAGTACATAGTTCAAATTGCAAACACTTGTCTCTCAGATTGCGCA
CCGTCCGATATTCATCTCCATTGCGGCTGGTTTGCTTCTGCGAGAATCGTAAACCCTAGAACTTTCAAGAGGATGTTTTACGACGATTGCTTGGTGAACGGAGGGAAGCC
ATTGAAGATCAGCGAAACCATCAGATTCACTTACTCCAACTCCTTCATGTATCCTCTCAGATTCAAGTCGGCCAAGTTCTGCTGA
Protein sequence
MALSLSCHAALHVQHQVAPKHKNLDNVRDLVKRIGIASVQSSPLESLRNGHWVKLICGASFEDVVDIRNLSLVYTLAGVDCIDCAADASVVSAVNEGIQAARGIAAVRRP
WVMISVNDDQDLHFRKAEFDPENCPVDCSRPCEIVCPANAISLQEETMLELPQVASVSGVLKGGVITERCYGCGRCFPVCPYDKIKLVTYVRDAATTAKLVKRGDVDALE
IHTNGRQTTSFQELWDNLGDSSKYLRLVAVSLPNIGDLTVSTMKTMYSIMEPQLHCLNLWQLDGRPMSGDIGRGATRETIAFAAQLALSSDRPPGFLQLAGGTNFHTVDG
LKKESLFQSTSTLTTSMNEELLAKSPSSLHALIGGIAYGGYARKIVGRVLSSMQTQNGDANIEDYPDYLLAALGEALALITIFDEVDSDAVSNNEEAASACQCVRCAAPL
FLQQTTSHQQFTTQHNLHLSSVKALPPSELALLVPRLSSTSKTKSSFFADQNQIISASLEQKQAHGRRRKLLMHGRCSNRDISISQSKDSTSGIPQYIVQIANTCLSDCA
PSDIHLHCGWFASARIVNPRTFKRMFYDDCLVNGGKPLKISETIRFTYSNSFMYPLRFKSAKFC
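
The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch, assuming the standard nuclear code (NCBI translation table 1) applies to this gene; the helper name translate is hypothetical, and unknown codons are reported as "X".

```python
from itertools import product

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # "X" for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 nucleotides of the CDS listed above.
print(translate("ATGGCTTTGAGCTTGTCCTGCCATGCCGCT"))  # -> MALSLSCHAA
```

Passing the full CDS (with line breaks left in) should yield a string that can be compared directly with the protein sequence listed above.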