; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc08G13680 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc08G13680
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPlant protein of unknown function (DUF247)
Genome locationClcChr08:24779087..24782968
RNA-Seq ExpressionClc08G13680
SyntenyClc08G13680
Gene Ontology termsGO:0016020 - membrane (cellular component)
InterPro domainsIPR004158 - Protein of unknown function DUF247, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0064945.1 UPF0481 protein [Cucumis melo var. makuwa]1.0e-2254.39Show/hide
Query:  ENEGRFQFQKMEKSEI--EMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR
        EN   F+ +  ++SEI     EA   + N + +I E  Q LC  +VIS++ M+ Q+ +IN + SIYR+PKQL EMNPKAY PQLISIGPFHH  CQ DF+
Subjt:  ENEGRFQFQKMEKSEI--EMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR

Query:  ATEQYKLRALINFL
         TEQYKL+AL+NFL
Subjt:  ATEQYKLRALINFL

KAA0064948.1 UPF0481 protein [Cucumis melo var. makuwa]6.1e-2358.95Show/hide
Query:  EMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFRATEQYKLRALINFL
        E+   N + +++ +ISE  Q LC  +VIS+E+M+ Q+ +IN + SIYRVPKQL +MNP+ Y PQLISIGPFHH  CQ DF+ATEQYKL+AL+NFL
Subjt:  EMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFRATEQYKLRALINFL

XP_038886293.1 UPF0481 protein At3g47200-like [Benincasa hispida]1.9e-2464.29Show/hide
Query:  SEIEMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGC-QKDFRATEQYKLRALINFL
        SE E+ EAND  +N+A+ SE D+ +C  +VIS+E M+KQLP IN +SSIYRV KQL  MNPKAY PQ+ISIGPFHH C Q DF+  EQYKL+ LINFL
Subjt:  SEIEMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGC-QKDFRATEQYKLRALINFL

XP_038886585.1 UPF0481 protein At3g47200-like [Benincasa hispida]2.3e-3070Show/hide
Query:  MEKSEIEMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL
        M KSEIEM   ND +HNV EISE D+QLCG + IS+ +M+KQLP +N ESSIYRVPKQL +MNPKAY PQLISIGP+HH  +KD  ATEQYKL+ LINFL
Subjt:  MEKSEIEMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL

XP_038890800.1 UPF0481 protein At3g47200-like [Benincasa hispida]3.2e-2461.39Show/hide
Query:  MEKSEIEMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINF
        ME SEIE  EAND TH+ V EISE DQ+LCG +VI +E+M++QLP +N + SIYR+ K+L E+N KAY PQLISIGP H G  KD  A E YKL+  INF
Subjt:  MEKSEIEMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINF

Query:  L
        L
Subjt:  L

TrEMBL top hitse value%identityAlignment
A0A1S3BD29 LOW QUALITY PROTEIN: UPF0481 protein At3g47200-like1.5e-2255.26Show/hide
Query:  ENEGRFQFQKMEKSEIEM--PEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR
        EN   F+ +  + SEI M   EA   ++N + +I E  Q LC  +VIS+++M+ Q+ +IN + SIYR+PKQL EMNPKAY PQLISIGPFHH   Q DF+
Subjt:  ENEGRFQFQKMEKSEIEM--PEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR

Query:  ATEQYKLRALINFL
        ATEQYKL+AL+NFL
Subjt:  ATEQYKLRALINFL

A0A1S3BDS2 UPF0481 protein At3g47200-like1.1e-2254.39Show/hide
Query:  ENEGRFQFQKMEKSEI--EMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR
        EN   F+ +  ++SEI     EA   + N + +I E  Q LC  +VIS++ M+ Q+  IN + SIYR+PKQL EMNPKAY PQLISIGPFHH  CQ DF+
Subjt:  ENEGRFQFQKMEKSEI--EMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR

Query:  ATEQYKLRALINFL
         TEQYKL+AL+NFL
Subjt:  ATEQYKLRALINFL

A0A5A7VBG0 UPF0481 protein2.9e-2358.95Show/hide
Query:  EMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFRATEQYKLRALINFL
        E+   N + +++ +ISE  Q LC  +VIS+E+M+ Q+ +IN + SIYRVPKQL +MNP+ Y PQLISIGPFHH  CQ DF+ATEQYKL+AL+NFL
Subjt:  EMPEANDTTHNVAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFRATEQYKLRALINFL

A0A5A7VF39 UPF0481 protein1.5e-2255.26Show/hide
Query:  ENEGRFQFQKMEKSEIEM--PEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR
        EN   F+ +  + SEI M   EA   ++N + +I E  Q LC  +VIS+++M+ Q+ +IN + SIYR+PKQL EMNPKAY PQLISIGPFHH   Q DF+
Subjt:  ENEGRFQFQKMEKSEIEM--PEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR

Query:  ATEQYKLRALINFL
        ATEQYKL+AL+NFL
Subjt:  ATEQYKLRALINFL

A0A5A7VGD0 UPF0481 protein5.0e-2354.39Show/hide
Query:  ENEGRFQFQKMEKSEI--EMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR
        EN   F+ +  ++SEI     EA   + N + +I E  Q LC  +VIS++ M+ Q+ +IN + SIYR+PKQL EMNPKAY PQLISIGPFHH  CQ DF+
Subjt:  ENEGRFQFQKMEKSEI--EMPEANDTTHN-VAEISEVDQQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHH-GCQKDFR

Query:  ATEQYKLRALINFL
         TEQYKL+AL+NFL
Subjt:  ATEQYKLRALINFL

SwissProt top hitse value%identityAlignment
Q9SD53 UPF0481 protein At3g472001.2e-0541.54Show/hide
Query:  EEMMKQLPAINRES-SIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL
        +E +  L +  +ES  I+RVP+    +NPKAY P+++SIGP+H+G +K  +  +Q+K R L  FL
Subjt:  EEMMKQLPAINRES-SIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL

Arabidopsis top hitse value%identityAlignment
AT3G47210.1 Plant protein of unknown function (DUF247)1.1e-0939.77Show/hide
Query:  ISEVDQQLCGKIVISMEEMMKQLPAINRES-SIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFLPVSIMTR
        IS +++Q+  ++    E+ +  L +  +ES  I+RVPK  +EMNP+AY P+++SIGP+HHG +K     +Q+KLR L  FL  + + R
Subjt:  ISEVDQQLCGKIVISMEEMMKQLPAINRES-SIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFLPVSIMTR

AT3G47250.1 Plant protein of unknown function (DUF247)5.0e-0737.84Show/hide
Query:  QLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL
        Q  GK +I +E   K          I+R+P  L+E+NPKAY P+++SIGP+H+G +   +  +Q+K R L  F+
Subjt:  QLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL

AT3G47250.2 Plant protein of unknown function (DUF247)5.0e-0737.84Show/hide
Query:  QLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL
        Q  GK +I +E   K          I+R+P  L+E+NPKAY P+++SIGP+H+G +   +  +Q+K R L  F+
Subjt:  QLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFL

AT3G50160.1 Plant protein of unknown function (DUF247)1.7e-0735.85Show/hide
Query:  FQKMEKSEIEMPEANDT-THNVAEISEVDQQLCGKI-VISMEEMMKQLPAINRESS-----IYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQ
        + ++E  E+   +  +T   +V  I + ++Q   +I VIS+ + MK L   N  +S     IYRVP  L E + K+Y PQ++SIGP+HHG  K     E+
Subjt:  FQKMEKSEIEMPEANDT-THNVAEISEVDQQLCGKI-VISMEEMMKQLPAINRESS-----IYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQ

Query:  YKLRAL
        +K RA+
Subjt:  YKLRAL

AT4G31980.1 unknown protein9.1e-0937.14Show/hide
Query:  IVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFLP
        +V S++  +  L +++ +  IY+VP +L  +NP AY P+L+S GP H G +++ +A E  K R L++F+P
Subjt:  IVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAATTTCTCTTTTGATTATCCATTTAGAAAGAAATTGTATGGAGTTGGTGACACTTGGAACAATTCCACGGCCAAATTATTGTGGTGGAAAGGATGAAATAAATAC
CACCACAGGCTTAGTGTGCAACTCCATCCATCTTCTTCCCTCATCATTTCATCTTCCAACAAACAACACTAACCATCTCTGCCTCCTTCCACCAAATTTGCTGACAAGGA
GATTTGAAAACGAGGGTCGGTTTCAATTCCAAAAAATGGAAAAGAGTGAGATTGAAATGCCTGAAGCAAATGATACAACTCACAATGTGGCAGAAATTAGTGAGGTTGAT
CAACAACTTTGTGGTAAAATTGTGATATCCATGGAAGAAATGATGAAACAATTGCCTGCTATTAATAGAGAAAGTAGCATCTATCGAGTTCCCAAACAGTTAAGCGAGAT
GAATCCTAAAGCCTATGCCCCTCAACTCATTTCCATAGGCCCTTTTCATCATGGATGTCAAAAGGATTTTAGAGCCACAGAACAATATAAGCTTCGAGCTCTTATTAACT
TTCTACCCGTATCAATAATGACAAGAAGGAATATTCATTGGAGGAGGAGATTGTGA
mRNA sequenceShow/hide mRNA sequence
ATGATAATTTCTCTTTTGATTATCCATTTAGAAAGAAATTGTATGGAGTTGGTGACACTTGGAACAATTCCACGGCCAAATTATTGTGGTGGAAAGGATGAAATAAATAC
CACCACAGGCTTAGTGTGCAACTCCATCCATCTTCTTCCCTCATCATTTCATCTTCCAACAAACAACACTAACCATCTCTGCCTCCTTCCACCAAATTTGCTGACAAGGA
GATTTGAAAACGAGGGTCGGTTTCAATTCCAAAAAATGGAAAAGAGTGAGATTGAAATGCCTGAAGCAAATGATACAACTCACAATGTGGCAGAAATTAGTGAGGTTGAT
CAACAACTTTGTGGTAAAATTGTGATATCCATGGAAGAAATGATGAAACAATTGCCTGCTATTAATAGAGAAAGTAGCATCTATCGAGTTCCCAAACAGTTAAGCGAGAT
GAATCCTAAAGCCTATGCCCCTCAACTCATTTCCATAGGCCCTTTTCATCATGGATGTCAAAAGGATTTTAGAGCCACAGAACAATATAAGCTTCGAGCTCTTATTAACT
TTCTACCCGTATCAATAATGACAAGAAGGAATATTCATTGGAGGAGGAGATTGTGA
Protein sequenceShow/hide protein sequence
MIISLLIIHLERNCMELVTLGTIPRPNYCGGKDEINTTTGLVCNSIHLLPSSFHLPTNNTNHLCLLPPNLLTRRFENEGRFQFQKMEKSEIEMPEANDTTHNVAEISEVD
QQLCGKIVISMEEMMKQLPAINRESSIYRVPKQLSEMNPKAYAPQLISIGPFHHGCQKDFRATEQYKLRALINFLPVSIMTRRNIHWRRRL