CuGenDBv2

Tan0007643 (gene) of Snake gourd v1 genome

Gene ID: Tan0007643
Organism: Trichosanthes anguina (Snake gourd v1)
Description: DNA ligase 1 isoform X1
Genome location: LG02:96616741..96619109
RNA-Seq Expression: Tan0007643
Synteny: Tan0007643
Gene Ontology terms: NA
InterPro domains: NA
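
The genome location above follows a `sequence:start..end` pattern. The following is a minimal sketch for splitting such a string into its parts. It assumes the coordinates are 1-based and inclusive, which this page does not state, and `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re


def parse_location(location):
    """Split a string such as 'LG02:96616741..96619109' into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


seq_id, start, end = parse_location("LG02:96616741..96619109")
# Span length under the assumed 1-based, inclusive convention.
span = end - start + 1
print(seq_id, start, end, span)
```

Under that assumption the gene spans 2369 bp on LG02.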


Homology
GenBank top hits    e value    %identity    Alignment
KAG7029449.1 hypothetical protein SDJN02_07788 [Cucurbita argyrosperma subsp. argyrosperma]    1.5e-104    64.64
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKE-KSSRSRDLSDRKR----KEAKDLPKEAKVEAEQL
        MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKSKE KE K   SR L+D+K+    KEAKD  +  KVEAEQL
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKE-KSSRSRDLSDRKR----KEAKDLPKEAKVEAEQL

Query:  EKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLT
        EKSGLTEEHGQPVWP SPGYLSDG+QINHKRKRD SL P++G KPGK+IRIKL SSLS+QE  S  S+QTCS SG D S DQKRDEN     +++  C  
Subjt:  EKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLT

Query:  IADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA
         ++TA +VKD   SKP IKDP  HAVKD  +                                 SKPKIK P  HAVK+I +  GNV S+   PRT+SP 
Subjt:  IADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA

Query:  ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCRS SLWPRGQYL DADVYSLPYTIP+
Subjt:  ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

XP_022144272.1 chromatin assembly factor 1 subunit A-like [Momordica charantia]    1.1e-105    65
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK--KSKDKKHKSKEHKEKSSRSRDLSDRKRKE----AKDLPKEAKVEAEQ
        MSRCFPYPPPGYA KVARTEAALIESIKL SER++S+ +RKKEKSKH+KE+  KSK+KK + KE KEKSS S DL+D+K+KE    A+D  K  KVEAEQ
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK--KSKDKKHKSKEHKEKSSRSRDLSDRKRKE----AKDLPKEAKVEAEQ

Query:  LEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCL
        LEKSGLTEEHGQPVWPQSPGYLSDG+QINHKRKRDA L PN+ SKPGKIIRIKL SSLS QE  S  +QQTCSTSGR   +DQKRDEN  GP  QQKPC 
Subjt:  LEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCL

Query:  TIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA
        T ++T  +V++   KP IKD S                                                  S HAVKDI   QGNV  V  P RT+SPA
Subjt:  TIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA

Query:  ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTK-TNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        ES YEALFEKW+PPPLQLEQQM+DEEWLFGTRKQ  GQTTK T NKAFS VPSCRS SLWPRGQYLPDADVYSLPYTIPF
Subjt:  ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTK-TNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

XP_022997629.1 uncharacterized protein LOC111492505 isoform X1 [Cucurbita maxima]    1.3e-106    64.83
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE
        MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS   KE KEKSSRSRDL+D+K     KEAKD  +  KVEAE
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE

Query:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC
        QLEKSGLTEEHGQPVWP SPGYLSDG+QINHKRKRD SL P++G KPGK+IRIKL SSLS+QE  S G + TCS SGRD S DQK DEN     +++  C
Subjt:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC

Query:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS
           ++TA +VKD   SKP IKDP  HAVKD  +                                 SKPKIK PS HAVK+I +  GNV S+   PRT+S
Subjt:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS

Query:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCR+ SLWPRGQYL  ADVYSLPYTIP+
Subjt:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

XP_023545923.1 uncharacterized protein LOC111805212 isoform X1 [Cucurbita pepo subsp. pepo]    1.5e-107    65.35
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEH---KEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE
        MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKSKE    KEKSSRSRDL+D+K+    KE KD  +  KVEAE
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEH---KEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE

Query:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC
        QLEKSGLTEEHGQPVWP SPGYLSDG+QIN KRKRD SL P++G KPGK+IRIKL SSLS+QE  S GS+Q CS SGRD S DQK DEN     +++  C
Subjt:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC

Query:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS
           ++TA +VKD   SKP IKDP  HAVKD  +                                 SKPKIK PS HAVK+I +  GNV S+   PRT+S
Subjt:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS

Query:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS VPSCRS SLWPRGQYL DADVYSLPYTIP+
Subjt:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

XP_038885448.1 DNA ligase 1 isoform X1 [Benincasa hispida]    3.4e-104    63.87
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK------KSKDKKHKSKEHKEKSSRSRDLSDRKR----KEAKDLPKEAKV
        MSRCFPYPPPGY RKVA TEAALIESIKL SERR+S+ + KKEKSKHKKEK      +SKDKKHKSKE KEKSS SRDL+D+K+    KEAK+L K  KV
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK------KSKDKKHKSKEHKEKSSRSRDLSDRKR----KEAKDLPKEAKV

Query:  EAEQLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKP
        EAEQLE+SGLTEEHGQPVWPQSPGYLSDG+QINHKRKRDASL  N+G KPGKIIRIKL  S     S GS+QTCSTSGRD S+DQKRDEN RG  IQQ  
Subjt:  EAEQLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKP

Query:  CLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS
          T A TA +V DP S                                 SKPKI+                      +VKDI + +GNV  VSLPPRT+S
Subjt:  CLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS

Query:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSC-RSLSLWPRGQYLPDADVYSLPYTIPF
        PAESAYEALFEKWV PPLQLEQQ +DE+WLFG  ++Q GQ+  TNNKAFS VPSC RS SLWPRGQYL DADVYSLPYTIPF
Subjt:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSC-RSLSLWPRGQYLPDADVYSLPYTIPF

TrEMBL top hits    e value    %identity    Alignment
A0A6J1CT76 chromatin assembly factor 1 subunit A-like    5.1e-106    65
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK--KSKDKKHKSKEHKEKSSRSRDLSDRKRKE----AKDLPKEAKVEAEQ
        MSRCFPYPPPGYA KVARTEAALIESIKL SER++S+ +RKKEKSKH+KE+  KSK+KK + KE KEKSS S DL+D+K+KE    A+D  K  KVEAEQ
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEK--KSKDKKHKSKEHKEKSSRSRDLSDRKRKE----AKDLPKEAKVEAEQ

Query:  LEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCL
        LEKSGLTEEHGQPVWPQSPGYLSDG+QINHKRKRDA L PN+ SKPGKIIRIKL SSLS QE  S  +QQTCSTSGR   +DQKRDEN  GP  QQKPC 
Subjt:  LEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCL

Query:  TIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA
        T ++T  +V++   KP IKD S                                                  S HAVKDI   QGNV  V  P RT+SPA
Subjt:  TIADTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPA

Query:  ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTK-TNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        ES YEALFEKW+PPPLQLEQQM+DEEWLFGTRKQ  GQTTK T NKAFS VPSCRS SLWPRGQYLPDADVYSLPYTIPF
Subjt:  ESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTK-TNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

A0A6J1GHA3 uncharacterized protein LOC111454195 isoform X1    8.8e-98    61.52
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRS--RDLSDRKRKEAKDLPKEAKVEAEQLEKS
        MSRCFPYPPPGYARKVARTEAALIE IKL+SERRE  TERKKEKSKHKKE KSKDKKHKSKEH++KSSRS  R  +D+K+KE KDL K  KVEAEQLEKS
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRS--RDLSDRKRKEAKDLPKEAKVEAEQLEKS

Query:  GLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRD-SSLDQKRDENRRGPPIQQKPCLTIA
        GLTEEHGQPVWPQSPGYLSDG+Q NHKRKR AS+ PN+  KPGK+IRIKL SSLS+QE  S GS+QTCST+GR  + L Q RDEN    PI QK  LT  
Subjt:  GLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRD-SSLDQKRDENRRGPPIQQKPCLTIA

Query:  DTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDP-----ISKPKIEVPSLH-AVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTK
        D              +D S   V  P+ K  +        ++      + K  + +P    AVKDPISKP IK    HAV DIGTH           R +
Subjt:  DTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDP-----ISKPKIEVPSLH-AVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTK

Query:  SPAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
         P++SAYE LF+KWVPP LQL QQ ++EEWLFG +KQ   + TKT N+AFSH P CRS SLWPRGQ++P+ADVY LPYTIPF
Subjt:  SPAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

A0A6J1KAB9 uncharacterized protein LOC111492505 isoform X2    2.4e-103    64.04
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE
        MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS   KE KEKSSRSRDL+D+K     KEAKD  +  KVEAE
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE

Query:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC
        QLEKSGLTEEHGQPVWP SPGYLSDG+QINHKRKRD SL P++    GK+IRIKL SSLS+QE  S G + TCS SGRD S DQK DEN     +++  C
Subjt:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC

Query:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS
           ++TA +VKD   SKP IKDP  HAVKD  +                                 SKPKIK PS HAVK+I +  GNV S+   PRT+S
Subjt:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS

Query:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCR+ SLWPRGQYL  ADVYSLPYTIP+
Subjt:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

A0A6J1KC15 uncharacterized protein LOC111492505 isoform X1    6.1e-107    64.83
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE
        MSRCFPYPPPGY RKVARTEAALIESIKL SERR+ +T+ KKEKSKHKKE KSKD+KHKS   KE KEKSSRSRDL+D+K     KEAKD  +  KVEAE
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKS---KEHKEKSSRSRDLSDRKR----KEAKDLPKEAKVEAE

Query:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC
        QLEKSGLTEEHGQPVWP SPGYLSDG+QINHKRKRD SL P++G KPGK+IRIKL SSLS+QE  S G + TCS SGRD S DQK DEN     +++  C
Subjt:  QLEKSGLTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPC

Query:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS
           ++TA +VKD   SKP IKDP  HAVKD  +                                 SKPKIK PS HAVK+I +  GNV S+   PRT+S
Subjt:  LTIADTAGSVKD-PISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS

Query:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        P ESAYEALFEKWVPPPLQLEQQM+DEEWLF T KQ  G++TKT N+AFS +PSCR+ SLWPRGQYL  ADVYSLPYTIP+
Subjt:  PAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

A0A6J1KTG3 uncharacterized protein LOC111496256    1.6e-99    61.87
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRS-RDLSDRKRKEAKDLPKEAKVEAEQLEKSG
        MSRCFPYPPPGYARKVARTEAALIESIKL+SERRE  TERKKEKSKHKKE KSKDKKHKSKEH++KSSRS RD +D+K+KE KDL +  KVEAEQLEKSG
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRS-RDLSDRKRKEAKDLPKEAKVEAEQLEKSG

Query:  LTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIADT
        LTEEHGQPVWPQSPGYLSDG+Q NHKRKR ASL PN+  KPGK+IRIKL SSLS+QE  S GS+QTCST+GR +SL Q RDEN    PI QK   T  D 
Subjt:  LTEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQE--SVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIADT

Query:  AGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEV-PSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESAY
           V++    P+++  S  ++         K   +   +  + K  + +  +  AV+DPIS+P IK    HAV    TH           R + P+ESAY
Subjt:  AGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEV-PSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESAY

Query:  EALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        E LF+KWVPP LQL QQ  DEEWLFGT+KQ   + TKT N+AFSH P CR  SLWPRGQ++P+ADVY LPYTIPF
Subjt:  EALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

SwissProt top hits    e value    %identity    Alignment
No hits found

Arabidopsis top hits    e value    %identity    Alignment
AT1G20100.1 unknown protein    1.5e-12    29.21
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGL
        MSR F  PPP YAR  A  +  L+E  K+     +S+   +KEK + KKEKK K K+ KS E K  ++                 K    E+EQLEKS L
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGL

Query:  TEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIADTAGS
        TEE  QP      GYLSDGSQ + KR+R+ S        P  +                                  +   +  P+  KP          
Subjt:  TEEHGQPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIADTAGS

Query:  VKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKP-KIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS---PAESAY
        ++    KP  K+  +   +DP       V S+   + P   P  + +PS       I    + VPS+          G VA +S   + K      ES Y
Subjt:  VKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKP-KIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKS---PAESAY

Query:  EALFEKWVPPPLQLEQ-QMNDEEWLFGT-RKQQVG---QTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
         +LF++ VPP + LE+   + ++WLFGT RK+ V     + KT+      + + R  S  PR   L +  ++SLPYT+PF
Subjt:  EALFEKWVPPPLQLEQ-QMNDEEWLFGT-RKQQVG---QTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

AT1G20100.2 unknown protein    1.7e-05    37.71
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGL
        MSR F  PPP YAR  A  +  L+E  K+     +S+   +KEK + KKEKK K K+ KS E K  ++                 K    E+EQLEKS L
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGL

Query:  TEEHGQPVWPQSPGYLSDGSQINHKRKRDAS--LLPNQ---GSKPGKIIRIKLGSSLSRQESVGSQQ--TCSTSG
        TEE  QP      GYLSDGSQ + KR+R+ S  ++ +Q       GK +RI++     ++     Q+   CSTSG
Subjt:  TEEHGQPVWPQSPGYLSDGSQINHKRKRDAS--LLPNQ---GSKPGKIIRIKLGSSLSRQESVGSQQ--TCSTSG

AT1G75860.1 unknown protein    6.4e-08    26.65
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGL
        MSR    PP  +AR     +  L+ES KL     +S+   + EK + K+++K K +  + K HK     S   +D   K      K+   E++ LEKSGL
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGL

Query:  TEEHGQPVWPQSPGYLSDGSQINHKRKRDAS-----LLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIA
        T+E  +P   +  GYLSDGSQ + KR RD S      L       GK +RI++                        + +K  E     P +   C T  
Subjt:  TEEHGQPVWPQSPGYLSDGSQINHKRKRDAS-----LLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIA

Query:  DTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESA
          + S +D I+  +    +S   K+         S+S A  D   K K                                           + +S  E  
Subjt:  DTAGSVKDPISKPVIKDPSSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESA

Query:  YEALFEKWVPPPL---QLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
        Y ALF+ W PP +         N + WLFG + Q+V    K   K           S WPR Q+L +  +YSLPYT+PF
Subjt:  YEALFEKWVPPPL---QLEQQMNDEEWLFGTRKQQVGQTTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

AT4G35940.1 unknown protein    3.9e-05    24.89
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSER---------RESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLP---KEA
        MSRCFP+PPPGY     R EA ++ SIK V E+         R S  + KK+K + K++K+ K+KK K +E KE  S  R    R++++   +    K  
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSER---------RESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLP---KEA

Query:  KVEAEQLEKSGLT---------------------------EEHGQP-------------VWPQSPGYLSDGSQINHKRKRDASLLPNQG-SKPGKIIRIK
        + E   LEKS LT                           +E  QP             V  Q P    DG   N+  KR     P  G        RI+
Subjt:  KVEAEQLEKSGLT---------------------------EEHGQP-------------VWPQSPGYLSDGSQINHKRKRDASLLPNQG-SKPGKIIRIK

Query:  LGSSLSRQESVGSQQTCS----TSGRDSSLDQKRDENR---------------RGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKP---
            L+ + +  +++        +GR ++ ++KR E +               +  P+  +     +++   ++ PI +   KDP          KP   
Subjt:  LGSSLSRQESVGSQQTCS----TSGRDSSLDQKRDENR---------------RGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKDPIAKP---

Query:  -KIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAES----------AYEALFEKWVPPPLQLEQQM
          IK+ SS     P  +P  + P   +  +  SKP+ +       +  G H     S S    T  P +S           +  + E WVP  ++    +
Subjt:  -KIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAES----------AYEALFEKWVPPPLQLEQQM

Query:  ---NDEEWLFGTRKQQVGQT--TKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
            DEE  +  +K     T   K  N+  + +    +   WP  + LP+ADVY+LPYT+PF
Subjt:  ---NDEEWLFGTRKQQVGQT--TKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF

AT4G35940.2 unknown protein    2.7e-06    26.24
Query:  MSRCFPYPPPGYARKVARTEAALIESIKLVSER---------RESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLP---KEA
        MSRCFP+PPPGY     R EA ++ SIK V E+         R S  + KK+K + K++K+ K+KK K +E KE  S  R    R++++   +    K  
Subjt:  MSRCFPYPPPGYARKVARTEAALIESIKLVSER---------RESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLP---KEA

Query:  KVEAEQLEKSGLTEEHG--QPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSL------SRQESVGSQQTCSTSGRDSSLDQKRDE-
        + E   LEKS LT E    Q     S     + +++  K+K     L  + +      R++    L      + ++ V  QQ     GR ++ ++KR E 
Subjt:  KVEAEQLEKSGLTEEHG--QPVWPQSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSL------SRQESVGSQQTCSTSGRDSSLDQKRDE-

Query:  ----NRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDP---SSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDI
            N R     +K         G   +   K + K       H  K+   + +  +   H   D  S      P     KDPI + K       +    
Subjt:  ----NRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDP---SSHAVKDPIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDI

Query:  GTHQGNVASVSLPPRTKSPAESAYEALFEKWVPPPLQLEQQM---NDEEWLFGTRKQQVGQT--TKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPY
         T+Q   +  + PP         +  + E WVP  ++    +    DEE  +  +K     T   K  N+  + +    +   WP  + LP+ADVY+LPY
Subjt:  GTHQGNVASVSLPPRTKSPAESAYEALFEKWVPPPLQLEQQM---NDEEWLFGTRKQQVGQT--TKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPY

Query:  TIPF
        T+PF
Subjt:  TIPF


Sequences
CDS sequence
ATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGCGAGGAAGGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGCTCGTATCTGAAAGACGAGAAAGCAG
GACTGAGAGAAAGAAAGAGAAAAGTAAGCACAAGAAAGAGAAGAAGAGTAAGGACAAGAAACACAAAAGCAAAGAACATAAAGAAAAATCTTCTCGTAGCCGTGACTTGA
GTGATCGGAAACGAAAGGAAGCCAAGGACCTTCCAAAGGAAGCCAAAGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCT
CAAAGCCCTGGCTACTTGTCTGACGGATCTCAGATCAACCACAAGAGGAAAAGGGATGCTTCATTACTGCCTAATCAAGGTTCTAAACCTGGAAAAATCATACGGATCAA
ACTGGGTTCTTCACTAAGCCGGCAAGAGTCAGTTGGCAGCCAACAAACGTGTTCTACATCTGGTCGTGATAGTTCTCTTGATCAAAAGAGAGATGAGAACAGACGTGGAC
CACCCATTCAGCAAAAACCTTGCCTCACAATTGCTGACACAGCTGGTTCTGTCAAGGATCCCATTTCTAAACCTGTGATCAAAGACCCTTCTTCGCATGCTGTCAAGGAT
CCCATTGCTAAACCTAAGATCAAAGTCTCTTCTTCGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCGAAGTCCCTTCTTTGCATGCTGTCAAGGATCCCATTTC
TAAACCTAAGATCAAAGTCCCTTCTTCGCATGCTGTCAAGGACATTGGTACTCATCAAGGTAATGTTGCGTCAGTGTCACTACCTCCCCGCACAAAAAGTCCTGCTGAAT
CTGCTTATGAGGCCTTATTTGAGAAGTGGGTACCACCTCCACTTCAGTTGGAGCAACAAATGAATGATGAAGAATGGCTCTTTGGAACAAGAAAACAGCAAGTGGGACAA
ACTACGAAGACCAACAACAAAGCTTTCAGTCATGTTCCCAGCTGTAGAAGTTTGAGTCTGTGGCCAAGAGGACAATATCTGCCGGATGCGGATGTTTATTCATTGCCTTA
TACGATCCCATTTTGA
mRNA sequence
GCGTGAGAGGCATTTAGGTTTTCCAATCTGCCGCTTCATCGAAAGAGAAGTCTTCTTCGTTCTCCGCTAGGGTTTTCCATCGTGATCGTCTTTCCCTTCTGTTTTCTCAA
ATCCGAACTGCTTTTTCCTCTGTTTTCTTATAAATCTTCGACGAATCCTTCCCATAATAGACACAGATGTCTCGTTGCTTTCCTTACCCACCTCCTGGTTACGCGAGGAA
GGTGGCTAGGACCGAGGCGGCCTTGATCGAATCGATTAAGCTCGTATCTGAAAGACGAGAAAGCAGGACTGAGAGAAAGAAAGAGAAAAGTAAGCACAAGAAAGAGAAGA
AGAGTAAGGACAAGAAACACAAAAGCAAAGAACATAAAGAAAAATCTTCTCGTAGCCGTGACTTGAGTGATCGGAAACGAAAGGAAGCCAAGGACCTTCCAAAGGAAGCC
AAAGTTGAAGCAGAACAATTAGAAAAGAGTGGTCTCACTGAAGAGCATGGACAACCAGTATGGCCTCAAAGCCCTGGCTACTTGTCTGACGGATCTCAGATCAACCACAA
GAGGAAAAGGGATGCTTCATTACTGCCTAATCAAGGTTCTAAACCTGGAAAAATCATACGGATCAAACTGGGTTCTTCACTAAGCCGGCAAGAGTCAGTTGGCAGCCAAC
AAACGTGTTCTACATCTGGTCGTGATAGTTCTCTTGATCAAAAGAGAGATGAGAACAGACGTGGACCACCCATTCAGCAAAAACCTTGCCTCACAATTGCTGACACAGCT
GGTTCTGTCAAGGATCCCATTTCTAAACCTGTGATCAAAGACCCTTCTTCGCATGCTGTCAAGGATCCCATTGCTAAACCTAAGATCAAAGTCTCTTCTTCGCATGCTGT
CAAGGATCCCATTTCTAAACCTAAGATCGAAGTCCCTTCTTTGCATGCTGTCAAGGATCCCATTTCTAAACCTAAGATCAAAGTCCCTTCTTCGCATGCTGTCAAGGACA
TTGGTACTCATCAAGGTAATGTTGCGTCAGTGTCACTACCTCCCCGCACAAAAAGTCCTGCTGAATCTGCTTATGAGGCCTTATTTGAGAAGTGGGTACCACCTCCACTT
CAGTTGGAGCAACAAATGAATGATGAAGAATGGCTCTTTGGAACAAGAAAACAGCAAGTGGGACAAACTACGAAGACCAACAACAAAGCTTTCAGTCATGTTCCCAGCTG
TAGAAGTTTGAGTCTGTGGCCAAGAGGACAATATCTGCCGGATGCGGATGTTTATTCATTGCCTTATACGATCCCATTTTGATTTCGAATTCTTTTCTTCAGAGAGAGTA
ATATGTACAGTACTACGCCAGTAGGAACAATCTTTTTCCTTGGCGTTATAATTGTTTTAGAATCAATATAATTGTCATTCAATTTTGTTC
Protein sequence
MSRCFPYPPPGYARKVARTEAALIESIKLVSERRESRTERKKEKSKHKKEKKSKDKKHKSKEHKEKSSRSRDLSDRKRKEAKDLPKEAKVEAEQLEKSGLTEEHGQPVWP
QSPGYLSDGSQINHKRKRDASLLPNQGSKPGKIIRIKLGSSLSRQESVGSQQTCSTSGRDSSLDQKRDENRRGPPIQQKPCLTIADTAGSVKDPISKPVIKDPSSHAVKD
PIAKPKIKVSSSHAVKDPISKPKIEVPSLHAVKDPISKPKIKVPSSHAVKDIGTHQGNVASVSLPPRTKSPAESAYEALFEKWVPPPLQLEQQMNDEEWLFGTRKQQVGQ
TTKTNNKAFSHVPSCRSLSLWPRGQYLPDADVYSLPYTIPF
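
The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard genetic code; the page does not name a translation table. `translate` is a hypothetical helper written for illustration, and the two fragments are the first 18 and last 21 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


cds_start = "ATGTCTCGTTGCTTTCCT"     # first 18 nt of the CDS
cds_end = "CCTTATACGATCCCATTTTGA"    # last 21 nt of the CDS, including the TGA stop
print(translate(cds_start))
print(translate(cds_end))
```

The two fragments translate to MSRCFP and PYTIPF, which match the first and last six residues of the protein sequence. Passing the full CDS, with its lines joined, to `translate` would extend the same check to the whole protein.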