; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg035907 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg035907
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptiontwinkle homolog protein, chloroplastic/mitochondrial isoform X1
Genome locationscaffold5:37733017..37754369
RNA-Seq ExpressionSpg035907
SyntenySpg035907
Gene Ontology termsGO:0006260 - DNA replication (biological process)
GO:0032508 - DNA duplex unwinding (biological process)
GO:0003697 - single-stranded DNA binding (molecular function)
GO:0005524 - ATP binding (molecular function)
GO:0043139 - 5'-3' DNA helicase activity (molecular function)
InterPro domainsIPR006171 - TOPRIM domain
IPR025558 - Domain of unknown function DUF4283
IPR026960 - Reverse transcriptase zinc-binding domain
IPR027032 - Twinkle-like protein
IPR034154 - Archaeal primase DnaG/twinkle, TOPRIM domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7019451.1 Twinkle-like protein, chloroplastic/mitochondrial, partial [Cucurbita argyrosperma subsp. argyrosperma]2.6e-18976.69Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPP
        MRFLHHN CLYS FSKLSSLSSSF LMGS  LCKS+SL FLSP+    SSSSSQR FLY+SN +LH SFPV+PMS GK FSMKPNGVSSFTS ANVPRPP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPP

Query:  AFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFE
         FMENPL EAL  T+LN+L+KKL+ELDID E CVPGQTNHLLCPMCKGGDSGER+ SL ISEDGGAAVWMCFRAKCGWKGRTL                 
Subjt:  AFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFE

Query:  PLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREG
                 AFADGRSS++  GQ+TL QK   KRKITVESLQLEPLCDELVAYFAERLISK TLLRNSVMQKRSNNQ                       
Subjt:  PLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREG

Query:  EVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLW
            I+IAFTY RRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDG SDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPS SQKDVPP DQDTKYQYLW
Subjt:  EVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLW

Query:  NCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        NCKDYLSKASRIILATDGD PG ALAEEIARRVGRERCWRVKWPKKNE +HFKDANE L
Subjt:  NCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

XP_022927084.1 twinkle homolog protein, chloroplastic/mitochondrial isoform X1 [Cucurbita moschata]3.4e-18976.52Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP
        MRFLHHN CLYS FSKLSSLSSSF LMGS  LCKS+SL FLSP+     SSSSSQR FLY+SN +LH SFPV+PMS GK FSMKPNGVSSFTS ANVPRP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP

Query:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF
        P FMENPL EAL  T+LN+L+KKL+ELDID E CVPGQTNHLLCPMCKGGDSGER+ SL ISEDGGAAVWMCFRAKCGWKGRTL                
Subjt:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF

Query:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE
                  AFADGRSS++  GQ+TL QK   KRKITVESLQLEPLCDELVAYFAERLISK TLLRNSVMQKRSNNQ                      
Subjt:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE

Query:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL
             I+IAFTY RRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDG SDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPS SQKDVPP DQDTKYQYL
Subjt:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL

Query:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        WNCKDYLSKASRIILATDGD PG ALAEEIARRVGRERCWRVKWPKKNE +HFKDANE L
Subjt:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

XP_022927085.1 twinkle homolog protein, chloroplastic/mitochondrial isoform X2 [Cucurbita moschata]3.4e-18976.52Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP
        MRFLHHN CLYS FSKLSSLSSSF LMGS  LCKS+SL FLSP+     SSSSSQR FLY+SN +LH SFPV+PMS GK FSMKPNGVSSFTS ANVPRP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP

Query:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF
        P FMENPL EAL  T+LN+L+KKL+ELDID E CVPGQTNHLLCPMCKGGDSGER+ SL ISEDGGAAVWMCFRAKCGWKGRTL                
Subjt:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF

Query:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE
                  AFADGRSS++  GQ+TL QK   KRKITVESLQLEPLCDELVAYFAERLISK TLLRNSVMQKRSNNQ                      
Subjt:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE

Query:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL
             I+IAFTY RRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDG SDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPS SQKDVPP DQDTKYQYL
Subjt:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL

Query:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        WNCKDYLSKASRIILATDGD PG ALAEEIARRVGRERCWRVKWPKKNE +HFKDANE L
Subjt:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

XP_023000969.1 twinkle homolog protein, chloroplastic/mitochondrial [Cucurbita maxima]1.5e-18976.74Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP
        MRFLHHN CL S FSKLSSLSSSF LMGS  LCKS+SL FLSP+     SSSSSQR FLY+SN +LH SFPV+PMS GK FSMKPNGVSSFTS ANVPRP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP

Query:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF
        PAFMENPL EAL  T+LN+L+KKL+ELDID E CVPGQTNHLLCPMCKGGDSGER+ SL ISEDGGAAVWMCFRAKCGWKGRTL                
Subjt:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF

Query:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE
                  AFADGR S++S GQ+TL QK   KRKITVESLQLEPLCDELVAYFAERLISK TLLRNSVMQKRSNNQ                      
Subjt:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE

Query:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL
             I+IAFTY RRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDG SDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPS SQKDVPP DQDTKYQYL
Subjt:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL

Query:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        WNCKDYLSKASRIILATDGDPPG ALAEEIARRVGRERCWRVKWPKKNE +HFKDANE L
Subjt:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

XP_038894379.1 twinkle homolog protein, chloroplastic/mitochondrial [Benincasa hispida]1.7e-19377.73Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPISSSSSQ---RCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPPA
        MRFLHHN CLY+PFS LSS SSSFNLMG+IPLCKSTSL  LS +SSSSS    + FLYRSNPLLH  FPV+PMS  KPFSMKPNGVSSFTS +NVP PPA
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPISSSSSQ---RCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPPA

Query:  FMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFEP
        F+ENPLDEAL ST LNVLRKKL+ELDIDTESCVPGQTNHLLCPMCKGGDSGER FSL+ISEDGGAA+WMCFRAKCGWKGRTL                  
Subjt:  FMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFEP

Query:  LTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREGE
                AFADG SSYR+LGQV L    QNKRKITVESLQLEPLCDELVAYFAERLISK+TLLRNSVMQKR NNQ                        
Subjt:  LTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREGE

Query:  VFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLWN
           I IAFTYYR GALISCKYRDVNKKFWQEANTEKIFYGLDDI GTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVS+ DVPP DQD KYQYLWN
Subjt:  VFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLWN

Query:  CKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        CKDYL+KASRIILATDGDPPG ALAEEIARRVGRERCWRVKWPKKNEVDHFKDANE L
Subjt:  CKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

TrEMBL top hitse value%identityAlignment
A0A0A0LYM8 Uncharacterized protein2.7e-17671.83Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI---SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPPA
        MRFLH+NHCLY+PFSKLSS SS   LMGS PLCKSTSL FLS +   SSSSSQ+ FLYRS  LLH SFPV+P+S  KPF+MKPNGVSSFTS ANVPRPPA
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI---SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPPA

Query:  FMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFEP
         +ENP D+A  ST+LN+LRKKL++LDID E+CVPGQ   LLCPMCKGGDS ERSFSL ISEDGGAAVW CFR KCGWKG TL                  
Subjt:  FMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFEP

Query:  LTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREGE
                AF DGRSSY+ LGQV L Q I   RKITVESLQLEPLCDELV YFAERLISK TLLRNSVMQKRS+NQ                        
Subjt:  LTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREGE

Query:  VFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLWN
           IA+AFTYYR GALISCKYRD NKKFWQE NTE+IFYG+DDIDG SDIIIVEGE+DKLSMAEAG HNCVSVPDGAP SVS+KDVPP D+D K+Q+LWN
Subjt:  VFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLWN

Query:  CKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        CKDYL+KASRIILATDGD PG ALAEEIARRVGRERCWRVKWPKKNEVDHFKDANE L
Subjt:  CKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

A0A6J1CLF6 twinkle homolog protein, chloroplastic/mitochondrial isoform X13.2e-18576.2Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPISSS---SSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPPA
        MRFLHHN C +SPF+KLSSLSSSFNLMGSIPLCKSTSL FLS ISSS   SSQR FLYR+N LLH SFPVQ MS  K FSMK NGVS FTS ANVP PP 
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPISSS---SSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPPA

Query:  FMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFEP
              DE L STQLNVLRKKLEEL+++TESCVPGQTNHLLCPMCKGGDSGERS SL+ISEDGGAAVW+CFRAKCGWKGRTL                  
Subjt:  FMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFEP

Query:  LTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREGE
                AFADGRSSY SLGQV LN+K   KRKITVESLQLEPLCDELVAYFAERLISK+TLLRNSVMQKRS+NQ                        
Subjt:  LTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREGE

Query:  VFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLWN
           IAIAFTY+R G L+SCKYRDVNKKFWQEANTEKIFYGLD IDG SDIIIVEGE+DKLSM EAGFHNCVSVPDGAPPSVSQKDVPPTD+DTKYQYLWN
Subjt:  VFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLWN

Query:  CKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        CK+YLSKASRIILATDGDPPG ALAEEIARRVGRERCWRVKWPKKNEVDHFKDANE L
Subjt:  CKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

A0A6J1EH06 twinkle homolog protein, chloroplastic/mitochondrial isoform X11.6e-18976.52Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP
        MRFLHHN CLYS FSKLSSLSSSF LMGS  LCKS+SL FLSP+     SSSSSQR FLY+SN +LH SFPV+PMS GK FSMKPNGVSSFTS ANVPRP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP

Query:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF
        P FMENPL EAL  T+LN+L+KKL+ELDID E CVPGQTNHLLCPMCKGGDSGER+ SL ISEDGGAAVWMCFRAKCGWKGRTL                
Subjt:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF

Query:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE
                  AFADGRSS++  GQ+TL QK   KRKITVESLQLEPLCDELVAYFAERLISK TLLRNSVMQKRSNNQ                      
Subjt:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE

Query:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL
             I+IAFTY RRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDG SDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPS SQKDVPP DQDTKYQYL
Subjt:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL

Query:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        WNCKDYLSKASRIILATDGD PG ALAEEIARRVGRERCWRVKWPKKNE +HFKDANE L
Subjt:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

A0A6J1EMW1 twinkle homolog protein, chloroplastic/mitochondrial isoform X21.6e-18976.52Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP
        MRFLHHN CLYS FSKLSSLSSSF LMGS  LCKS+SL FLSP+     SSSSSQR FLY+SN +LH SFPV+PMS GK FSMKPNGVSSFTS ANVPRP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP

Query:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF
        P FMENPL EAL  T+LN+L+KKL+ELDID E CVPGQTNHLLCPMCKGGDSGER+ SL ISEDGGAAVWMCFRAKCGWKGRTL                
Subjt:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF

Query:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE
                  AFADGRSS++  GQ+TL QK   KRKITVESLQLEPLCDELVAYFAERLISK TLLRNSVMQKRSNNQ                      
Subjt:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE

Query:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL
             I+IAFTY RRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDG SDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPS SQKDVPP DQDTKYQYL
Subjt:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL

Query:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        WNCKDYLSKASRIILATDGD PG ALAEEIARRVGRERCWRVKWPKKNE +HFKDANE L
Subjt:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

A0A6J1KLG2 twinkle homolog protein, chloroplastic/mitochondrial7.3e-19076.74Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP
        MRFLHHN CL S FSKLSSLSSSF LMGS  LCKS+SL FLSP+     SSSSSQR FLY+SN +LH SFPV+PMS GK FSMKPNGVSSFTS ANVPRP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPI-----SSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRP

Query:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF
        PAFMENPL EAL  T+LN+L+KKL+ELDID E CVPGQTNHLLCPMCKGGDSGER+ SL ISEDGGAAVWMCFRAKCGWKGRTL                
Subjt:  PAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHF

Query:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE
                  AFADGR S++S GQ+TL QK   KRKITVESLQLEPLCDELVAYFAERLISK TLLRNSVMQKRSNNQ                      
Subjt:  EPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFRE

Query:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL
             I+IAFTY RRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDG SDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPS SQKDVPP DQDTKYQYL
Subjt:  GEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYL

Query:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        WNCKDYLSKASRIILATDGDPPG ALAEEIARRVGRERCWRVKWPKKNE +HFKDANE L
Subjt:  WNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

SwissProt top hitse value%identityAlignment
B5X582 Twinkle homolog protein, chloroplastic/mitochondrial8.0e-10146.97Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFN-LMGS---IPLCKSTSLAFLSPISSSSSQRCFLYRSNPLLHRSFPV---QPMSHGKPFSMKPNGVSSFTSDANVP
        MRFL     L  P      LS S + LMGS   +  C   S A      S SS R    + + +  R  PV   +P+S   P+  + NG+SS+ S   VP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFN-LMGS---IPLCKSTSLAFLSPISSSSSQRCFLYRSNPLLHRSFPV---QPMSHGKPFSMKPNGVSSFTSDANVP

Query:  RPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEV
          P   E   D+ ++ ++L  LR+KL E  +D E+C PGQ + L+CP C+GG+SGE+S SLFI+ DG +A W CFR KCG KG                 
Subjt:  RPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEV

Query:  HFEPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPF
                      ADG      L      +K++  RKITVE ++LEPLCDE+  YFA R IS+ TL RN VMQKR  ++                    
Subjt:  HFEPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPF

Query:  REGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQ
               I IAFTY++RG L+SCKYR + K F+QE  T +I YGLDDI+ TS++IIVEGEIDKL+M EAGF NCVSVPDGAP  VS K++P  D+DTKY+
Subjt:  REGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQ

Query:  YLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        +LWNC DYL KASRI++ATDGD PG A+AEEIARR+G+ERCWRVKWPKK+E +HFKDANE L
Subjt:  YLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

F4I6E6 Primase homolog protein1.4e-8948.58Show/hide
Query:  MSHGKPFSMKPNGVSSFTSDANVPRPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFR
        MS   P     NG SS  SD  VP      E    + ++ ++L  L +KL E  ID ++C PG  + L+CP C+ GDSGE+S +L+I  DG +A W C R
Subjt:  MSHGKPFSMKPNGVSSFTSDANVPRPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFR

Query:  AKCGWKGRTLVLIIIQFTMFLFEVHFEPLTGALSPRAFADGR-SSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQK
         KCG KG      ++Q                       DG+  S   +G+V        +RKITVES++LEPLCDE+  +FA R IS  TL RN VMQK
Subjt:  AKCGWKGRTLVLIIIQFTMFLFEVHFEPLTGALSPRAFADGR-SSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQK

Query:  RSNNQARPFTDCREMIQEFLLHPPFREGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCV
        R +++                           I IAFTY++RG L+SCKYR + KKF QE NT KI YGLDDI+ TS+IIIVEGE DKL+M EAGF NCV
Subjt:  RSNNQARPFTDCREMIQEFLLHPPFREGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCV

Query:  SVPDGAPPSVSQKDVPPTDQDTKYQYLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        SVPDGAP +VS K++P   +DT ++Y+WNC DYL KASRI++ATDGD PG ALAEE+ARR+G+ERCW VKWPKK+E +HFKDANE L
Subjt:  SVPDGAPPSVSQKDVPPTDQDTKYQYLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

Arabidopsis top hitse value%identityAlignment
AT1G30660.1 nucleic acid binding;nucleic acid binding1.0e-9048.58Show/hide
Query:  MSHGKPFSMKPNGVSSFTSDANVPRPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFR
        MS   P     NG SS  SD  VP      E    + ++ ++L  L +KL E  ID ++C PG  + L+CP C+ GDSGE+S +L+I  DG +A W C R
Subjt:  MSHGKPFSMKPNGVSSFTSDANVPRPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFR

Query:  AKCGWKGRTLVLIIIQFTMFLFEVHFEPLTGALSPRAFADGR-SSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQK
         KCG KG      ++Q                       DG+  S   +G+V        +RKITVES++LEPLCDE+  +FA R IS  TL RN VMQK
Subjt:  AKCGWKGRTLVLIIIQFTMFLFEVHFEPLTGALSPRAFADGR-SSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQK

Query:  RSNNQARPFTDCREMIQEFLLHPPFREGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCV
        R +++                           I IAFTY++RG L+SCKYR + KKF QE NT KI YGLDDI+ TS+IIIVEGE DKL+M EAGF NCV
Subjt:  RSNNQARPFTDCREMIQEFLLHPPFREGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCV

Query:  SVPDGAPPSVSQKDVPPTDQDTKYQYLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        SVPDGAP +VS K++P   +DT ++Y+WNC DYL KASRI++ATDGD PG ALAEE+ARR+G+ERCW VKWPKK+E +HFKDANE L
Subjt:  SVPDGAPPSVSQKDVPPTDQDTKYQYLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

AT1G30680.1 toprim domain-containing protein5.7e-10246.97Show/hide
Query:  MRFLHHNHCLYSPFSKLSSLSSSFN-LMGS---IPLCKSTSLAFLSPISSSSSQRCFLYRSNPLLHRSFPV---QPMSHGKPFSMKPNGVSSFTSDANVP
        MRFL     L  P      LS S + LMGS   +  C   S A      S SS R    + + +  R  PV   +P+S   P+  + NG+SS+ S   VP
Subjt:  MRFLHHNHCLYSPFSKLSSLSSSFN-LMGS---IPLCKSTSLAFLSPISSSSSQRCFLYRSNPLLHRSFPV---QPMSHGKPFSMKPNGVSSFTSDANVP

Query:  RPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEV
          P   E   D+ ++ ++L  LR+KL E  +D E+C PGQ + L+CP C+GG+SGE+S SLFI+ DG +A W CFR KCG KG                 
Subjt:  RPPAFMENPLDEALISTQLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEV

Query:  HFEPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPF
                      ADG      L      +K++  RKITVE ++LEPLCDE+  YFA R IS+ TL RN VMQKR  ++                    
Subjt:  HFEPLTGALSPRAFADGRSSYRSLGQVTLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPF

Query:  REGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQ
               I IAFTY++RG L+SCKYR + K F+QE  T +I YGLDDI+ TS++IIVEGEIDKL+M EAGF NCVSVPDGAP  VS K++P  D+DTKY+
Subjt:  REGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEANTEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQ

Query:  YLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL
        +LWNC DYL KASRI++ATDGD PG A+AEEIARR+G+ERCWRVKWPKK+E +HFKDANE L
Subjt:  YLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWPKKNEVDHFKDANEKL

AT2G02650.1 Ribonuclease H-like superfamily protein7.7e-0625.55Show/hide
Query:  LEAGLWKSKSPKRINILLWIMLNGNLNTSEVLQKKMPTHCILPSVCVLCLRDEDSLNHIFFSCSYAKSGWFKLFSIFNQHWVFSNNFRCNIYQLLYGLAL
        ++  +WK     +I   LW  + G L T+  L+ +   +     +C  C  +E++++HI F+C Y +S W     I    W   ++F  N+ +L+     
Subjt:  LEAGLWKSKSPKRINILLWIMLNGNLNTSEVLQKKMPTHCILPSVCVLCLRDEDSLNHIFFSCSYAKSGWFKLFSIFNQHWVFSNNFRCNIYQLLYGLAL

Query:  LNSKAKLLWINAVKAFLS-----ELWLQRNQRFFQNK
            +K    N++  FL       LW  RN   FQ K
Subjt:  LNSKAKLLWINAVKAFLS-----ELWLQRNQRFFQNK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGTTTTCTTCATCACAATCACTGTCTATATAGCCCCTTCTCCAAGCTCTCTTCCCTTTCCTCCTCGTTTAATCTAATGGGTTCTATTCCTTTATGCAAGTCCACTTC
TCTGGCATTCCTGTCACCCATTTCTTCTTCTTCCTCACAGAGGTGTTTCCTGTACAGAAGCAATCCTCTTCTCCATAGGTCGTTTCCTGTACAACCCATGTCCCATGGCA
AACCCTTTTCAATGAAACCAAATGGGGTTTCTTCTTTTACTTCTGATGCCAATGTTCCCAGACCTCCAGCCTTTATGGAGAATCCACTAGACGAGGCTTTAATCTCGACT
CAATTGAATGTTCTGAGAAAAAAATTGGAGGAACTTGACATTGACACCGAATCTTGTGTTCCAGGGCAGACAAACCACTTGCTTTGTCCAATGTGCAAAGGTGGCGATTC
AGGTGAACGGTCCTTCTCCCTATTTATCTCAGAAGATGGGGGGGCTGCTGTTTGGATGTGTTTTCGTGCAAAGTGTGGTTGGAAGGGCCGTACTCTGGTATTAATCATAA
TTCAATTTACTATGTTCCTATTTGAGGTTCATTTTGAACCTTTAACTGGCGCCCTTTCACCTAGGGCCTTTGCTGATGGTCGGTCATCATATCGAAGTTTAGGACAAGTT
ACACTTAACCAGAAGATCCAGAACAAACGGAAAATTACAGTGGAGAGTCTACAACTTGAACCACTGTGCGATGAGTTGGTTGCTTATTTTGCTGAGCGATTGATCTCCAA
GAGCACATTGTTAAGAAATTCAGTTATGCAGAAAAGATCCAATAATCAGGCGAGACCCTTTACAGACTGTAGGGAGATGATCCAGGAGTTCCTCCTCCATCCGCCTTTTC
GCGAAGGGGAGGTTTTTATGATTGCTATTGCATTTACATATTATCGACGTGGAGCATTGATTAGTTGCAAGTATCGTGATGTCAACAAAAAGTTCTGGCAGGAGGCAAAT
ACTGAGAAAATATTTTATGGATTGGATGACATAGATGGCACAAGTGATATCATCATTGTTGAAGGGGAGATAGACAAGCTTTCAATGGCAGAAGCTGGTTTCCATAATTG
CGTGAGTGTTCCAGATGGTGCACCACCATCAGTTTCCCAAAAGGACGTACCTCCTACAGATCAGGATACGAAGTATCAGTATTTATGGAACTGCAAAGATTACTTGAGTA
AGGCATCACGCATTATCCTTGCTACTGATGGTGATCCTCCTGGTCACGCTTTAGCAGAGGAGATTGCACGTCGTGTTGGAAGGGAAAGATGTTGGAGGGTCAAATGGCCA
AAAAAAAATGAGGTCGATCATTTCAAAGATGCAAATGAGAAATTAAGAGATTCAAATGGAACAATTCAGTTATCAAAGTTCAATTCCCAGCAAGGTTGGTTCCTTGAATG
CTCTGTTTGGCCCCTCTATGGTGGAAAGAAAAGAGTGCAAGTTCCTGTTGGTTACGCCAAGAATGGTTGGTCAATTTTATGGGAAATGATAAGAGACTTTCTTTTGAAAT
TTGTGGAAACAAAGTATGTTGAGAATATTTCAAAGAAGTCTAATTATGAAGAATCATCCAAGCCGGTTATTAATACTAGTAAGAATTTGGATAGAAGTTATGCTGATGTA
GTGAAGGTTAAATCTGGGGAACCTCATCTTGGTTCTCATTCTAAGAAGCTGCCAGTAATGTCTTCATTTTGGGTTAGAAAAGAAAAAGAAGTGGTGGATTTAAAGCTAGA
TGAATTTTGTGTGGTATCTAGAATGTTTGCACATAATACTTGGAAGGAAGTAAAGCAAGTTTTGGAAGATTATTTTCATTCTAAAGTTTTACTCAACCCTTTTATGGCAG
ATAAAGCTTTGGTAAAACTGAATGATAGCTTTTCTGAGCTGAAGTTTGATGGCAAGTGGAAACTTATTGGAAAATTTCATTTGAAAATTGAAAATTGGTCGTGTCACAAG
CATTTTCATCCTGAGGTGATTGAAGGATATGGAGGTTGGATAGCTTTGAAGAACTTACCTTTGCCATTCTGGAATCGTTACATTTTTGAAATCATTGGACATCACTTTGG
TGGATTGGTTAGCATTTCTTCTCAAACATTGAATCTTTTGGATTGTTCAGAAGCTCGTATAGAGGTGAAAATGAATTTTTGTGGATTTCTACCAGTTGAAATTGTGGTTA
CGGACAAGATTCATGGAAACTTTGCTCTTCGTTTTGGTGATATATCTTCTTTGGACCCTCCACATTTTATTCCTATGGATTTATCATTGAGTGATTTTGATAATGAAATT
GATTTAAAAAGAGTTTCTCAGGTTATGATGGATGAAAGATTTTCCTCATCACAAGAAGATCTTAATTCCTTCAATGATCAAGGATTAAACCTTCTAGCCTTGCAAACTCG
GGTAAATCAAGAAAACTTTTTATCTTCTAAAGATCCCAACGACATTATTGATGATTCAATGGAAATATTGAATGATGGAGGTTCAAATTTAGAAAAGTCGGTTGAGTTAT
CAAAGGAAAAGGCTGAGTTATTACAAGAAAATGCCTTTAATGACTTGGTTTCACGTTCCAAGGAAGCATTAATTGACGATGTTAATTGTAATTTAATTGGGCCGGTTGAG
TTTTCAAAGGAGAAGAGTGCTTTGTTGCAAGAGAAGGATTTTAATGCCAACGGTAAGGTTATTAATGCCATCGTTTCAGATATTAGTGAAGCATTAACTAATGGAGCTTT
GCATGAGTCCCAGGGTTTATTATTCACGCCTATTCATGACCCACCTTTGGGTTTGAAGAGTTGTAATGCAGCTGGTTTGGAAGAAGATGAACCGATTGTTTCTAAGGCTT
TAAAGAAGCAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAAATATGAAAAGTCAGAAATTTTGGACTCAATTCCCATTAATTCCAATTATAATCCTGATGTTATT
GAAGTATCTTGTTCTCAATTTTTGCTCCCTGCTTTGAATCAGCCTAGGTGCTGTCAAACTAATCTTAATGAGTTATCAAATTCCACATCATCCAATCAGTACATTCTTTC
AAACATTCAATCTAACCTTTCTTTAACAAAGGGAGTTTTTATTCCTTCATCCAAAGTTGATCAATCATATTCATCTCCTATTGATTCCGATGATGATTCAGTGGTGAGTA
TTAGTAGTGTCGAGGCTGAAAATCAGTATTTGAATGATGAAAACAATGAATTATTGGAGGAAGACTCTTTTGCACTGGCTTTTAATCGGATTTTCCAGAAAAATGAAGAT
GTTTCTGAAGTTCAGTTGAATGCTTGTGATGTTTTGGCAACACCCTTAGAGACTAAGATTGTCTCTTTTGATCAACGTTTGATAAAATCTATATGGAGCTCTAAAGATGT
TGGTTGGGTTAATGTGAAATCATGGGGAAGATCGGGAGGATTACTGATTTTGTGGGATGAGAGAAAACTGAAAATTGTGGAATTTCTTCAAGGGGATGATTCTTCTCAAT
CTCTTATCGACAGATTTCTGATTTCCAAGGAATGGGATGTGATGTTTGATAATTCTAGAGTCTCCAAACAGGTTCGTACTATTTCTGATCATTTCCCTCTCCTTCTTGAA
GCTGGTTTGTGGAAGTCCAAGAGCCCCAAGAGAATCAACATTCTTTTGTGGATTATGTTAAATGGGAACCTCAATACTTCAGAAGTTCTACAAAAGAAGATGCCAACTCA
TTGCATTTTGCCGTCAGTTTGTGTTCTATGTCTTCGGGATGAGGATTCTCTCAACCATATATTCTTTAGTTGTTCTTACGCCAAATCGGGTTGGTTCAAGTTGTTTTCGA
TATTCAATCAGCATTGGGTTTTTTCTAATAACTTCCGTTGCAACATTTATCAACTCTTATATGGATTAGCTCTTCTAAACTCTAAAGCTAAGTTGCTTTGGATTAATGCG
GTCAAAGCTTTTCTTTCAGAATTATGGTTGCAGCGAAATCAAAGATTTTTTCAGAACAAGTATCTTCCTTGGATTGATCGTTTTGAAGCTTCTCGTTTAAAGGCATCAAC
GTGGTGTTCTTTGTCCAAATTGTTCATGGGATTCTCTCTTCAGGACATATGTTTGAATTGGAACGTGTTTATATATCCTTCATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGTTTTCTTCATCACAATCACTGTCTATATAGCCCCTTCTCCAAGCTCTCTTCCCTTTCCTCCTCGTTTAATCTAATGGGTTCTATTCCTTTATGCAAGTCCACTTC
TCTGGCATTCCTGTCACCCATTTCTTCTTCTTCCTCACAGAGGTGTTTCCTGTACAGAAGCAATCCTCTTCTCCATAGGTCGTTTCCTGTACAACCCATGTCCCATGGCA
AACCCTTTTCAATGAAACCAAATGGGGTTTCTTCTTTTACTTCTGATGCCAATGTTCCCAGACCTCCAGCCTTTATGGAGAATCCACTAGACGAGGCTTTAATCTCGACT
CAATTGAATGTTCTGAGAAAAAAATTGGAGGAACTTGACATTGACACCGAATCTTGTGTTCCAGGGCAGACAAACCACTTGCTTTGTCCAATGTGCAAAGGTGGCGATTC
AGGTGAACGGTCCTTCTCCCTATTTATCTCAGAAGATGGGGGGGCTGCTGTTTGGATGTGTTTTCGTGCAAAGTGTGGTTGGAAGGGCCGTACTCTGGTATTAATCATAA
TTCAATTTACTATGTTCCTATTTGAGGTTCATTTTGAACCTTTAACTGGCGCCCTTTCACCTAGGGCCTTTGCTGATGGTCGGTCATCATATCGAAGTTTAGGACAAGTT
ACACTTAACCAGAAGATCCAGAACAAACGGAAAATTACAGTGGAGAGTCTACAACTTGAACCACTGTGCGATGAGTTGGTTGCTTATTTTGCTGAGCGATTGATCTCCAA
GAGCACATTGTTAAGAAATTCAGTTATGCAGAAAAGATCCAATAATCAGGCGAGACCCTTTACAGACTGTAGGGAGATGATCCAGGAGTTCCTCCTCCATCCGCCTTTTC
GCGAAGGGGAGGTTTTTATGATTGCTATTGCATTTACATATTATCGACGTGGAGCATTGATTAGTTGCAAGTATCGTGATGTCAACAAAAAGTTCTGGCAGGAGGCAAAT
ACTGAGAAAATATTTTATGGATTGGATGACATAGATGGCACAAGTGATATCATCATTGTTGAAGGGGAGATAGACAAGCTTTCAATGGCAGAAGCTGGTTTCCATAATTG
CGTGAGTGTTCCAGATGGTGCACCACCATCAGTTTCCCAAAAGGACGTACCTCCTACAGATCAGGATACGAAGTATCAGTATTTATGGAACTGCAAAGATTACTTGAGTA
AGGCATCACGCATTATCCTTGCTACTGATGGTGATCCTCCTGGTCACGCTTTAGCAGAGGAGATTGCACGTCGTGTTGGAAGGGAAAGATGTTGGAGGGTCAAATGGCCA
AAAAAAAATGAGGTCGATCATTTCAAAGATGCAAATGAGAAATTAAGAGATTCAAATGGAACAATTCAGTTATCAAAGTTCAATTCCCAGCAAGGTTGGTTCCTTGAATG
CTCTGTTTGGCCCCTCTATGGTGGAAAGAAAAGAGTGCAAGTTCCTGTTGGTTACGCCAAGAATGGTTGGTCAATTTTATGGGAAATGATAAGAGACTTTCTTTTGAAAT
TTGTGGAAACAAAGTATGTTGAGAATATTTCAAAGAAGTCTAATTATGAAGAATCATCCAAGCCGGTTATTAATACTAGTAAGAATTTGGATAGAAGTTATGCTGATGTA
GTGAAGGTTAAATCTGGGGAACCTCATCTTGGTTCTCATTCTAAGAAGCTGCCAGTAATGTCTTCATTTTGGGTTAGAAAAGAAAAAGAAGTGGTGGATTTAAAGCTAGA
TGAATTTTGTGTGGTATCTAGAATGTTTGCACATAATACTTGGAAGGAAGTAAAGCAAGTTTTGGAAGATTATTTTCATTCTAAAGTTTTACTCAACCCTTTTATGGCAG
ATAAAGCTTTGGTAAAACTGAATGATAGCTTTTCTGAGCTGAAGTTTGATGGCAAGTGGAAACTTATTGGAAAATTTCATTTGAAAATTGAAAATTGGTCGTGTCACAAG
CATTTTCATCCTGAGGTGATTGAAGGATATGGAGGTTGGATAGCTTTGAAGAACTTACCTTTGCCATTCTGGAATCGTTACATTTTTGAAATCATTGGACATCACTTTGG
TGGATTGGTTAGCATTTCTTCTCAAACATTGAATCTTTTGGATTGTTCAGAAGCTCGTATAGAGGTGAAAATGAATTTTTGTGGATTTCTACCAGTTGAAATTGTGGTTA
CGGACAAGATTCATGGAAACTTTGCTCTTCGTTTTGGTGATATATCTTCTTTGGACCCTCCACATTTTATTCCTATGGATTTATCATTGAGTGATTTTGATAATGAAATT
GATTTAAAAAGAGTTTCTCAGGTTATGATGGATGAAAGATTTTCCTCATCACAAGAAGATCTTAATTCCTTCAATGATCAAGGATTAAACCTTCTAGCCTTGCAAACTCG
GGTAAATCAAGAAAACTTTTTATCTTCTAAAGATCCCAACGACATTATTGATGATTCAATGGAAATATTGAATGATGGAGGTTCAAATTTAGAAAAGTCGGTTGAGTTAT
CAAAGGAAAAGGCTGAGTTATTACAAGAAAATGCCTTTAATGACTTGGTTTCACGTTCCAAGGAAGCATTAATTGACGATGTTAATTGTAATTTAATTGGGCCGGTTGAG
TTTTCAAAGGAGAAGAGTGCTTTGTTGCAAGAGAAGGATTTTAATGCCAACGGTAAGGTTATTAATGCCATCGTTTCAGATATTAGTGAAGCATTAACTAATGGAGCTTT
GCATGAGTCCCAGGGTTTATTATTCACGCCTATTCATGACCCACCTTTGGGTTTGAAGAGTTGTAATGCAGCTGGTTTGGAAGAAGATGAACCGATTGTTTCTAAGGCTT
TAAAGAAGCAATATGAATCATTTCCTCTTCATTATTCTCGAAGGAAATATGAAAAGTCAGAAATTTTGGACTCAATTCCCATTAATTCCAATTATAATCCTGATGTTATT
GAAGTATCTTGTTCTCAATTTTTGCTCCCTGCTTTGAATCAGCCTAGGTGCTGTCAAACTAATCTTAATGAGTTATCAAATTCCACATCATCCAATCAGTACATTCTTTC
AAACATTCAATCTAACCTTTCTTTAACAAAGGGAGTTTTTATTCCTTCATCCAAAGTTGATCAATCATATTCATCTCCTATTGATTCCGATGATGATTCAGTGGTGAGTA
TTAGTAGTGTCGAGGCTGAAAATCAGTATTTGAATGATGAAAACAATGAATTATTGGAGGAAGACTCTTTTGCACTGGCTTTTAATCGGATTTTCCAGAAAAATGAAGAT
GTTTCTGAAGTTCAGTTGAATGCTTGTGATGTTTTGGCAACACCCTTAGAGACTAAGATTGTCTCTTTTGATCAACGTTTGATAAAATCTATATGGAGCTCTAAAGATGT
TGGTTGGGTTAATGTGAAATCATGGGGAAGATCGGGAGGATTACTGATTTTGTGGGATGAGAGAAAACTGAAAATTGTGGAATTTCTTCAAGGGGATGATTCTTCTCAAT
CTCTTATCGACAGATTTCTGATTTCCAAGGAATGGGATGTGATGTTTGATAATTCTAGAGTCTCCAAACAGGTTCGTACTATTTCTGATCATTTCCCTCTCCTTCTTGAA
GCTGGTTTGTGGAAGTCCAAGAGCCCCAAGAGAATCAACATTCTTTTGTGGATTATGTTAAATGGGAACCTCAATACTTCAGAAGTTCTACAAAAGAAGATGCCAACTCA
TTGCATTTTGCCGTCAGTTTGTGTTCTATGTCTTCGGGATGAGGATTCTCTCAACCATATATTCTTTAGTTGTTCTTACGCCAAATCGGGTTGGTTCAAGTTGTTTTCGA
TATTCAATCAGCATTGGGTTTTTTCTAATAACTTCCGTTGCAACATTTATCAACTCTTATATGGATTAGCTCTTCTAAACTCTAAAGCTAAGTTGCTTTGGATTAATGCG
GTCAAAGCTTTTCTTTCAGAATTATGGTTGCAGCGAAATCAAAGATTTTTTCAGAACAAGTATCTTCCTTGGATTGATCGTTTTGAAGCTTCTCGTTTAAAGGCATCAAC
GTGGTGTTCTTTGTCCAAATTGTTCATGGGATTCTCTCTTCAGGACATATGTTTGAATTGGAACGTGTTTATATATCCTTCATAA
Protein sequenceShow/hide protein sequence
MRFLHHNHCLYSPFSKLSSLSSSFNLMGSIPLCKSTSLAFLSPISSSSSQRCFLYRSNPLLHRSFPVQPMSHGKPFSMKPNGVSSFTSDANVPRPPAFMENPLDEALIST
QLNVLRKKLEELDIDTESCVPGQTNHLLCPMCKGGDSGERSFSLFISEDGGAAVWMCFRAKCGWKGRTLVLIIIQFTMFLFEVHFEPLTGALSPRAFADGRSSYRSLGQV
TLNQKIQNKRKITVESLQLEPLCDELVAYFAERLISKSTLLRNSVMQKRSNNQARPFTDCREMIQEFLLHPPFREGEVFMIAIAFTYYRRGALISCKYRDVNKKFWQEAN
TEKIFYGLDDIDGTSDIIIVEGEIDKLSMAEAGFHNCVSVPDGAPPSVSQKDVPPTDQDTKYQYLWNCKDYLSKASRIILATDGDPPGHALAEEIARRVGRERCWRVKWP
KKNEVDHFKDANEKLRDSNGTIQLSKFNSQQGWFLECSVWPLYGGKKRVQVPVGYAKNGWSILWEMIRDFLLKFVETKYVENISKKSNYEESSKPVINTSKNLDRSYADV
VKVKSGEPHLGSHSKKLPVMSSFWVRKEKEVVDLKLDEFCVVSRMFAHNTWKEVKQVLEDYFHSKVLLNPFMADKALVKLNDSFSELKFDGKWKLIGKFHLKIENWSCHK
HFHPEVIEGYGGWIALKNLPLPFWNRYIFEIIGHHFGGLVSISSQTLNLLDCSEARIEVKMNFCGFLPVEIVVTDKIHGNFALRFGDISSLDPPHFIPMDLSLSDFDNEI
DLKRVSQVMMDERFSSSQEDLNSFNDQGLNLLALQTRVNQENFLSSKDPNDIIDDSMEILNDGGSNLEKSVELSKEKAELLQENAFNDLVSRSKEALIDDVNCNLIGPVE
FSKEKSALLQEKDFNANGKVINAIVSDISEALTNGALHESQGLLFTPIHDPPLGLKSCNAAGLEEDEPIVSKALKKQYESFPLHYSRRKYEKSEILDSIPINSNYNPDVI
EVSCSQFLLPALNQPRCCQTNLNELSNSTSSNQYILSNIQSNLSLTKGVFIPSSKVDQSYSSPIDSDDDSVVSISSVEAENQYLNDENNELLEEDSFALAFNRIFQKNED
VSEVQLNACDVLATPLETKIVSFDQRLIKSIWSSKDVGWVNVKSWGRSGGLLILWDERKLKIVEFLQGDDSSQSLIDRFLISKEWDVMFDNSRVSKQVRTISDHFPLLLE
AGLWKSKSPKRINILLWIMLNGNLNTSEVLQKKMPTHCILPSVCVLCLRDEDSLNHIFFSCSYAKSGWFKLFSIFNQHWVFSNNFRCNIYQLLYGLALLNSKAKLLWINA
VKAFLSELWLQRNQRFFQNKYLPWIDRFEASRLKASTWCSLSKLFMGFSLQDICLNWNVFIYPS