; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0008652 (gene) of Snake gourd v1 genome

Gene IDTan0008652
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptionprotein FAR1-RELATED SEQUENCE 4-like
Genome locationLG05:47820536..47826780
RNA-Seq ExpressionTan0008652
SyntenyTan0008652
Gene Ontology termsGO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR006564 - Zinc finger, PMZ-type
IPR007527 - Zinc finger, SWIM-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022145820.1 uncharacterized protein LOC111015181 [Momordica charantia]4.9e-3751.57Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC
        AR      LLDHIRG LQ+ +Y RR  A++    ++ YA  +      S RRHVV  +DQF F+V D  L G V+L+  TC CREFDYF+IPCSHAI A 
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC

Query:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
          RNI+P +LC EAY  +SW+ AYA P+  +GHVS W  S EF +  V PPK V RVGR
Subjt:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

XP_022153146.1 uncharacterized protein LOC111020715 [Momordica charantia]2.4e-3651.57Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC
        AR      LLDHIRG LQ+ +Y RR  AS+    ++ YA   L     ++RRHVV  +DQF  +V DG L G V+ ++RTC CREFDYF+IPCSHAI   
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC

Query:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
          RNI+P TLC EAY  +SWV AYA P+  +GHVS W  S +F D  V  P  V RVGR
Subjt:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

XP_022154610.1 uncharacterized protein LOC111021833 [Momordica charantia]1.9e-3650.31Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC
        AR      LLDHIRG L + +Y RR  A++    ++ YA  +      S+RRHVV  +DQF F+V DG   G V+L+  TC+CREFDYF+IPCSH I A 
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC

Query:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
          RNI+P +LC EAY  +SW+ AYA P+  +GHVS W  S EF +  V PPK V RVGR
Subjt:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

XP_022154964.1 protein FAR1-RELATED SEQUENCE 4-like [Momordica charantia]3.2e-3648.02Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQIL-----EARRAS-------------SRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCT
        AR      LLDHIRG LQ  +Y+RR  AS+    ++ YA +++      ARR S             +RRH+V  +DQF FEV DG L G V+L ++TCT
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQIL-----EARRAS-------------SRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCT

Query:  CREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
        CREFDYF++PCSHAI A   R+I+P TLC EAY V+SW+ AYA P+  +G  S W+ S  F + +V PPK+V RVGR
Subjt:  CREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

XP_022159268.1 uncharacterized protein LOC111025678 [Momordica charantia]1.4e-3652.32Show/hide
Query:  LLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPI
        LLDHIRG LQ+ +Y RR  A++    ++ YA  +      S+RRHVV  +DQF F+V DG L G V+L+   C+CREFDYF+IPCSHAI A   RNI+P 
Subjt:  LLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPI

Query:  TLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
        +LC EAY  +SW+ AYA P+  +GH+S W  S EF +  V PPK V RVGR
Subjt:  TLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

TrEMBL top hitse value%identityAlignment
A0A6J1CVL4 uncharacterized protein LOC1110151812.4e-3751.57Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC
        AR      LLDHIRG LQ+ +Y RR  A++    ++ YA  +      S RRHVV  +DQF F+V D  L G V+L+  TC CREFDYF+IPCSHAI A 
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC

Query:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
          RNI+P +LC EAY  +SW+ AYA P+  +GHVS W  S EF +  V PPK V RVGR
Subjt:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

A0A6J1DJT1 uncharacterized protein LOC1110207151.2e-3651.57Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC
        AR      LLDHIRG LQ+ +Y RR  AS+    ++ YA   L     ++RRHVV  +DQF  +V DG L G V+ ++RTC CREFDYF+IPCSHAI   
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC

Query:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
          RNI+P TLC EAY  +SWV AYA P+  +GHVS W  S +F D  V  P  V RVGR
Subjt:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

A0A6J1DK35 uncharacterized protein LOC1110218339.1e-3750.31Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC
        AR      LLDHIRG L + +Y RR  A++    ++ YA  +      S+RRHVV  +DQF F+V DG   G V+L+  TC+CREFDYF+IPCSH I A 
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGAC

Query:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
          RNI+P +LC EAY  +SW+ AYA P+  +GHVS W  S EF +  V PPK V RVGR
Subjt:  AFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

A0A6J1DNT3 protein FAR1-RELATED SEQUENCE 4-like1.5e-3648.02Show/hide
Query:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQIL-----EARRAS-------------SRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCT
        AR      LLDHIRG LQ  +Y+RR  AS+    ++ YA +++      ARR S             +RRH+V  +DQF FEV DG L G V+L ++TCT
Subjt:  ARSCQSPGLLDHIRGWLQSKYYKRRNKASAWPHRITKYACQIL-----EARRAS-------------SRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCT

Query:  CREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
        CREFDYF++PCSHAI A   R+I+P TLC EAY V+SW+ AYA P+  +G  S W+ S  F + +V PPK+V RVGR
Subjt:  CREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

A0A6J1DYC4 uncharacterized protein LOC1110256786.9e-3752.32Show/hide
Query:  LLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPI
        LLDHIRG LQ+ +Y RR  A++    ++ YA  +      S+RRHVV  +DQF F+V DG L G V+L+   C+CREFDYF+IPCSHAI A   RNI+P 
Subjt:  LLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPI

Query:  TLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR
        +LC EAY  +SW+ AYA P+  +GH+S W  S EF +  V PPK V RVGR
Subjt:  TLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G49920.1 MuDR family transposase3.7e-0631.82Show/hide
Query:  GRVNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEW
        G V L++ TCTC EF   + PC HA+  C    I+P+    + Y V+ +   Y++    +  +S W
Subjt:  GRVNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEW

AT1G64255.1 MuDR family transposase7.4e-0727.35Show/hide
Query:  HVVRPVDQFEFEVDDGYLGGR--VNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLP
        ++V P+D   F+V      G   V L + +CTC +F  ++ PC HA+  C     +P+    + Y ++     YA+    +  +S W  +S      +LP
Subjt:  HVVRPVDQFEFEVDDGYLGGR--VNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEWEMSSEFQDYEVLP

Query:  PKQVSRVGRLPRPPNRV
        P  V      P PP  V
Subjt:  PKQVSRVGRLPRPPNRV

AT1G64260.1 MuDR family transposase1.3e-0627.84Show/hide
Query:  LEARRASSRRHVVRPVDQFEFEVDDGYLGGR--VNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEW
        LE     S  +V+  +++  F+V +        V L+  TCTCR+F  ++ PC HA+       I+P+    E Y V+ +   YA+    +  V+ W
Subjt:  LEARRASSRRHVVRPVDQFEFEVDDGYLGGR--VNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVDSWVDAYASPVHSLGHVSEW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTGAGTTTCACCCTAATCCAAAACCATGGGCCAAACACTCCCTTAGTATTCCACAAGAGGTTGAGCTTGGAAATAAGAGGAGAAGTTAGAAGCCCAAAGGGAGC
ATGTGTAATGCACACCTCTAACATCATGCCGCGTGTTTGGGTTTGTTTTGGGGGTATATGGAACGATGGAGAAAAGGATTACGATGGTGGCGAGGTTAGGGGACTTGATG
TGGATGTAGAAATTAAATTTGAGGAGTTTTTGGGTCTAGTCTATGAGATAAGTTATACTGACCGGAATGATTTTAATCTTGTAATGAGATGTATATTGCCGTTAAGATCC
AAATCCCCAGCATTTGTTATTAAAAATGATGTAGATCTCAAATGTTTCCTTACTTTGGAAGACGTCTCATCAATTCCACTCTACATATCTACATCTCCTACCTTTTCAAG
AAGTCATGCATTTCAACCCTATTCAATTCCTTGTAAGGTACTAGATAATCCATATATGGGAGTACAAAATTATCCATATGTTTCATCCATGACTACGTCTACAGTTGTCC
CTCATAACACCATAAATGGTCCCTTGGAGGAAGATGATGTGGAGGTTGAAAGGATAAAAACTCATGACGAGTTTCAAGAGGCAGATGCACGGAGTTGCCAATCACCTGGT
TTATTGGATCATATTAGAGGTTGGCTTCAATCTAAATACTATAAACGTCGGAATAAAGCATCTGCATGGCCACATCGAATAACGAAGTATGCTTGCCAAATTCTTGAAGC
CCGAAGAGCTAGTTCAAGGAGACATGTAGTCCGACCAGTTGATCAGTTCGAGTTTGAGGTAGATGATGGTTACCTGGGTGGGCGTGTAAATCTCCATAATAGAACTTGTA
CTTGTCGAGAGTTTGATTACTTTGAAATTCCTTGTTCACATGCAATTGGAGCATGTGCATTCCGTAATATAGACCCAATCACACTATGTTCTGAAGCATATCATGTTGAT
TCATGGGTCGACGCGTATGCAAGTCCTGTACATTCATTAGGTCATGTGTCAGAGTGGGAAATGTCATCTGAATTTCAAGACTACGAAGTGTTACCACCGAAGCAAGTATC
TAGAGTGGGTCGCCTGCCACGACCTCCGAATCGAGTCCGAATCAACCTCCTCTTCGGTTTTTGGTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTGAGTTTCACCCTAATCCAAAACCATGGGCCAAACACTCCCTTAGTATTCCACAAGAGGTTGAGCTTGGAAATAAGAGGAGAAGTTAGAAGCCCAAAGGGAGC
ATGTGTAATGCACACCTCTAACATCATGCCGCGTGTTTGGGTTTGTTTTGGGGGTATATGGAACGATGGAGAAAAGGATTACGATGGTGGCGAGGTTAGGGGACTTGATG
TGGATGTAGAAATTAAATTTGAGGAGTTTTTGGGTCTAGTCTATGAGATAAGTTATACTGACCGGAATGATTTTAATCTTGTAATGAGATGTATATTGCCGTTAAGATCC
AAATCCCCAGCATTTGTTATTAAAAATGATGTAGATCTCAAATGTTTCCTTACTTTGGAAGACGTCTCATCAATTCCACTCTACATATCTACATCTCCTACCTTTTCAAG
AAGTCATGCATTTCAACCCTATTCAATTCCTTGTAAGGTACTAGATAATCCATATATGGGAGTACAAAATTATCCATATGTTTCATCCATGACTACGTCTACAGTTGTCC
CTCATAACACCATAAATGGTCCCTTGGAGGAAGATGATGTGGAGGTTGAAAGGATAAAAACTCATGACGAGTTTCAAGAGGCAGATGCACGGAGTTGCCAATCACCTGGT
TTATTGGATCATATTAGAGGTTGGCTTCAATCTAAATACTATAAACGTCGGAATAAAGCATCTGCATGGCCACATCGAATAACGAAGTATGCTTGCCAAATTCTTGAAGC
CCGAAGAGCTAGTTCAAGGAGACATGTAGTCCGACCAGTTGATCAGTTCGAGTTTGAGGTAGATGATGGTTACCTGGGTGGGCGTGTAAATCTCCATAATAGAACTTGTA
CTTGTCGAGAGTTTGATTACTTTGAAATTCCTTGTTCACATGCAATTGGAGCATGTGCATTCCGTAATATAGACCCAATCACACTATGTTCTGAAGCATATCATGTTGAT
TCATGGGTCGACGCGTATGCAAGTCCTGTACATTCATTAGGTCATGTGTCAGAGTGGGAAATGTCATCTGAATTTCAAGACTACGAAGTGTTACCACCGAAGCAAGTATC
TAGAGTGGGTCGCCTGCCACGACCTCCGAATCGAGTCCGAATCAACCTCCTCTTCGGTTTTTGGTATTAG
Protein sequenceShow/hide protein sequence
MVLSFTLIQNHGPNTPLVFHKRLSLEIRGEVRSPKGACVMHTSNIMPRVWVCFGGIWNDGEKDYDGGEVRGLDVDVEIKFEEFLGLVYEISYTDRNDFNLVMRCILPLRS
KSPAFVIKNDVDLKCFLTLEDVSSIPLYISTSPTFSRSHAFQPYSIPCKVLDNPYMGVQNYPYVSSMTTSTVVPHNTINGPLEEDDVEVERIKTHDEFQEADARSCQSPG
LLDHIRGWLQSKYYKRRNKASAWPHRITKYACQILEARRASSRRHVVRPVDQFEFEVDDGYLGGRVNLHNRTCTCREFDYFEIPCSHAIGACAFRNIDPITLCSEAYHVD
SWVDAYASPVHSLGHVSEWEMSSEFQDYEVLPPKQVSRVGRLPRPPNRVRINLLFGFWY